Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Spro_0429 |
Symbol | mutL |
ID | 5603683 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009832 |
Strand | + |
Start bp | 480246 |
End bp | 482120 |
Gene Length | 1875 bp |
Protein Length | 624 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640935936 |
Product | DNA mismatch repair protein |
Protein accession | YP_001476665 |
Protein GI | 157368676 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000237281 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000886041 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCCATCC AGGTGTTACC GCCCCAGCTT GCCAACCAGA TTGCCGCCGG TGAAGTGGTC GAGCGGCCCG CGTCGGTGGT CAAGGAATTG GTGGAAAACA GTCTGGATGC CGGGGCGACG CGTATTGATA TTGATATCGA GCGTGGTGGC GCCAAGCTGA TCCGCATTCG TGATAACGGC AGTGGTATTG GCAAGGACGA CCTGGCTCTG GCATTGGCTC GTCACGCCAC CAGTAAAATC AGCACGCTCG ACGATTTGGA AGCCATTGTC AGCCTCGGCT TTCGCGGCGA GGCGTTGGCC AGCATCAGCT CGGTTTCTCG CTTAACCCTC ACTTCACGTA CCGCAGAACA AAACGAAGCC TGGCAGGCTT ATGCCGAAGG CCGCGATCAG GCGGTGACGG TCAAGCCGGC GGCACACCCG ATAGGCAGCA CGCTGGAAGT GCTGGATCTG TTCTACAACA CCCCGGCGCG GCGCAAATTT ATGCGCACCG AGAAAACCGA ATTTGGCCAT ATCGATGAAG TGGTGCGACG TATTGCACTG GCGCGTTTCG ATGTGGCGAT CAACCTTAGT CACAATGGCA AGCTGATGCG TCAATATCGC GCAGCGAAAG ACGAGAGCCA GTATGAGCGC CGTCTGGGCA GTATTTGCGG CCCGGCCTTT TTGCAGCATG CGCTGAACAT CTCCTGGCAG CACGGTGACC TGACCATTCG CGGCTGGGTG GCCGATCCTG CCGGTGCGCG GCAACTGGGC GAAATGCAGT ATTGCTACGT CAACAGCCGC ATGATGCGCG ATCGTTTGAT CAATCACGCT ATCCGCCAGG CTTATCAGGA TCAACTGAAA GACGACCAGC AGCCTGCCTA TGTGCTGTAT CTCGAGGTAG ACCCGCATCA GGTGGATGTG AATGTTCACC CGGCCAAGCA CGAGGTGCGT TTCCATCAGG CTCGGCTGGT GCACGACTTT ATCTATCAGG CGGTAACCAC CGTGTTGCAG CAGGTCGGCA ATGCGCCGCT GCCGTTGACC GATGAAACCG AGCAGCAACC GACACCGGTC TGGCAGCCGG AAAACCGCGT CGCCGCTGGC GGCAACCATT TTTCGCAGCC TGCACCGCGC CGAGAAACCG CATCAACCGA GCCTGCCGTC GCGCGTGAAC GTGCGCCGCA ACCGGCCTAT CATTCGGGCA GTGGTTACCA GAAGCGGGAA GGTGAGCTGT ACGGCAAGCT GTTGCAGGCC ACGCCAGTGG CTGAGCCACG GCAAGAAGCA CCAAAGCAAC CGCTGTTTCC ACCGGTAAAA ACCGAGCAGG AAACGCCACT GGCCGGGAGT CAGCACAGTT TCGGCCGTGT GCTGATGATC TACCCGCCGT GTTATGCGCT GATTGAAAAC GGTCAGCAGT TGATGTTGCT TAACCTGCCG GTGGCCGAAC GCTGGTTACG TCAGGCGCAA CTTAATCCTT CGCAAGAAGG CCTGCGGCCA CAGCCGCTGC TGATCCCCAT CAAGCTGACG TTGAACAAAC AAGAGGCGGC AGCCTGCATA CATCATCAGC CGCTATTGGT AACAATGGGG TTGGATCTGC AAGTAGATCA CGGGCGTGTG ACGTTGCGCG CAGTACCTTT ACCATTACGC CAACAAAATT TACAAAAACT GATACCCGAA CTGTTAGGCT ATCTGGCCGA GCATCAGGAG ATGTCGCCCG CGGTATTGGC CACCTGGTTT GCCCGCCATT TAGGTAGCGA ACATGAACAG TGGAACACCT CGCAAGCGAT ACAATTGCTG ACCGACGTTG AACGACTTTG CCCGCAGCTG GTCAAATCAC CACCCAGCGG ACTTTTACAA CCTGTTGATT TACAGGCTGC ACTGACAGCA CTTAGGCATG ATTGA
|
Protein sequence | MPIQVLPPQL ANQIAAGEVV ERPASVVKEL VENSLDAGAT RIDIDIERGG AKLIRIRDNG SGIGKDDLAL ALARHATSKI STLDDLEAIV SLGFRGEALA SISSVSRLTL TSRTAEQNEA WQAYAEGRDQ AVTVKPAAHP IGSTLEVLDL FYNTPARRKF MRTEKTEFGH IDEVVRRIAL ARFDVAINLS HNGKLMRQYR AAKDESQYER RLGSICGPAF LQHALNISWQ HGDLTIRGWV ADPAGARQLG EMQYCYVNSR MMRDRLINHA IRQAYQDQLK DDQQPAYVLY LEVDPHQVDV NVHPAKHEVR FHQARLVHDF IYQAVTTVLQ QVGNAPLPLT DETEQQPTPV WQPENRVAAG GNHFSQPAPR RETASTEPAV ARERAPQPAY HSGSGYQKRE GELYGKLLQA TPVAEPRQEA PKQPLFPPVK TEQETPLAGS QHSFGRVLMI YPPCYALIEN GQQLMLLNLP VAERWLRQAQ LNPSQEGLRP QPLLIPIKLT LNKQEAAACI HHQPLLVTMG LDLQVDHGRV TLRAVPLPLR QQNLQKLIPE LLGYLAEHQE MSPAVLATWF ARHLGSEHEQ WNTSQAIQLL TDVERLCPQL VKSPPSGLLQ PVDLQAALTA LRHD
|
| |