Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_03891 |
Symbol | |
ID | 4777044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 391408 |
End bp | 393894 |
Gene Length | 2487 bp |
Protein Length | 828 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640085892 |
Product | putative DNA mismatch repair protein MutS family protein |
Protein accession | YP_001016406 |
Protein GI | 124022099 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01069] MutS2 family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.331058 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCCGC TTGGTTACGG CAGGGATGAT TGGCTTCTTT CAAGTTCAGT GGGGCGCAAC AGCGTTGAAA TGAGCGCTGT AGTCATGAGC TCTGAGGTCT CGACTGCTTC GCTTGCCTTT CAGGACACAC TCAAACAGCT TGAATGGCCA CGCTTATGTG AACACCTCGC TGGATTTGCC AGCACCTTGC AGGGCCGGCG TCATTGCCAG ACTTTGTCGC TGCCTGCCGA TTTGTCTGAC AGCCGTTTGC GCTTGGCAGA AACTCTTGAG ATCGGTGCTT TGGATGGTCT GATTGATGGA GGCCTCAGCT TTCAGGGTGT CCATGACCTT GGCCATATCC TGGCTCGCTG TACAAAAGGT GGTGTGGCTT CTGGGGAAGA GCTTTTAGCT GTTGCCGACA CCCTTGCAGC AGCGCGCCGC TTGCGCCGCC AGATTAATGA TCCAGAGCTT CGTCCCACGA TCTCGGCGTT GTTGCTGGAT GTCGCAACGA TGCCGGAGCT GGAACGGCGA CTCAAATTTG CATTGGAAGA GGGTGGTCGG GTGGCTGATC GCGTCAGCTC CAAGCTCGCT GGCTTGCGAC GACAGTGGCA AGGTTTGCGT CAGGAACGTC GCGATTGCCT ACAGGAGGTG ATCAGGCGTC ATGCGGCCAT GTTGCAGGAC ACTGTGATCG CTGATCGCCA TGGCCGGCCA GTGCTAGCGG TGAAGGCCGC TGCAGTGTCT CAACTCCCCG GCCTGGTCCA CGACAGCTCA GCATCTGGCA GCACAGTCTT TGTTGAGCCT CAGGTTGTGA TCACCCTGAG TAATCGTCTT GCTGAATTGG ATGGTCATAT CCGAGAGCAG CAGCAGCTCG TCTTGGCAGA GTTGAGTGCG GCGGTGGCTG AGTCCGGCGT ATCGATTAGT CGCCTGGGAG AGGTTCTGCT TCAACTCGAC TTGGCTCTGG CTCGTGGCCG TTATGGTCAA TGGCTTGGTG GGGTCCCCCC TGCGTTGCAT GCAGAGGCTG CAGCTCCTTT CAGCCTTCAG GAGCTGCGAC ATCCCTTGCT GGTTTGGCAG CATCGCCGTG AGCATGGCGA AGCTGTGGTG CCGATCAGCG TTGAGGTCTC CTCCACCCTG AAAGTGGTCG CGATTACAGG ACCCAACACC GGTGGAAAGA CTGTCACGCT TAAGAGTGTT GGCCTGGCCT TGCTGATGGC TCGGGCGGGT CTCTTGTTGC CTTGCACAGG GAGTCCATCA ATGCCCTGGT GTGCACAGGT GCTTGCTGAT ATCGGTGATG AACAGTCACT ACAACAGAAT CTCTCCACCT TTAGTGGCCA TGTGAAGCGC ATTGGCCGCA TTCTTGAAGC CTTGATCGAG GGGCCTGGTC CAGCTTTGGT GCTTCTTGAT GAGGTCGGGG CGGGGACTGA TCCCAGTGAA GGGACGGCCC TGGCCACAGC CCTGCTGCGC ACGCTTGCCG ACCGGGCTCG TCTGACCATC GCGACAACTC ATTTCGGCAA ACTCAAGGCC CTCAAATATG GCGATTCTCG CTTTGAAAAC GCTTCTGTGG CCTTTGATAG CGAGACGATG TTGCCCACCT ATCGACTGCA ATGGGGAATC CCTGGTCGCA GTAATGCCCT CAGCATCGCG ATGCGGCTTG GCCTAGACGA CGCAGTGATT GCTCAGGCGC AAGAGCTGCT GGGACCTTGT GGTGATGGAG AGGTGAATGA GGTGATCCGT GGTCTTGAAG AGCAACGCAG CCTTCAACAG GCAGCTGCTG AAGATGCCGC GGCTCTGCTG GCACGTACGG AATTGCTGCA TGAGGAACTT CTCAGTCGCT GGCAGAAGCA GCGCAAGCAA TCTGCAGATT TGCAGGAACA AGGCCGCCAG AAGCTGGAGA GCTCCATTCG AGAGGGGCAG AAGGAAGTCC GTCAGTTGAT TCGCCGGTTG CGCGAGGGTC GTGCCGATGG TGAGTCCGCA CGACGTGCAG GTCAGCGCTT GCGTCGCATT CAGGCTGACC ATCGGATTCA GCCTCAACGC AAACAGCACA TTGGCTGGCG TCCTGAGGTG GGAGAGCGCA TTCGGCTGTT GGCCCTTGGC AAGGCGGCAG AAGTGATTGC TATCTCTGAA GATGGCAAAC AACTGACAGT GCGTTGTGGA GTCATGCGCA GCACTGTGGA GTTGTCAGGG GTGGAAAGCC TTGATGGCCT GAAGCCAAGT CCTCCAGAGT TGGTGGTGAA GGTGAAGGTC CGTTCAGGTC TGGGCCGTGG CACGGAGGTA CGCACGACCC GCAACACCGT GGATGTTCGC GGACTGCGGG TGCATGAAGC CGAGGTTGCT GTTGAGGAGC ACTTGCGCAG TAGCACTGGT CCGATATGGG TCATTCATGG CATCGGCAGC GGCAAACTCA AGCGAGGGCT CCGCCAGTGG CTTGAGACCG TTCCTTATGT GGAACGCGTA AACGATGCCG ATCAGAGTGA TGGTGGAGCG GGTTGCAGTG TGATTTGGCT GCATTAG
|
Protein sequence | MQPLGYGRDD WLLSSSVGRN SVEMSAVVMS SEVSTASLAF QDTLKQLEWP RLCEHLAGFA STLQGRRHCQ TLSLPADLSD SRLRLAETLE IGALDGLIDG GLSFQGVHDL GHILARCTKG GVASGEELLA VADTLAAARR LRRQINDPEL RPTISALLLD VATMPELERR LKFALEEGGR VADRVSSKLA GLRRQWQGLR QERRDCLQEV IRRHAAMLQD TVIADRHGRP VLAVKAAAVS QLPGLVHDSS ASGSTVFVEP QVVITLSNRL AELDGHIREQ QQLVLAELSA AVAESGVSIS RLGEVLLQLD LALARGRYGQ WLGGVPPALH AEAAAPFSLQ ELRHPLLVWQ HRREHGEAVV PISVEVSSTL KVVAITGPNT GGKTVTLKSV GLALLMARAG LLLPCTGSPS MPWCAQVLAD IGDEQSLQQN LSTFSGHVKR IGRILEALIE GPGPALVLLD EVGAGTDPSE GTALATALLR TLADRARLTI ATTHFGKLKA LKYGDSRFEN ASVAFDSETM LPTYRLQWGI PGRSNALSIA MRLGLDDAVI AQAQELLGPC GDGEVNEVIR GLEEQRSLQQ AAAEDAAALL ARTELLHEEL LSRWQKQRKQ SADLQEQGRQ KLESSIREGQ KEVRQLIRRL REGRADGESA RRAGQRLRRI QADHRIQPQR KQHIGWRPEV GERIRLLALG KAAEVIAISE DGKQLTVRCG VMRSTVELSG VESLDGLKPS PPELVVKVKV RSGLGRGTEV RTTRNTVDVR GLRVHEAEVA VEEHLRSSTG PIWVIHGIGS GKLKRGLRQW LETVPYVERV NDADQSDGGA GCSVIWLH
|
| |