Gene P9303_03891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_03891 
Symbol 
ID4777044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp391408 
End bp393894 
Gene Length2487 bp 
Protein Length828 aa 
Translation table11 
GC content58% 
IMG OID640085892 
Productputative DNA mismatch repair protein MutS family protein 
Protein accessionYP_001016406 
Protein GI124022099 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.331058 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCCGC TTGGTTACGG CAGGGATGAT TGGCTTCTTT CAAGTTCAGT GGGGCGCAAC 
AGCGTTGAAA TGAGCGCTGT AGTCATGAGC TCTGAGGTCT CGACTGCTTC GCTTGCCTTT
CAGGACACAC TCAAACAGCT TGAATGGCCA CGCTTATGTG AACACCTCGC TGGATTTGCC
AGCACCTTGC AGGGCCGGCG TCATTGCCAG ACTTTGTCGC TGCCTGCCGA TTTGTCTGAC
AGCCGTTTGC GCTTGGCAGA AACTCTTGAG ATCGGTGCTT TGGATGGTCT GATTGATGGA
GGCCTCAGCT TTCAGGGTGT CCATGACCTT GGCCATATCC TGGCTCGCTG TACAAAAGGT
GGTGTGGCTT CTGGGGAAGA GCTTTTAGCT GTTGCCGACA CCCTTGCAGC AGCGCGCCGC
TTGCGCCGCC AGATTAATGA TCCAGAGCTT CGTCCCACGA TCTCGGCGTT GTTGCTGGAT
GTCGCAACGA TGCCGGAGCT GGAACGGCGA CTCAAATTTG CATTGGAAGA GGGTGGTCGG
GTGGCTGATC GCGTCAGCTC CAAGCTCGCT GGCTTGCGAC GACAGTGGCA AGGTTTGCGT
CAGGAACGTC GCGATTGCCT ACAGGAGGTG ATCAGGCGTC ATGCGGCCAT GTTGCAGGAC
ACTGTGATCG CTGATCGCCA TGGCCGGCCA GTGCTAGCGG TGAAGGCCGC TGCAGTGTCT
CAACTCCCCG GCCTGGTCCA CGACAGCTCA GCATCTGGCA GCACAGTCTT TGTTGAGCCT
CAGGTTGTGA TCACCCTGAG TAATCGTCTT GCTGAATTGG ATGGTCATAT CCGAGAGCAG
CAGCAGCTCG TCTTGGCAGA GTTGAGTGCG GCGGTGGCTG AGTCCGGCGT ATCGATTAGT
CGCCTGGGAG AGGTTCTGCT TCAACTCGAC TTGGCTCTGG CTCGTGGCCG TTATGGTCAA
TGGCTTGGTG GGGTCCCCCC TGCGTTGCAT GCAGAGGCTG CAGCTCCTTT CAGCCTTCAG
GAGCTGCGAC ATCCCTTGCT GGTTTGGCAG CATCGCCGTG AGCATGGCGA AGCTGTGGTG
CCGATCAGCG TTGAGGTCTC CTCCACCCTG AAAGTGGTCG CGATTACAGG ACCCAACACC
GGTGGAAAGA CTGTCACGCT TAAGAGTGTT GGCCTGGCCT TGCTGATGGC TCGGGCGGGT
CTCTTGTTGC CTTGCACAGG GAGTCCATCA ATGCCCTGGT GTGCACAGGT GCTTGCTGAT
ATCGGTGATG AACAGTCACT ACAACAGAAT CTCTCCACCT TTAGTGGCCA TGTGAAGCGC
ATTGGCCGCA TTCTTGAAGC CTTGATCGAG GGGCCTGGTC CAGCTTTGGT GCTTCTTGAT
GAGGTCGGGG CGGGGACTGA TCCCAGTGAA GGGACGGCCC TGGCCACAGC CCTGCTGCGC
ACGCTTGCCG ACCGGGCTCG TCTGACCATC GCGACAACTC ATTTCGGCAA ACTCAAGGCC
CTCAAATATG GCGATTCTCG CTTTGAAAAC GCTTCTGTGG CCTTTGATAG CGAGACGATG
TTGCCCACCT ATCGACTGCA ATGGGGAATC CCTGGTCGCA GTAATGCCCT CAGCATCGCG
ATGCGGCTTG GCCTAGACGA CGCAGTGATT GCTCAGGCGC AAGAGCTGCT GGGACCTTGT
GGTGATGGAG AGGTGAATGA GGTGATCCGT GGTCTTGAAG AGCAACGCAG CCTTCAACAG
GCAGCTGCTG AAGATGCCGC GGCTCTGCTG GCACGTACGG AATTGCTGCA TGAGGAACTT
CTCAGTCGCT GGCAGAAGCA GCGCAAGCAA TCTGCAGATT TGCAGGAACA AGGCCGCCAG
AAGCTGGAGA GCTCCATTCG AGAGGGGCAG AAGGAAGTCC GTCAGTTGAT TCGCCGGTTG
CGCGAGGGTC GTGCCGATGG TGAGTCCGCA CGACGTGCAG GTCAGCGCTT GCGTCGCATT
CAGGCTGACC ATCGGATTCA GCCTCAACGC AAACAGCACA TTGGCTGGCG TCCTGAGGTG
GGAGAGCGCA TTCGGCTGTT GGCCCTTGGC AAGGCGGCAG AAGTGATTGC TATCTCTGAA
GATGGCAAAC AACTGACAGT GCGTTGTGGA GTCATGCGCA GCACTGTGGA GTTGTCAGGG
GTGGAAAGCC TTGATGGCCT GAAGCCAAGT CCTCCAGAGT TGGTGGTGAA GGTGAAGGTC
CGTTCAGGTC TGGGCCGTGG CACGGAGGTA CGCACGACCC GCAACACCGT GGATGTTCGC
GGACTGCGGG TGCATGAAGC CGAGGTTGCT GTTGAGGAGC ACTTGCGCAG TAGCACTGGT
CCGATATGGG TCATTCATGG CATCGGCAGC GGCAAACTCA AGCGAGGGCT CCGCCAGTGG
CTTGAGACCG TTCCTTATGT GGAACGCGTA AACGATGCCG ATCAGAGTGA TGGTGGAGCG
GGTTGCAGTG TGATTTGGCT GCATTAG
 
Protein sequence
MQPLGYGRDD WLLSSSVGRN SVEMSAVVMS SEVSTASLAF QDTLKQLEWP RLCEHLAGFA 
STLQGRRHCQ TLSLPADLSD SRLRLAETLE IGALDGLIDG GLSFQGVHDL GHILARCTKG
GVASGEELLA VADTLAAARR LRRQINDPEL RPTISALLLD VATMPELERR LKFALEEGGR
VADRVSSKLA GLRRQWQGLR QERRDCLQEV IRRHAAMLQD TVIADRHGRP VLAVKAAAVS
QLPGLVHDSS ASGSTVFVEP QVVITLSNRL AELDGHIREQ QQLVLAELSA AVAESGVSIS
RLGEVLLQLD LALARGRYGQ WLGGVPPALH AEAAAPFSLQ ELRHPLLVWQ HRREHGEAVV
PISVEVSSTL KVVAITGPNT GGKTVTLKSV GLALLMARAG LLLPCTGSPS MPWCAQVLAD
IGDEQSLQQN LSTFSGHVKR IGRILEALIE GPGPALVLLD EVGAGTDPSE GTALATALLR
TLADRARLTI ATTHFGKLKA LKYGDSRFEN ASVAFDSETM LPTYRLQWGI PGRSNALSIA
MRLGLDDAVI AQAQELLGPC GDGEVNEVIR GLEEQRSLQQ AAAEDAAALL ARTELLHEEL
LSRWQKQRKQ SADLQEQGRQ KLESSIREGQ KEVRQLIRRL REGRADGESA RRAGQRLRRI
QADHRIQPQR KQHIGWRPEV GERIRLLALG KAAEVIAISE DGKQLTVRCG VMRSTVELSG
VESLDGLKPS PPELVVKVKV RSGLGRGTEV RTTRNTVDVR GLRVHEAEVA VEEHLRSSTG
PIWVIHGIGS GKLKRGLRQW LETVPYVERV NDADQSDGGA GCSVIWLH