Gene NATL1_02951 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_02951 
Symbol 
ID4780110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp273492 
End bp275906 
Gene Length2415 bp 
Protein Length804 aa 
Translation table11 
GC content38% 
IMG OID640083560 
ProductDNA mismatch repair protein MutS family protein 
Protein accessionYP_001014124 
Protein GI124025008 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0619291 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATTAA CAAAAAATCA TGATGATTCT AAAAAGGCAC AAATAATATC AGAGTCTTTG 
GATTTGCTTG AATGGCCAAC TGTTTGTAGC CATTTGTCTA CATTCGCTCT TACTCAACAA
GGTCGTAAAA AATGTGAAAG CTTTGATTTG CCACGAAATC TATCTTTAAG CCAAGAGCTA
TTGTCCCAAA CATTAGAAAT TGGGTCATTA GATAGTTCTC TTAATGAAGG AATATCTTTT
GATGGTGTTC ATGATTTGGA AAATATACTT TTGATATGCT CCAAAGGAGG TATTGCTATT
GGTGAGGATT TATTAAAAGT AGCTGATACT TTAAGAGCTG CTAGAAAATT ACGAAAACTA
ATATTTGATC AAGTGATACG TCCACGACTT TCTGAATTGC TCAAAGATGT TGCAACTTTG
CCAGATTTGC AAAAACTCCT CGAATTCGGG CTTGATGAAG GTGGGCGAAT TGCAGATCGT
GCTAGCCCAA AGCTTTCTGA ATTACGACGT TATAGAAATT CCGTACGTCT TCAAAGAAAA
GATATTCTAC AAGATATCAT CCGGAAATAT GGTGGATTAC TTCAAGATAA TATTATTTCA
GAGAGGTATG GACGACCTGT TTTAGCGTTT AAGGCTGGGA CTTCTGATCA AATTAAAGGA
ATGGTTCATG ATAGTTCGGC CTCTGGAAAC ACGATATATG TCGAGCCCCA AGTTGTCATA
TCAATAGGAA ATCGTTTAGC TAAGATAGAT TCTGAAATCT CAGATGAAGA GAGGAGACTT
TTAGCTGATT GGAGTAAAGA GGTTGGTCTT AATGCAATTG TAATAGCTCA TTTAGTAGAG
ATCCTTTTGC AAATTGAGTT TGCATTGTCT CGAGCACGTT ATTCTAAATG GCTTAATGGG
GTCCCTGCAA TTCTTGATCA AGAAGAACAT TCACTCTTTG AGATCAAAGA TTTTCGTCAT
CCTTTATTAG TATGGAATGA CTTCCATGAG AAAAAGAATA CAGTAGTTCC AACTAGTTTT
GATGTCGCCC CTGATTTAAA AGTTGTTGCG ATTACAGGCC CTAATACTGG AGGGAAAACA
GTTGCTTTGA AAAGTATTGG TTTAGCAGTT TTAATGGCAA AAGCGGGGTT GCTTTTGCCA
TGTACAGGCT CACCAAGATT ACCATGGTGT AAAAATGTTT TCGCTGATAT TGGTGATGAG
CAATCTTTAC AGCAAAATTT GTCTACATTT AGCGGACATA TTCTTCGTAT AAGTCGAATA
CTCGACGCTA TAGATGTGTT CCCTGGTACG ACTCTCGTTC TTTTAGATGA AGTTGGAGCT
GGAACTGATC CAACTGAAGG CACAGCATTG GCCATGGCAC TCCTACAGGT AATGGCTAAT
AGAGCAAGAT TAACTATCGC GACTACTCAT TTTGGACAAT TAAAAGCGCT CAAATATAGT
GATTCAAGAT TTGAAAATGC TTCAGTTTCT TTTGATAGTG AAACTATACA ACCAACTTTT
CATTTGCAAT GGGGAATTCC TGGTCGAAGT AATGCAATTG AAATTTCAAA GAGACTTGGT
CTCGATGAGC AAGTAATCAT AAGTGCTCAA AAATTTATCA ATCCTGAAAG GGTTGATAAT
GTTAATCAAG TTATTCAAGG CTTAGAAAAA CAACGCGAGC GTCAGCAATC AGCAGCTGAA
GATGCTGCTG CATTATTGGC TAAAACCGAA TTACTGCATG AGGAATTACT TAATAGTTGG
CAGAAACAAC GTCAACAATC GGAAGAGTTT AATGAGCAAG GAAGGTTCAA ATTGGAGTCA
TCAATTCGTG AAGGTCAAAA AGAAGTTAGA CATTTAATTA AACGTTTGCG CGATCAAAAC
GCTAGTGGTG AGACAGCAAG AATTGCCGGT CAACGATTAC GGCAAATAGA AAAGGGATAT
CGAAACGACA AGCGAATTAA CCACACACAG AGTTGGACCC CAAAGATTGG GGAAAAAGTT
AGATTGTCTT CTATTGGTAA AGCAGGTGAA ATAATTTCTT TTTCAGATGA TGGAATGCAA
TTAACGGTGC TATGCGGAGT ATTTCGAAGC AAAGTCAATT TAACCGAAGT TGAAAGTCTT
GATGGTCAAA AGGTCGAAAT AAACCAAAAT GTGCAAGTAA AAACTTCGCA GGTAAGAAAG
AATTTATCTT TAGTAAGAAC TAAAAAAAAT ACCTTAGATG TAAGAGGGTT ACGCGTTCAT
GAAGCCGAGG GGGTAATTGA AGAAAAATTG AGAAATTGTT CCGGAGCTTT ATGGGTTATT
CATGGAATTG GTTCTGGAAA ACTGAAAAAA GGTTTGAGGA AATGGTTTGA TTCACTTCCA
TATATTGAAA AAGTAGCCGA TGCTGAACCT CATGATGGCG GCCCTGGATG TAGCGTTGTG
TGGATGGTTG ATTGA
 
Protein sequence
MGLTKNHDDS KKAQIISESL DLLEWPTVCS HLSTFALTQQ GRKKCESFDL PRNLSLSQEL 
LSQTLEIGSL DSSLNEGISF DGVHDLENIL LICSKGGIAI GEDLLKVADT LRAARKLRKL
IFDQVIRPRL SELLKDVATL PDLQKLLEFG LDEGGRIADR ASPKLSELRR YRNSVRLQRK
DILQDIIRKY GGLLQDNIIS ERYGRPVLAF KAGTSDQIKG MVHDSSASGN TIYVEPQVVI
SIGNRLAKID SEISDEERRL LADWSKEVGL NAIVIAHLVE ILLQIEFALS RARYSKWLNG
VPAILDQEEH SLFEIKDFRH PLLVWNDFHE KKNTVVPTSF DVAPDLKVVA ITGPNTGGKT
VALKSIGLAV LMAKAGLLLP CTGSPRLPWC KNVFADIGDE QSLQQNLSTF SGHILRISRI
LDAIDVFPGT TLVLLDEVGA GTDPTEGTAL AMALLQVMAN RARLTIATTH FGQLKALKYS
DSRFENASVS FDSETIQPTF HLQWGIPGRS NAIEISKRLG LDEQVIISAQ KFINPERVDN
VNQVIQGLEK QRERQQSAAE DAAALLAKTE LLHEELLNSW QKQRQQSEEF NEQGRFKLES
SIREGQKEVR HLIKRLRDQN ASGETARIAG QRLRQIEKGY RNDKRINHTQ SWTPKIGEKV
RLSSIGKAGE IISFSDDGMQ LTVLCGVFRS KVNLTEVESL DGQKVEINQN VQVKTSQVRK
NLSLVRTKKN TLDVRGLRVH EAEGVIEEKL RNCSGALWVI HGIGSGKLKK GLRKWFDSLP
YIEKVADAEP HDGGPGCSVV WMVD