Gene NATL1_20941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_20941 
Symbol 
ID4779328 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1735572 
End bp1736948 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content31% 
IMG OID640085390 
Producthypothetical protein 
Protein accessionYP_001015914 
Protein GI124026799 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1134] ABC-type polysaccharide/polyol phosphate transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.140726 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAATA TTGACGCAAA TGAATTGCAA AAACTATATA TCGCCTATTT TGGAAGACCT 
GGAGATCCAT CTGGCATAAA TTATTGGCTT TCACGTTCTA ATCAATCACT TGATCTAACG
GGAATATCAA ATGAATTATC TATGCAAGAT GAATATATTA AATATACTTC TTATGATAAA
TCTTTTGATT TTAAGATTAA CAAATTATAT TTGAATTTAT ACAATAGAAA AGCTGATTTT
GATGGTTTAA ATTATTGGAT GAAGATGACT AATAGTAAGG AATATAAGAT TTCAGATATT
GTATATGAAT TAGTTTTTTC TTCTAGTAAG CCTTATTCGG TTGACTTAAA GCAAGAGAAA
AAGGATAGTC ATATTCTTCA AAATAAAATT TTTGCGGCAG AACTATTTAC TAAGCAAATT
AGTAAAAGTA TTACTTTAAT TAATTTATAT AAACCCGATT CAATATCTCC TTGGATTTCA
GGTAACTCAT TTATAAGAGT ATCTAATTTC TTCAGTCATA TTAATGAGAA GACGGTCTCA
ATAGATCATA TAAATAATTT TATTGCTTCT TTATCTGACA CTCCAGTAAG CATACTCAAT
GAGCCAGCTA TAGAAATTAA AGATACTTCA CTTTCTATTC CTATTTATCA AACAGAAAAT
AGATCTTTTG CAAAAAAAAT TACAAAAAAT GTAATAAATA TTACAGGTGG AGCTCTTAGG
AAGTCTAAAA ATAAGACTAG TATTATTGCA CTAAATAATA TTAATTTGAC AATCATGAAG
GGAGAGAGAG TCGGACTGAT TGGTCATAAT GGGTCCGGAA AAAGTAGTTT TTTGAGGTTG
ATTTCAGGTA TTTATATCCC CACGAGTGGA AATATAAATG TTTTAGTAGA TGTGTATCCT
ATGCTCCAAA AAACTTTTCT GACTAGTACA GAGCTATCTG GAATCGATGC TTGTAAAGCG
CATTATTTAC TTAAAAACCA TAGCCTTGAT GGCTTTGAAT CTTTCTTGAA TGAAATAACA
GAATTTTCGG GCCTTGGCTC ATATATCTCT TTACCTATAA AAACTTACAG TGAGGGTATG
TCTGCAAGGT TGGTTTTTTC AATTCTTACT TCAACTCCTC ATGAATGTTT AGCAATAGAT
GAAGGTTTCG GTACTGGTGA CGCAGACTTT TGTGATAGAG CTGAAGAAAG AATGAAACAA
TTTATGGAAT CCGCCGCAAC CTTATTTTTG GCTAGTCATT CGGAGGAACT TTTAAAACAG
TTTTGTAATA GAGGTATTGT TTTTAGCCAT GGATCAATTG TATATGATGG CCCTTTAGAT
GCTGCCTTGA ATTATTATCA TACCCATGAC TATTATCGTA AAAATGTTGT TGGATAA
 
Protein sequence
MINIDANELQ KLYIAYFGRP GDPSGINYWL SRSNQSLDLT GISNELSMQD EYIKYTSYDK 
SFDFKINKLY LNLYNRKADF DGLNYWMKMT NSKEYKISDI VYELVFSSSK PYSVDLKQEK
KDSHILQNKI FAAELFTKQI SKSITLINLY KPDSISPWIS GNSFIRVSNF FSHINEKTVS
IDHINNFIAS LSDTPVSILN EPAIEIKDTS LSIPIYQTEN RSFAKKITKN VINITGGALR
KSKNKTSIIA LNNINLTIMK GERVGLIGHN GSGKSSFLRL ISGIYIPTSG NINVLVDVYP
MLQKTFLTST ELSGIDACKA HYLLKNHSLD GFESFLNEIT EFSGLGSYIS LPIKTYSEGM
SARLVFSILT STPHECLAID EGFGTGDADF CDRAEERMKQ FMESAATLFL ASHSEELLKQ
FCNRGIVFSH GSIVYDGPLD AALNYYHTHD YYRKNVVG