Gene NATL1_20571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_20571 
SymbolmanC 
ID4780010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1700713 
End bp1702170 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content33% 
IMG OID640085353 
Productmannose-1-phosphate guanylyltransferase 
Protein accessionYP_001015877 
Protein GI124026762 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0662] Mannose-6-phosphate isomerase
[COG0836] Mannose-1-phosphate guanylyltransferase 
TIGRFAM ID[TIGR01479] mannose-1-phosphate guanylyltransferase/mannose-6-phosphate isomerase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATAA CTGATAACTC CATAATTCCT GTAATTCTTA GTGGTGGATC AGGAACAAGA 
CTTTGGCCAC TTTCTCGAGA AAGTTATCCT AAGCAATTTC TAGCATTAGA TTCACGCACA
AAAAAAACAC TTTTGCAGAA AACTTATGAA AGGCTTCTAG GTTTAGAGGG ACTAGAAAAT
CCTATATTAA TATGCAATGA AGATCACAGA TTTATAGTTG CAGAGCAATT TAGAGAGATA
AATACTGATC CTCAAGCAAT TATTTTAGAA CCCGTCGGAC GTAACACAGC TCCAGCAATA
GCAGTTGCTG CTCTTCAAGC GATTTCTCTA GGTAAAGACC CTTTGCTATT GATTTTGGCT
TCTGATCACT TGATAGAGAA TAACGTTGAA TTTCAAAGAG TAATTCAATC TGCAAAAATA
TATGCAAATC AAGGAAAGCT AGTAACTTTT GGTATAGTTC CAACAAGCGC AGAAACTGGT
TACGGTTATA TTGAAACAAA AGAATTTCCT ACCAAAGACA ATCAAATAAT TGGTTTAGAG
ATAAACAAAT TTATAGAAAA GCCTAATAAA GATATAGCTG AGAAATTAAT CAAAGATTCT
CGCTTCACAT GGAATAGTGG TATGTTTCTT TTTAAAGCGA GTACGATAAT TAGTGAATTA
AAAAAATTCT CTCCAGAGAT ACTCAATAAT TGCAAAATTG CTCTCGAGAA AGATATAGAA
GATCTTGATT TTCTACGTTT AGAAACTGAA TCCTTCAAGA AATGTCCAAA AATATCAATA
GATATCGCAG TGATGGAAAA AACGAATTTA GGAATCGTTC TTCCTTTAAA TGTTGGATGG
AATGATATAG GAAGTTGGAA ATCTTTATGG GATATTAGCA AAAAAAATAA TGATGGAAAC
TATATCAACG GTAGAATAAT AGCTGAAAAA AGTAAAAACT GTTATCTAAA AGGTGAACAA
CGTCTAATTG TAGGAATAGG GATAGAAAAT CTAATAGTTG TTGATACAAA TGATGCTATA
TTAGTAGCCA ATAGAGATCA ATCTCAAAAT ATTGGAAATA TAGTCAAAAG TCTAAGTTCA
TCGGCATTTC CAGAAGGCAA AGTGCATAGA AAAATTTATC GACCCTGGGG GAATTACACT
ACAATTGTCG AGGGAAATAG ATGGTTAGTG AAACTCATCG AGGTAAAGCC AAATGCTTCT
CTTTCTTTAC AAATGCATCA TCATAGAGCA GAACACTGGG TAGTAGTTAA TGGAACAGCA
TTGATAGAAA AAAATGGAGA AAAAAAACTT TTAAGTGAAA ATGAAAGTAC ATTTATCCCT
TTAGGTTGCA AACATAGATT AAGTAATCCA GGAAAAATGA GACTTGAACT TATTGAAGTT
CAAAGCGGAA CGTATTTAGG CGAAGAAGAC ATCATTCGTT TTGAAGATTC TTATGGCAGA
ATAAAAAATC AGAATTAA
 
Protein sequence
MNITDNSIIP VILSGGSGTR LWPLSRESYP KQFLALDSRT KKTLLQKTYE RLLGLEGLEN 
PILICNEDHR FIVAEQFREI NTDPQAIILE PVGRNTAPAI AVAALQAISL GKDPLLLILA
SDHLIENNVE FQRVIQSAKI YANQGKLVTF GIVPTSAETG YGYIETKEFP TKDNQIIGLE
INKFIEKPNK DIAEKLIKDS RFTWNSGMFL FKASTIISEL KKFSPEILNN CKIALEKDIE
DLDFLRLETE SFKKCPKISI DIAVMEKTNL GIVLPLNVGW NDIGSWKSLW DISKKNNDGN
YINGRIIAEK SKNCYLKGEQ RLIVGIGIEN LIVVDTNDAI LVANRDQSQN IGNIVKSLSS
SAFPEGKVHR KIYRPWGNYT TIVEGNRWLV KLIEVKPNAS LSLQMHHHRA EHWVVVNGTA
LIEKNGEKKL LSENESTFIP LGCKHRLSNP GKMRLELIEV QSGTYLGEED IIRFEDSYGR
IKNQN