Gene P9211_01851 details

Gene Information

Locus tag: P9211_01851
Symbol: nagA
ID: 5731705
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 177014
End bp: 178165
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 40%
IMG OID: 641284529
Product: N-acetylglucosamine-6-phosphate deacetylase
Protein accession: YP_001550070
Protein GI: 159902726
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1820] N-acetylglucosamine-6-phosphate deacetylase
TIGRFAM ID: [TIGR00221] N-acetylglucosamine-6-phosphate deacetylase
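
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (which is consistent with the reported length) and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


gene_len = feature_length(177014, 178165)
print(gene_len)                  # 1152
print(protein_length(gene_len))  # 383
```

Both values agree with the Gene Length and Protein Length fields of this record.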


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATTGGA TTAATAATAT TCGTTTACCA ACTCCTTTCA CGGCTAATGC AGATTGCTGT 
TGGTCTGTTT TATTAGATTC AAGGGATATT GTTCGATCAA TTGAGCCAAG CTCTAGCTCA
ATTGATAAGG AGGAGAATTG GCATGGAGAT TGGTTGAGTC CAATGGGTCT CGATCTTCAA
ATCAATGGGG GGCTAGGGGT TTCGTTTAAT GCTCTTGATC GCGAGGATTT GCCAAATATT
AATAAGTTAC TTGATCGCCT ATGGATGGAA GGTGTTGATG AGATATGTCC GACAATAGTG
ACATGCAGTC TTTCTTCATT AAGGAAGTCC TTGGGAGTAT TACACCAGGC CCGTAAAAGA
GTCTCAGATA AGTCATGCAG GCTCATTGGT GCTCATCTAG AAGGCCCTTT CTTATCAAGG
GATTATGTTG GAGCTCATGA CTCGGATTTC CTCATCAATC CTACGCTTTC TTCTTTACAT
GAGCGAATTC AAGAGTTTGA GACTGAAATA GCGATTGTTA CGCTTGCTCC AGAACTTTTG
GGGTCTTTCG AAGTTGTTCA AAAATTAATA GATCTTGGGG TTGTGGTTTC TTTAGGCCAC
TCGGGGGCTG ATGCTGAATT GAGTTCATTA GCATTTGACC ATGGGGTCAG CATGATTACT
CATGCTTTTA ATGCTATGCC GGGTATTCAT CATAGATCTC CCGGACCTTT AGGTGAAGCC
ATCGCAAATG GTGATATTTC AATTGGTTTA ATCGCCGATG GTATACATGT TCACCCAAAG
GTCTTAAAAA TATTGCAGAA ACTTGCTCCA GAAAAGATTG TTTTAGTTAG CGATGCTCTA
AGTCCATATG GTCTTGCTCA AGAAAAATTT CAATGGAATG ATCGATCATT AATAGTAAAA
AACAATTTTT GCTCGCTTGA GGATGGCACT TTAGTAGGAA CGACTTTGTC ATTATTGGCT
GCTTGTAAGC GTTTTGCTAA GTGGACAAAT CAGAATTCCG CTGCCATTTG GTCTGCAACG
GTTGCTCCGC GCATTGCTTT GAATAAAGGA GATACTGTTC AAGATTTTCT TGTTGGAAAA
TCATTAAATC AATTGTTGAG ATGGAACCTA GATATTGAGT CTGAAGAGTT AACTTGGAAT
CATGCTAAGT AG
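
The gene sequence above is printed in blocks of ten bases separated by spaces. A minimal sketch of how such text might be turned into a continuous string and its GC fraction computed is shown below; the helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record.
first_line = "ATGCATTGGA TTAATAATAT TCGTTTACCA ACTCCTTTCA CGGCTAATGC AGATTGCTGT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC fraction of the first line: {gc_fraction(seq):.2f}")
```

Applied to the full sequence, the same two functions give values that can be compared with the Gene Length and GC content fields.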
 
Protein sequence
MHWINNIRLP TPFTANADCC WSVLLDSRDI VRSIEPSSSS IDKEENWHGD WLSPMGLDLQ 
INGGLGVSFN ALDREDLPNI NKLLDRLWME GVDEICPTIV TCSLSSLRKS LGVLHQARKR
VSDKSCRLIG AHLEGPFLSR DYVGAHDSDF LINPTLSSLH ERIQEFETEI AIVTLAPELL
GSFEVVQKLI DLGVVVSLGH SGADAELSSL AFDHGVSMIT HAFNAMPGIH HRSPGPLGEA
IANGDISIGL IADGIHVHPK VLKILQKLAP EKIVLVSDAL SPYGLAQEKF QWNDRSLIVK
NNFCSLEDGT LVGTTLSLLA ACKRFAKWTN QNSAAIWSAT VAPRIALNKG DTVQDFLVGK
SLNQLLRWNL DIESEELTWN HAK
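
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the usual compact TCAG ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons are permitted, so this table is assumed to be adequate here. The `translate` function is an illustrative helper, not a library call, and it stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGCATTGGA TTAATAATAT TCGTTTACCA"))  # MHWINNIRLP
print(translate("CATGCTAAGT AG"))                      # HAK
```

The two outputs match the first ten and last three residues of the protein sequence listed above.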