Gene P9211_10391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_10391 
SymbolglmU 
ID5730467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp932303 
End bp933664 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content39% 
IMG OID641285406 
Productbifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase 
Protein accessionYP_001550924 
Protein GI159903580 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 
TIGRFAM ID[TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGCAA TTGCGATACT AGCGGCTGGG AAAGGAACTC GAATGCGTAG TTCCTACCCA 
AAGGTACTCC AACAACTTGC AGGCAGAAGT TTAATAAAGC GTGTGATAAA GAGTTGCGAA
GACTTAAAAC CAGATCGATT TCTAGTAATC GTTGGTCACC AAGCAGAAGC AGTACAAGAC
CATTTAAAAG AATTATCCCA TCTTGAATAC ATAAACCAGG TGCCTCAAAA AGGGACAGGG
CACGCTATTC AGCAACTCCT TCCAGTACTT GATAATTTCA TTGGTGATTT ATTAGTACTT
AACGGTGATG TTCCACTATT AAAGGCAGAA ACATTACAAA AGTTGATAGC TAAACATAAA
ACATCAAAAG CTAGTGTCAC ATTTCTATCT GCTCGTCTAT CAAATCCCAA AGGATATGGT
CGCGTCTTTT CAAATAACCA AGATGAAGTA GATCGGATTG TTGAGGATGC TGATTGTAGC
AGGGAGGAGA AAAGTAACAA GCTGACAAAT GCTGGTATAT ATCTATTTAA ATGGGATTTA
CTAAAGAATA TATTGCCAAA ATTATCTAGC ACGAATAAGC AGAGTGAGCT TTATTTGACT
GATGCTATAT CCCAATTACC TACAGCAATA CACCTTGAAG TAGATAATAT AGATGAGGTC
AGCGGAGTTA ATGACAGAGC ACAATTGGCT AACTGTGAAA ACTTAATACA GCAAAGCTTA
CGCAATCACT GGATGAGTAA AGGGGTTAGT TTTATAGACC CAGAAAGTTG CACTATCAGC
GAGGAATCCC AATTTGGAAT AGACATAGTA ATCGAGCCAC AAACACATTT GCGAGGCAAT
TGCTTTATAG GCAACAATTG TAGACTAGGG CCAAGCACAT ATATTGAAGA TTCTAGGCTA
GGCGAAAATG TAAACGTAAT GCAGTCGACA TTAAATAACT GCCAGGTTGC TAGCCATGTA
AAGATAGGGC CATTTGCTCA TTTACGGCCA GAGACTAACG TATCAAGTAA TTGTCGAATA
GGGAATTTTG TAGAAATAAA AAAAAGTGAG CTTGGTCAAG GAACAAAAGT GAATCATCTA
AGTTATATAG GCGACTCTCA TGTGGGATGC CATGTGAACA TAGGAGCAGG TACTATAACA
GCAAACTTTG ACGGATTTAG AAAAAACGAA ACGGTCATAG GAGATCACAC TAAAACAGGT
GCTAATTCCG TATTAATTGC TCCAATTAAT ATAGGCAACA GAGTGACCGT AGGTGCAGGG
TCAACTCTGA CTAAAAATGT TCCAGATGGT TCACTGGCTA TTGAAAGGTC AAAGCAAAAT
ATCAAAGAGA ATTGGAAGAC TCGAGAAGAA ACTAATCAAT AA
 
Protein sequence
MFAIAILAAG KGTRMRSSYP KVLQQLAGRS LIKRVIKSCE DLKPDRFLVI VGHQAEAVQD 
HLKELSHLEY INQVPQKGTG HAIQQLLPVL DNFIGDLLVL NGDVPLLKAE TLQKLIAKHK
TSKASVTFLS ARLSNPKGYG RVFSNNQDEV DRIVEDADCS REEKSNKLTN AGIYLFKWDL
LKNILPKLSS TNKQSELYLT DAISQLPTAI HLEVDNIDEV SGVNDRAQLA NCENLIQQSL
RNHWMSKGVS FIDPESCTIS EESQFGIDIV IEPQTHLRGN CFIGNNCRLG PSTYIEDSRL
GENVNVMQST LNNCQVASHV KIGPFAHLRP ETNVSSNCRI GNFVEIKKSE LGQGTKVNHL
SYIGDSHVGC HVNIGAGTIT ANFDGFRKNE TVIGDHTKTG ANSVLIAPIN IGNRVTVGAG
STLTKNVPDG SLAIERSKQN IKENWKTREE TNQ