Gene P9303_18871 details


Gene Information

Locus tag: P9303_18871
Symbol: glmU
ID: 4778650
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1648076
End bp: 1649488
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 53%
IMG OID: 640087396
Product: bifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase
Protein accession: YP_001017894
Protein GI: 124023587
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains)
TIGRFAM ID: [TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase
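
The start, end, gene length, and protein length fields can be cross-checked against each other. The following is a minimal Python sketch of that arithmetic. It assumes the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(1648076, 1649488)
print(length, protein_length(length))  # prints "1413 470"
```

Both values agree with the Gene Length and Protein Length fields listed in this record.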


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.33386
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAAAGA CGGTGGTGCC GGAACCGAAC GGGCGAATGC CAGTAGATTC CGGCCGATTC 
GCCATTCGCT CCATGCTTGC CGTCGCCATC TTGGCTGCTG GGAAGGGCAC CCGCATGAAG
AGCTGCCTGC CGAAAGTCTT GCAACCGCTC GCAGGTTCCA CATTGGTTGA GCGGGTGTTG
ACCAGTTGCT CGGGCCTTCA ACCTCAACGG TGCCTGTTGA TTGTTGGCCA CCAGGCGCAA
GAGGTGCAAC AGCAGCTCAC TGATTGGCAA GGCTTGGAGT TTGTTGTTCA ACAACCTCAG
AACGGTACAG GTCATGCGGT GCAGCAAGTA CTGCCCGTAT TGGAGGGCTT TGATGGCGAG
CTTTTGGTTC TCAATGGTGA TGTTCCATTA CTTCGACCAA GCACGATCGA ACACTTGGTG
AACGAACATC GCTCAAGTGG TGCCGACGTC ACGCTGCTAA CAGCCCGACT CGCAGATCCC
ACGGGCTACG GCAGGGTGTT TTCAGATCAA CAAGGCCGCG TAAACAGCAT CGTTGAACAT
CGCGACTGTA GTGACGAACA GCGCCACAAC AATCTCACTA ACGCTGGCAT CTACTGCTTC
AATTGGAAGA AATTGGCCGC AGTACTACCC CAACTTTGTA GTGATAATGA TCAAGGTGAG
CTCTACCTCA CCGACACTGT GGCCTTGCTA CCTATTGCGA TGCATGTAGA GGTGGCTGAT
CCCGATGAGG TGAATGGCAT CAACGATCGT TGCCAGCTTG CCAACTGTGA GGCGCTGCTT
CAAGAACGGC TACGTAACCA TTGGATGAAG GAAGGGGTCA CTTTTACTGA CCCTGCTAGC
TGCACGCTTA GTGAAGACTG TCAGTTCGGT AGAGATGTGG TGATTGAACC CCAAACCCAT
TTGCGGGGCT GCTGCAACAT TGGCGATGGC TGCCAGCTTG GCCCAGGAAG CTTGATTGAG
AATGCCGACC TTGGCCATGG AGTCAGCGTT CTTCATTCCG TTGTACGTGA TGCCAAGGTG
CGAAATGAGG TGGCTATTGG CCCTTTCTCA CACCTACGCC CTGGTGCAGA CATCGCTGAT
CAATGCCGTA TCGGCAACTT TGTGGAGATT AAAAAAAGCC AAATTGGCGA GGGTTCAAAG
GTGAATCACC TCAGTTATAT CGGTGACGCA CAACTTGGTC GCCATGTCAA TGTCGGGGCC
GGTACGATCA CTGCAAATTA CGACGGTGTG AGAAAACATC TCACCGTGGT TGGAGACAAC
AGCAAAACAG GGGCGAATTC CGTTTTGGTG GCGCCGATTG TTTTAGGGTC GAACGTGACA
GTAGGAGCTG GCTCCACTCT CACTAAAGAT GTTCCCAATG GTGCTCTCGC TCTTGGCCGC
TCCAAACAAC TGATTAAAAA TGGTTGGCAG TGA
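
The gene sequence above is printed in blocks of 10 bases, 60 per line. A small Python sketch for joining such blocks into one contiguous string and computing the GC fraction follows. The helper names are hypothetical, and the result for the full sequence is meant to be compared against the GC content field of this record rather than taken as its definition.

```python
def join_blocks(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty DNA string."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, as printed in the record.
first_blocks = join_blocks("ATGGAAAAGA CGGTGGTGCC")
print(first_blocks, gc_fraction(first_blocks))
```

Pasting all of the sequence lines into `join_blocks` should give a string of 1413 bases if the listed gene length holds.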
 
Protein sequence
MEKTVVPEPN GRMPVDSGRF AIRSMLAVAI LAAGKGTRMK SCLPKVLQPL AGSTLVERVL 
TSCSGLQPQR CLLIVGHQAQ EVQQQLTDWQ GLEFVVQQPQ NGTGHAVQQV LPVLEGFDGE
LLVLNGDVPL LRPSTIEHLV NEHRSSGADV TLLTARLADP TGYGRVFSDQ QGRVNSIVEH
RDCSDEQRHN NLTNAGIYCF NWKKLAAVLP QLCSDNDQGE LYLTDTVALL PIAMHVEVAD
PDEVNGINDR CQLANCEALL QERLRNHWMK EGVTFTDPAS CTLSEDCQFG RDVVIEPQTH
LRGCCNIGDG CQLGPGSLIE NADLGHGVSV LHSVVRDAKV RNEVAIGPFS HLRPGADIAD
QCRIGNFVEI KKSQIGEGSK VNHLSYIGDA QLGRHVNVGA GTITANYDGV RKHLTVVGDN
SKTGANSVLV APIVLGSNVT VGAGSTLTKD VPNGALALGR SKQLIKNGWQ
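
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal Python version that builds the codon table from the standard NCBI amino-acid string, which translation table 11 shares with table 1 (the two tables differ only in which codons may act as start codons). The `translate` helper is illustrative. It assumes the input begins in frame and does not handle alternative start codons, which is sufficient here because this gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGGAAAAGA CGGTGGTGCC GGAACCGAAC"))  # prints "MEKTVVPEPN"
print(translate("AATGGTTGGCAGTGA"))                   # prints "NGWQ"
```

The two outputs match the first block and the final four residues of the protein sequence listed above.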