Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_18871 |
Symbol | glmU |
ID | 4778650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1648076 |
End bp | 1649488 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640087396 |
Product | bifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase |
Protein accession | YP_001017894 |
Protein GI | 124023587 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) |
TIGRFAM ID | [TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.33386 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAAGA CGGTGGTGCC GGAACCGAAC GGGCGAATGC CAGTAGATTC CGGCCGATTC GCCATTCGCT CCATGCTTGC CGTCGCCATC TTGGCTGCTG GGAAGGGCAC CCGCATGAAG AGCTGCCTGC CGAAAGTCTT GCAACCGCTC GCAGGTTCCA CATTGGTTGA GCGGGTGTTG ACCAGTTGCT CGGGCCTTCA ACCTCAACGG TGCCTGTTGA TTGTTGGCCA CCAGGCGCAA GAGGTGCAAC AGCAGCTCAC TGATTGGCAA GGCTTGGAGT TTGTTGTTCA ACAACCTCAG AACGGTACAG GTCATGCGGT GCAGCAAGTA CTGCCCGTAT TGGAGGGCTT TGATGGCGAG CTTTTGGTTC TCAATGGTGA TGTTCCATTA CTTCGACCAA GCACGATCGA ACACTTGGTG AACGAACATC GCTCAAGTGG TGCCGACGTC ACGCTGCTAA CAGCCCGACT CGCAGATCCC ACGGGCTACG GCAGGGTGTT TTCAGATCAA CAAGGCCGCG TAAACAGCAT CGTTGAACAT CGCGACTGTA GTGACGAACA GCGCCACAAC AATCTCACTA ACGCTGGCAT CTACTGCTTC AATTGGAAGA AATTGGCCGC AGTACTACCC CAACTTTGTA GTGATAATGA TCAAGGTGAG CTCTACCTCA CCGACACTGT GGCCTTGCTA CCTATTGCGA TGCATGTAGA GGTGGCTGAT CCCGATGAGG TGAATGGCAT CAACGATCGT TGCCAGCTTG CCAACTGTGA GGCGCTGCTT CAAGAACGGC TACGTAACCA TTGGATGAAG GAAGGGGTCA CTTTTACTGA CCCTGCTAGC TGCACGCTTA GTGAAGACTG TCAGTTCGGT AGAGATGTGG TGATTGAACC CCAAACCCAT TTGCGGGGCT GCTGCAACAT TGGCGATGGC TGCCAGCTTG GCCCAGGAAG CTTGATTGAG AATGCCGACC TTGGCCATGG AGTCAGCGTT CTTCATTCCG TTGTACGTGA TGCCAAGGTG CGAAATGAGG TGGCTATTGG CCCTTTCTCA CACCTACGCC CTGGTGCAGA CATCGCTGAT CAATGCCGTA TCGGCAACTT TGTGGAGATT AAAAAAAGCC AAATTGGCGA GGGTTCAAAG GTGAATCACC TCAGTTATAT CGGTGACGCA CAACTTGGTC GCCATGTCAA TGTCGGGGCC GGTACGATCA CTGCAAATTA CGACGGTGTG AGAAAACATC TCACCGTGGT TGGAGACAAC AGCAAAACAG GGGCGAATTC CGTTTTGGTG GCGCCGATTG TTTTAGGGTC GAACGTGACA GTAGGAGCTG GCTCCACTCT CACTAAAGAT GTTCCCAATG GTGCTCTCGC TCTTGGCCGC TCCAAACAAC TGATTAAAAA TGGTTGGCAG TGA
|
Protein sequence | MEKTVVPEPN GRMPVDSGRF AIRSMLAVAI LAAGKGTRMK SCLPKVLQPL AGSTLVERVL TSCSGLQPQR CLLIVGHQAQ EVQQQLTDWQ GLEFVVQQPQ NGTGHAVQQV LPVLEGFDGE LLVLNGDVPL LRPSTIEHLV NEHRSSGADV TLLTARLADP TGYGRVFSDQ QGRVNSIVEH RDCSDEQRHN NLTNAGIYCF NWKKLAAVLP QLCSDNDQGE LYLTDTVALL PIAMHVEVAD PDEVNGINDR CQLANCEALL QERLRNHWMK EGVTFTDPAS CTLSEDCQFG RDVVIEPQTH LRGCCNIGDG CQLGPGSLIE NADLGHGVSV LHSVVRDAKV RNEVAIGPFS HLRPGADIAD QCRIGNFVEI KKSQIGEGSK VNHLSYIGDA QLGRHVNVGA GTITANYDGV RKHLTVVGDN SKTGANSVLV APIVLGSNVT VGAGSTLTKD VPNGALALGR SKQLIKNGWQ
|
| |