Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3961 |
Symbol | glmU |
ID | 3906921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 4740138 |
End bp | 4741835 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637881289 |
Product | bifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase |
Protein accession | YP_483040 |
Protein GI | 86742640 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) |
TIGRFAM ID | [TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.307012 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCATCTG CCGGCGCCTT CCGGTCGACG AGTAGTGTGG GCGCCGTCGC CCCACCGCTG TCCAGGTCCG AAGACTCGGG AGATCGCGTG ACATCATCAC GTCCGGTGGC CGTCATCATC CTGGCAGCGG GCGAAGGCAA GCGCATGCGC TCGACGAGAC CGAAGGTGCT GCACCACATC GCTGGGCGCT CCCTTCTCCA CCACGTGCTG AGCGCGGCCG GAGCCCTCGC GGCGGCGCGT ACCGCGGTCG TCGTGGGCCA CGGCCGCGAA CAGGTCGCGG CGATGCTGGC GGAGCAGGCT CCCGAGGTCG TCCCGGTCGT CCAGGACCGC CAACACGGCA CGGGCCACGC CGTCCGGGTG GCCCTGGCGG CGCTCGGCGA GCTGGCGCCC GGCGACACGG TGGTCGTGCT GCCCGGCGAC ACCCCGCTGT TGACCAGCGC GACACTCACT GCCCTGGTGG CCCAGCACCA TGCCCGGGCG GCCGCGGCGA CGGTCCTGTC GGCCGTCATC GACGACGCCA CCGGTTACGG CCGGGTGGTG CGCGACGGCG ACGGCGCGGT CCGGGCCATC GTCGAGCATC GCGACGCCGA CGCCGCCACC GCGACGATCC GAGAGATCAA CACCGGCGTG TACGCGTTCG CAGCGGGCCC GCTCCAGACC GCGCTCGCGC GGCTGACCAG GGATAATGCT CAAGGCGAGG AGTACCTCAC CGACGTCGTC GGGCTGCTGG TCGCCGCCGG GCAGCCGGTG GCCGCGCGGG TCGTGGACGA TCCCGGCGAG GCCGGTGGGG TGAACGACCG CGTCCAGCTC GCCGCCGCCG GACGGGTGCT GCGGGACCGG ATCGTCGAGG CGGCGATGCG CGCCGGCACC ACCGTCGTCG ACCCGCGGAC CACCTGGATC GACGCCGACG TCACTCTCGA ACCGGACACG ACTATCGCCC CCAACACCTT CCTGCACGGC CGGACGCACG TCGCTCGCGG CGCCGTCATC GGCCCCGAAT GCACCTTGAC GGACACGACG GTAGGCGCGG GCGCGACCGT CCTGCGGACC ACGGCCGAGC GGGCCGAGAT CGGCGCCGGG GCCGTGGTCG GGCCTTACAG CCATCTCCGA CCGGGCACCC GACTGGGCCG CGAAGGCAAG ATCGGCTCCT TCGTCGAGAC GAAGTCGGCC GATCTCGGGA ACCAGACGAA GGTCCCGCAT CTGGCGTATG TCGGCGACGC GGTGGTCGGC GAGCGCAGCA ACATCGGATG CACGACGGTC TTCGTGAACT ACGACGGGGT GGCCAAGCAC CGCACGGTGA TCGGCTCGGA CGTCCGGATC GGCAGCGATA CCATGCTCGT TGCTCCGGTC ACGGTCGGGG ACGGGGCCTA CACCGGCGCC GGATCCGTGA TCCGGGAGGA CGTTCCGCCG GGCGCGCTGG CGGTCCGGGA GGGGCGGCAA CGGATCATCG AAGGCTGGGT CTCCCGCCGC CGTCCCGGCA GCCCGGCGGC ACGAGCGGCC GCCGCTGCCG GCGTCCAGGC CCCCGGCAGC GTCGGGGATC CCGAGCATCC CGATGAAATG CCGCAACCGT CCGCGGCCGA GGCCGCCGTG CCGATCCCAG CTCATCAGGG TGACGTGGCG CACCGGAGTG AGTTGCCGCA TACCGGCGAG GGCGAGGCCG GCATCGCTGG CATTCCCGGC GCGGGGAGCA GAACGTAA
|
Protein sequence | MASAGAFRST SSVGAVAPPL SRSEDSGDRV TSSRPVAVII LAAGEGKRMR STRPKVLHHI AGRSLLHHVL SAAGALAAAR TAVVVGHGRE QVAAMLAEQA PEVVPVVQDR QHGTGHAVRV ALAALGELAP GDTVVVLPGD TPLLTSATLT ALVAQHHARA AAATVLSAVI DDATGYGRVV RDGDGAVRAI VEHRDADAAT ATIREINTGV YAFAAGPLQT ALARLTRDNA QGEEYLTDVV GLLVAAGQPV AARVVDDPGE AGGVNDRVQL AAAGRVLRDR IVEAAMRAGT TVVDPRTTWI DADVTLEPDT TIAPNTFLHG RTHVARGAVI GPECTLTDTT VGAGATVLRT TAERAEIGAG AVVGPYSHLR PGTRLGREGK IGSFVETKSA DLGNQTKVPH LAYVGDAVVG ERSNIGCTTV FVNYDGVAKH RTVIGSDVRI GSDTMLVAPV TVGDGAYTGA GSVIREDVPP GALAVREGRQ RIIEGWVSRR RPGSPAARAA AAAGVQAPGS VGDPEHPDEM PQPSAAEAAV PIPAHQGDVA HRSELPHTGE GEAGIAGIPG AGSRT
|
| |