Gene Francci3_3961 details

Gene Information

Locus tag: Francci3_3961
Symbol: glmU
ID: 3906921
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4740138
End bp: 4741835
Gene Length: 1698 bp
Protein Length: 565 aa
Translation table: 11
GC content: 73%
IMG OID: 637881289
Product: bifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase
Protein accession: YP_483040
Protein GI: 86742640
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains)
TIGRFAM ID: [TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase
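
The start and end coordinates, gene length, and protein length listed above can be checked against each other with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive at both ends, and that the final codon is a stop codon that does not appear in the protein. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4740138
end_bp = 4741835

# Assumption: both coordinates are inclusive, so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the listed gene length (1698 bp) and protein length (565 aa).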


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.307012
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCATCTG CCGGCGCCTT CCGGTCGACG AGTAGTGTGG GCGCCGTCGC CCCACCGCTG 
TCCAGGTCCG AAGACTCGGG AGATCGCGTG ACATCATCAC GTCCGGTGGC CGTCATCATC
CTGGCAGCGG GCGAAGGCAA GCGCATGCGC TCGACGAGAC CGAAGGTGCT GCACCACATC
GCTGGGCGCT CCCTTCTCCA CCACGTGCTG AGCGCGGCCG GAGCCCTCGC GGCGGCGCGT
ACCGCGGTCG TCGTGGGCCA CGGCCGCGAA CAGGTCGCGG CGATGCTGGC GGAGCAGGCT
CCCGAGGTCG TCCCGGTCGT CCAGGACCGC CAACACGGCA CGGGCCACGC CGTCCGGGTG
GCCCTGGCGG CGCTCGGCGA GCTGGCGCCC GGCGACACGG TGGTCGTGCT GCCCGGCGAC
ACCCCGCTGT TGACCAGCGC GACACTCACT GCCCTGGTGG CCCAGCACCA TGCCCGGGCG
GCCGCGGCGA CGGTCCTGTC GGCCGTCATC GACGACGCCA CCGGTTACGG CCGGGTGGTG
CGCGACGGCG ACGGCGCGGT CCGGGCCATC GTCGAGCATC GCGACGCCGA CGCCGCCACC
GCGACGATCC GAGAGATCAA CACCGGCGTG TACGCGTTCG CAGCGGGCCC GCTCCAGACC
GCGCTCGCGC GGCTGACCAG GGATAATGCT CAAGGCGAGG AGTACCTCAC CGACGTCGTC
GGGCTGCTGG TCGCCGCCGG GCAGCCGGTG GCCGCGCGGG TCGTGGACGA TCCCGGCGAG
GCCGGTGGGG TGAACGACCG CGTCCAGCTC GCCGCCGCCG GACGGGTGCT GCGGGACCGG
ATCGTCGAGG CGGCGATGCG CGCCGGCACC ACCGTCGTCG ACCCGCGGAC CACCTGGATC
GACGCCGACG TCACTCTCGA ACCGGACACG ACTATCGCCC CCAACACCTT CCTGCACGGC
CGGACGCACG TCGCTCGCGG CGCCGTCATC GGCCCCGAAT GCACCTTGAC GGACACGACG
GTAGGCGCGG GCGCGACCGT CCTGCGGACC ACGGCCGAGC GGGCCGAGAT CGGCGCCGGG
GCCGTGGTCG GGCCTTACAG CCATCTCCGA CCGGGCACCC GACTGGGCCG CGAAGGCAAG
ATCGGCTCCT TCGTCGAGAC GAAGTCGGCC GATCTCGGGA ACCAGACGAA GGTCCCGCAT
CTGGCGTATG TCGGCGACGC GGTGGTCGGC GAGCGCAGCA ACATCGGATG CACGACGGTC
TTCGTGAACT ACGACGGGGT GGCCAAGCAC CGCACGGTGA TCGGCTCGGA CGTCCGGATC
GGCAGCGATA CCATGCTCGT TGCTCCGGTC ACGGTCGGGG ACGGGGCCTA CACCGGCGCC
GGATCCGTGA TCCGGGAGGA CGTTCCGCCG GGCGCGCTGG CGGTCCGGGA GGGGCGGCAA
CGGATCATCG AAGGCTGGGT CTCCCGCCGC CGTCCCGGCA GCCCGGCGGC ACGAGCGGCC
GCCGCTGCCG GCGTCCAGGC CCCCGGCAGC GTCGGGGATC CCGAGCATCC CGATGAAATG
CCGCAACCGT CCGCGGCCGA GGCCGCCGTG CCGATCCCAG CTCATCAGGG TGACGTGGCG
CACCGGAGTG AGTTGCCGCA TACCGGCGAG GGCGAGGCCG GCATCGCTGG CATTCCCGGC
GCGGGGAGCA GAACGTAA
 
Protein sequence
MASAGAFRST SSVGAVAPPL SRSEDSGDRV TSSRPVAVII LAAGEGKRMR STRPKVLHHI 
AGRSLLHHVL SAAGALAAAR TAVVVGHGRE QVAAMLAEQA PEVVPVVQDR QHGTGHAVRV
ALAALGELAP GDTVVVLPGD TPLLTSATLT ALVAQHHARA AAATVLSAVI DDATGYGRVV
RDGDGAVRAI VEHRDADAAT ATIREINTGV YAFAAGPLQT ALARLTRDNA QGEEYLTDVV
GLLVAAGQPV AARVVDDPGE AGGVNDRVQL AAAGRVLRDR IVEAAMRAGT TVVDPRTTWI
DADVTLEPDT TIAPNTFLHG RTHVARGAVI GPECTLTDTT VGAGATVLRT TAERAEIGAG
AVVGPYSHLR PGTRLGREGK IGSFVETKSA DLGNQTKVPH LAYVGDAVVG ERSNIGCTTV
FVNYDGVAKH RTVIGSDVRI GSDTMLVAPV TVGDGAYTGA GSVIREDVPP GALAVREGRQ
RIIEGWVSRR RPGSPAARAA AAAGVQAPGS VGDPEHPDEM PQPSAAEAAV PIPAHQGDVA
HRSELPHTGE GEAGIAGIPG AGSRT
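
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code), in which an alternative start codon such as GTG is read as methionine when it opens the coding sequence. The following is a minimal sketch of that translation in plain Python, not the tool the database used. The function name `translate_cds` and the constant names are hypothetical, and only the first 60 bases of the gene sequence are used to keep the example short.

```python
from itertools import product

# Amino acids for all 64 codons in NCBI order (bases T, C, A, G; first base varies slowest).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An alternative start codon is translated as methionine at position 1.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence above, with the original spacing.
first_60 = "GTGGCATCTG CCGGCGCCTT CCGGTCGACG AGTAGTGTGG GCGCCGTCGC CCCACCGCTG"
print(translate_cds(first_60))  # MASAGAFRSTSSVGAVAPPL
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same function would be the natural way to check the remaining residues.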