Gene Franean1_0770 details


Gene Information

Locus tag: Franean1_0770
Symbol: glmU
ID: 5669186
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 896167
End bp: 897690
Gene Length: 1524 bp
Protein Length: 507 aa
Translation table: 11
GC content: 75%
IMG OID: 641239697
Product: bifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase
Protein accession: YP_001505134
Protein GI: 158312626
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains)
TIGRFAM ID: [TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase
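
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 896167
end_bp = 897690
gene_length_bp = 1524
protein_length_aa = 507

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span = end_bp - start_bp + 1

# Assumption: one codon per residue plus one final stop codon
# that does not appear in the protein.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates match gene length
print(gene_length_bp % 3 == 0)                       # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # protein length matches
```

Under these assumptions all three checks agree with the listed values: 897690 - 896167 + 1 = 1524, and 1524 / 3 - 1 = 507.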


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.100351
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACTCAGC CACGTCCGGC GGCTGTCATC GTCCTGGCGG CGGGCGAGGG GCGGCGAATG 
CGTTCGGGCA CCCCGAAGGT CCTGCATCGC CTCGCCGGTC TGACCCTGCT CGAGCATGTG
CTCCGCGCGA CCGAGCCGGT CGGGGCGCCC CGGGCGGTGG TCGTCGTCGG TCACGGGCGC
GAGCAGGTCG CGGCCATGCT CGCCGAGCGG GCGCCGCACG TCATCACCGC CGTCCAGGAG
CACCAGGGCG GGACGGGCCA CGCCGTGCGT GCCGCGCTGG CGGCGCTGGA CGGTCTCCCG
CCCGACGCGT CCGTCGTCGT GCTGCCCGGT GACACCCCGC TGCTGACCGG CGAGACCATC
GCCGGCCTGG TCGAGCGCCA TCAGGCGCTG GGCGCGGCGG CCACCGTTCT GTCGGCCTTC
GTCGCCGACC CGACCGGGTA CGGCCGCATC GTGCGCGGCG ACGGCGGCCA GGTGCGGGCG
ATCGTCGAGC AGCGGGACGC CGACCCCGCG ACCGCCGCGA TCAGGGAGAT CAACGCCGGC
GTCTACGTGT TCGACGTCGA GCTGTTGCAG TCCGCGCTGA AACGGCTCAC CACCGACAAC
GCGCAGGGCG AGGAGTACCT CACCGACGTG GTCGGCCTGC TCGTCGCGGA CGGCGAGCCG
ATCGGGGCGC ACGTCGTCGC CGACGCGGCC GAGGCCGGCG GAGTCAACGA CCGGGTGCAG
CTCAGCGAGG CCGGCCGCAC GCTGCGCGAA CGGATCACCC GCGCCGCAAT GCTCGGCGGG
GCCACGATCG TCGACCCGGT CACCACCTGG ATCGACGTCG ACGTCACCCT GGAGCCGGAC
ACGACCGTCT GGCCGAACAC GCACCTGCGT GGGGCCACGA CGATCGCCAC CGGGGCGGAG
GTCGGCCCGG ACTGCACCCT CATCGACACG GTCGTCGGCG CGGGAGCGCG TGTCGTCAGC
TCGGTGACCG AGCGCGCCGA GGTCGGTGCC GGCGCCGTCG TCGGGCCGTT CGCCCACCTG
CGCGCGGGGA CCAGGCTCGG CCGCAGCGGC AAGATCGGCG CCTTCGTCGA GACGAAGGCC
GCCGACATCG GCGATGAGTC CAAGGTCCCC CATCTCGCCT ACGTGGGCGA CGCGGTGGTC
GGCGAGCGCA GCAACATCGG CTGCACCACG GTGTTCGTCA ACTACGACGG GGTCGCCAAG
CACCGCACGG TGATCGGTTC GGACGTCCGG ATCGGCAGCG ACACCATGCT GGTGGCCCCG
GTGACCGTGG GGGACGGCGC CTACACCGGC GCCGGCGCCG TCATCAGGGA GGACGTCCCA
CCGGGAGCGC TCGCGATCAG GGAGGGCCGG CAGCGCAACG TGCCGGGCTG GGTGCTGCGC
CGCCGGCCGG ACAGCCCCGC GGCGCAGGCC GCGCTGCGCG CCCGGCAGCA TGAGGCGGAA
ACGGCCGAGC CGGCCGGACA CAAGCAGGCC ACGCCCGAGC CGGCCGGGCA GGAGCCCGGC
CAGCCCGGGC AGTCGGGCGC GTGA
 
Protein sequence
MTQPRPAAVI VLAAGEGRRM RSGTPKVLHR LAGLTLLEHV LRATEPVGAP RAVVVVGHGR 
EQVAAMLAER APHVITAVQE HQGGTGHAVR AALAALDGLP PDASVVVLPG DTPLLTGETI
AGLVERHQAL GAAATVLSAF VADPTGYGRI VRGDGGQVRA IVEQRDADPA TAAIREINAG
VYVFDVELLQ SALKRLTTDN AQGEEYLTDV VGLLVADGEP IGAHVVADAA EAGGVNDRVQ
LSEAGRTLRE RITRAAMLGG ATIVDPVTTW IDVDVTLEPD TTVWPNTHLR GATTIATGAE
VGPDCTLIDT VVGAGARVVS SVTERAEVGA GAVVGPFAHL RAGTRLGRSG KIGAFVETKA
ADIGDESKVP HLAYVGDAVV GERSNIGCTT VFVNYDGVAK HRTVIGSDVR IGSDTMLVAP
VTVGDGAYTG AGAVIREDVP PGALAIREGR QRNVPGWVLR RRPDSPAAQA ALRARQHEAE
TAEPAGHKQA TPEPAGQEPG QPGQSGA
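
The sequence blocks above are grouped in tens and wrapped at 60 characters, so they need cleaning before any computation. The sketch below is one way to strip that layout, compute the GC fraction, and translate a coding sequence with the bacterial genetic code (translation table 11). The function names `clean_sequence`, `gc_fraction` and `translate_table11` are hypothetical helpers written for this example, not part of any database tool. Treating an alternative start codon such as GTG as methionine in the first position is an assumption about how the listed protein was derived.

```python
from itertools import product

# Amino acids for translation table 11, in NCBI order:
# codons enumerated with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for table 11; as a first codon they are read as M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean_sequence(text):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate_table11(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as formatted in the record.
fragment = clean_sequence("GTGACTCAGC CACGTCCGGC GGCTGTCATC")
print(translate_table11(fragment))  # prints "MTQPRPAAVI"
```

The translated fragment matches the first ten residues of the protein sequence listed above. Pasting the full gene sequence into `clean_sequence` allows the same comparison for the whole record, along with a check of the reported GC content.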