Gene Acel_1947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1947 
Symbol 
ID4486366 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp2204965 
End bp2206482 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content69% 
IMG OID639730738 
ProductUDP-N-acetylglucosamine pyrophosphorylase / glucosamine-1-phosphate N-acetyltransferase 
Protein accessionYP_873705 
Protein GI117929154 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 
TIGRFAM ID[TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAA CCTCCCGCCC GGCCGCGGCG ATCGTGCTGG CCGCCGGTGA AGGCACCCGG 
ATGCGCTCGA CTCGGCCCAA GGCGCTCTTC CCGATCCTTG GCCGCAGCCT TCTCGGCCAC
GTGCTCGCCG CCGTCCGAGC CCTCGATCCG GAGGAACTCG TCGTCGTCAC CGGTCACCGG
CGCGCGGAGG TCGAAGCCCA CCTCGCCGAG ATCGACCCTG CCGCGAAGCC GGTCTACCAG
ACCGAGCAGC GCGGCACCGG TCACGCCGTC CGGGCGGCCC TTGAGGCGCT CGACGCTGAA
CGGCGCGCCG CCGGCCGTCC CACCCTCACC GGCACGGTCC TCGTCACCGC GGCCGACACC
CCGCTGCTCA CCCCCCGCTC CCTCGCCGCG ATGGTTGCGC ATCGGGAGCA GACCGGGGTC
GCCGGCGTGC TCCTTACCGC GACAATGCCC GATCCCACCG GCTACGGCAG AGTCCTGCGC
GACGACTCCG GCCGGGTCCG GGGCATCGTC GAAGAAAGGG ACGCCGATCC GGCGCACCGG
GAGATACGTG AAGTCAACGC TGGCGTCTAC GCCTTTGACG CAGCAAAAGT ACGGGCCGCG
CTGGCCCGCC TCACCACCGA CAACGCGCAA GGCGAGGAGT ACCTCACGGA CGTCGTTCGG
ATCTTTGGCA CCGAGGGCGA GCCGATTGCC GCCGCCGTCC TCGATGACTG GCGCGAAATC
CTCGGCGTCA ACGATCGAGC ACAACTTGCT GTCGCCGCCG CACTCCTTCG CGATCGGAAA
AACACCGCGT TGATGCGTGC CGGGGTGACC ATCATGGACC CGGCGACCAC CTGGATCGAC
GTCGACGTCG ATGTCGCGCC GGACGCCGAA ATCTGGCCGA ATACGATCCT GGCCGGGACG
ACGCGGGTTG CGGCGTCCGC TCGGATCGGC CCCAACTGCC ATCTGATCGA CACCGAGGTC
GGGGAACGGG CCCGAGTTCG GGACGCCACC TGCGAAAACG CGCAGATCGG TCCGGACGCC
GAGGTCGGGC CGTACACCTA TTTGCGCCCA GGCACGCGAC TGGGGCGCGG CGCGAAAGCC
GGCGGATTCG TGGAGATGAA GAACGCTGTC GTCGGCGCGG AATCGAAGGT GCCGCATTTG
TCGTACGTGG GCGACGCGAC GATCGGTGAG CGGACCAACG TAGGCGCGGC GACGGTTTTT
GTGAATTACG ACGGGGTCGC CAAGCATCAT TCCGTCGTCG GGAACGACGT CCGAATCGGC
AGCGACACGA TGATCGTCGC TCCGGTGACG ATCGGCGACG GGGCGTACAC CGCCGCGGGT
TCCGTGATCG TCGAGGATGT GCCGCCGGGT GCGCTCGCGA TCGCCCGCTC CCGGCAACAG
AACATCGAAG GATGGGTGGT CCGGAAGCGG CCGGGTACCC GCGCGGCGGA GGCAGCGAAG
CAGGCTTCGG CAGCGGGCGG ACCGCCGGCG GCGGGCGGTA CGTCCGGCCG GGACGCAGGC
GACGATGCAT CGCGCTAG
 
Protein sequence
MSETSRPAAA IVLAAGEGTR MRSTRPKALF PILGRSLLGH VLAAVRALDP EELVVVTGHR 
RAEVEAHLAE IDPAAKPVYQ TEQRGTGHAV RAALEALDAE RRAAGRPTLT GTVLVTAADT
PLLTPRSLAA MVAHREQTGV AGVLLTATMP DPTGYGRVLR DDSGRVRGIV EERDADPAHR
EIREVNAGVY AFDAAKVRAA LARLTTDNAQ GEEYLTDVVR IFGTEGEPIA AAVLDDWREI
LGVNDRAQLA VAAALLRDRK NTALMRAGVT IMDPATTWID VDVDVAPDAE IWPNTILAGT
TRVAASARIG PNCHLIDTEV GERARVRDAT CENAQIGPDA EVGPYTYLRP GTRLGRGAKA
GGFVEMKNAV VGAESKVPHL SYVGDATIGE RTNVGAATVF VNYDGVAKHH SVVGNDVRIG
SDTMIVAPVT IGDGAYTAAG SVIVEDVPPG ALAIARSRQQ NIEGWVVRKR PGTRAAEAAK
QASAAGGPPA AGGTSGRDAG DDASR