Gene Acel_0073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0073 
Symbol 
ID4484665 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp78012 
End bp79358 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content69% 
IMG OID639728835 
Productglycosyl transferase, group 1 
Protein accessionYP_871835 
Protein GI117927284 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID[TIGR03449] UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGCGTT GGCTGTCGGC TCCCGAGCAT GCACGGCACC GCACGGTCGC TGCTTCGGCG 
GGTGCTCCGT GGACGCCGCG ACGCGTCGCG ATGATCGCCG TCCACACCTC GCCGCTGGAG
ATCCCTGGGT GTGGCGATGC CGGCGGCCTC AACGTGTACG TGGCGCAAAT CGCCCGGCGG
TTGGCATCCC GCGGTATTGA CGTTGACGTG TTCACCCGGG CCACCCGGCG GGACCTACCG
CCGCAGCAAC GACTCGCTCC GGGGGTCACC GTCCGCAATG TCGTCGCCGG GCCGTTGGAA
CCGCTGCCCA AAGACGAGCT TCCGGTGCAT CTGTGTGCGT TCACCGCCGC GGTGCTTCGT
GCGGAGGCGA TGCGTGAACC CGGCTGGTAC GACGTCATCC ACTCCCATTA CTGGCTCTCC
GGTGAGGTTG GCCGGGTGGC GAGCCAGCGG TGGGGGGTGC CCCTCGTCCA CACCATGCAC
ACCTTGGCGA AAGTCAAGAA CGCCGCGCTC GCCGAAGGCG ATGTGCCGGA ACCGGGCCGA
CGTGTCATCG GAGAAGCCGA CGTTGTCGCC GCCGCGGATC GGCTGGTGAC TAACACCTGG
ACTGAGGCGA GGCAGCTCGT TGACCTTTAC GGCGCCGAGC CCGATCGGAT CCGCGTGGTA
CCGCCCGGCG TCGAAACGGC GATCTTCCGG CCCGGTGACA GCGCCCGGGC TCGCCGTCGG
CTGGGCCTTC CCATCGACGG GTGCGTCGTC TTGTTCGTGG GACGCCTCCA ACCGCTCAAA
GGGCCGGACA TCGCGGTTCG CGCCGCAGCC GAATTTCTCT CCACCCATCC GGGAATGCGT
TCGACGTTCC GCCTCGTCAT TGTGGGAGGT CCGAGCGGGT CGCGCAGCAC GGAGCCGGAA
CGGTTGCGCG CGCTCGCTGC CGATCTCGGG GTTGCCGATG CCGTCATTTT CGCCCCTCCG
ATGCCGCCGG ATAGGCTCGT CGAGTTCTAT CGCGCCGCGA CGGTGACGAT TGTTCCGTCC
CACAGCGAAT CGTTTGGCTT GGTGGCACTG GAATCGCAAG CCTGCGGCAC GCCGGTGGTG
GCCGCCCGGG TCGGCGGTCT GACGACGGCG GTGCGGGACG GCGAGAGCGG TCTGCTCGTC
GACGGCCACG ACCCCGCGAG GTACGCCGGC GCAATCGGCC GCCTTCTCGA CCCGGGTCTG
CGGGCCGAGC TCGTCCGCGG CGCGGTCGCT CACGCCATGC GGTTTCACTG GGACAACACC
GTCGAGGGCA TTCTGGGCGT GTACCGGGAT GCCCTGGCTG AGCGCCGGAC CGCGGCGACA
CAGCGGCTGG CGAGCCGCGT GGGCTAG
 
Protein sequence
MPRWLSAPEH ARHRTVAASA GAPWTPRRVA MIAVHTSPLE IPGCGDAGGL NVYVAQIARR 
LASRGIDVDV FTRATRRDLP PQQRLAPGVT VRNVVAGPLE PLPKDELPVH LCAFTAAVLR
AEAMREPGWY DVIHSHYWLS GEVGRVASQR WGVPLVHTMH TLAKVKNAAL AEGDVPEPGR
RVIGEADVVA AADRLVTNTW TEARQLVDLY GAEPDRIRVV PPGVETAIFR PGDSARARRR
LGLPIDGCVV LFVGRLQPLK GPDIAVRAAA EFLSTHPGMR STFRLVIVGG PSGSRSTEPE
RLRALAADLG VADAVIFAPP MPPDRLVEFY RAATVTIVPS HSESFGLVAL ESQACGTPVV
AARVGGLTTA VRDGESGLLV DGHDPARYAG AIGRLLDPGL RAELVRGAVA HAMRFHWDNT
VEGILGVYRD ALAERRTAAT QRLASRVG