Gene Acel_0447 details

Gene Information

Locus tag: Acel_0447
Symbol:
ID: 4485197
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 480191
End bp: 481336
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 70%
IMG OID: 639729214
Product: glycosyl transferase, group 1
Protein accession: YP_872207
Protein GI: 117927656
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
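
The Start bp, End bp, Gene Length and Protein Length fields above are arithmetically related, and the relationship can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive (an assumption, though it agrees with the reported length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 480191
end_bp = 481336


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                           # 1146
print(expected_protein_length(length_bp))  # 381
```

Both results agree with the Gene Length (1146 bp) and Protein Length (381 aa) listed in the record.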


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.429575
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCGGT TGCGCGTCGC GGTGAACTTG CTCTGGTGCG TGCCCGGCGT GGTCGGCGGC 
ACCGAACGGT ATGCGGTCGA CCTGCTCAAC GCCCTCCGCG GCCGCGACGA CCTGGCGCTC
ACGCTTTTTG CCACGCCAGA ATTCACCGGC CGCTACCCGG AATTCGCCCG CCATTGCATC
ACCGCGCCGC TGCCCGCCGG CCGACACGTC GTCCGCCGGG TCGCGGTCGA GCACAGCTGG
CTCGCTCTCC GCCTCCGGTC GGGGGATTTC GACGTCGTCC ACCATCTCGG CGGGCTGGTG
CCGACCGCGC CCGTCCCGGC GGTCGTGACC ATTCACGATT TGCAATACCT GGTGTATCCG
CGGAATTTCT CAATTCTCAA ACGCGCGTAT TTGCGGGCCG CGCAAGGCCG GGCGGTGCGC
CGGGCCCGGG TAGTCTGCAC GGTGAGCGAG TTCACCGGCC GGCACGTCAG GGCCGCATTT
CCGGCCGCGG GCCGGGTCGT CGTCATTCCG CCGCTGCTCC TTCCCCCACC AGAACCGACG
GACGCCGACC GGGAGGCCGT CGATGCGTTG CTCCGCAACG TCGGGACGTT CATCCTCTAT
CCGGCCGCGT TCTACCCGCA CAAGAATCAC CGCGTGCTCA TCGAGGCTTT CGCGCGGTTC
GCTCACCGCC GCGCGGTGCA GCTCGTTTTC ACCGGCGCCG CCGGGGCTGG GGCGTGGGGG
TCGGCGCGGT CGACGGAATC GGAAATCCGT GCGCTGGCGG CTCGGCATCG CCTCAACGAC
CAGGTGAAAT TCTTCGGCCA CCTGCCCCGG CCGCACCTTG TCGAACTGTA CCGGCGGGCG
GCTGTGCTTG CTTTTCCGTC CCGTTTCGAG GGTTTCGGCT TGCCAGTCCT GGAGGCGATG
GCGCACGGCG TCCCGGTGGC CGCCGCCCGG GCCGCGGCGT TGCCGGAGCT GGTCGGTGAC
GCCGGGCTGC TCGTCGATCC GGACGATATC CTCGGCTGGG CGGACGCTCT GGAGCGACTG
CTAGACGACG ACGCCGAACG GTGCCGCTGC GCCGACGCGG GCCGACGCCG GGCGGCCGAG
TTCGCCGCGC CGCGCAGCGT CGACCGGCAG GTGGCGGTGT ACCGGGAGGT GGCCGAGCGG
AGATGA
 
Protein sequence
MTRLRVAVNL LWCVPGVVGG TERYAVDLLN ALRGRDDLAL TLFATPEFTG RYPEFARHCI 
TAPLPAGRHV VRRVAVEHSW LALRLRSGDF DVVHHLGGLV PTAPVPAVVT IHDLQYLVYP
RNFSILKRAY LRAAQGRAVR RARVVCTVSE FTGRHVRAAF PAAGRVVVIP PLLLPPPEPT
DADREAVDAL LRNVGTFILY PAAFYPHKNH RVLIEAFARF AHRRAVQLVF TGAAGAGAWG
SARSTESEIR ALAARHRLND QVKFFGHLPR PHLVELYRRA AVLAFPSRFE GFGLPVLEAM
AHGVPVAAAR AAALPELVGD AGLLVDPDDI LGWADALERL LDDDAERCRC ADAGRRRAAE
FAAPRSVDRQ VAVYREVAER R
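
The correspondence between the gene sequence and the protein sequence above can be spot-checked by translating codons. The following is a minimal sketch, not the tool used to produce this record. It builds the codon table from the conventional TCAG ordering; the amino acid assignments of translation table 11 are the same as those of the standard code. The sketch does not handle alternative start codons, which is acceptable here only because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...),
# paired with the amino acid assigned to each; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
first_30 = "ATGACCCGGT TGCGCGTCGC GGTGAACTTG"
print(translate(first_30))  # MTRLRVAVNL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` gives the complete comparison.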