Gene Acel_0479 details


Gene Information

Locus tag: Acel_0479
Symbol:
ID: 4484801
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 513034
End bp: 514059
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 61%
IMG OID: 639729246
Product: glycosyl transferase family protein
Protein accession: YP_872238
Protein GI: 117927687
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
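
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that does not contribute a residue. The variable and function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 513034
end_bp = 514059
gene_length_bp = 1026
protein_length_aa = 341

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, so N codons encode N - 1 residues.
codons = span_bp // 3


def record_is_consistent() -> bool:
    """Return True if the span, gene length, and protein length agree."""
    return (
        span_bp == gene_length_bp
        and span_bp % 3 == 0
        and codons - 1 == protein_length_aa
    )


print(record_is_consistent())
```

Under these assumptions the span is 1026 bp, which is 342 codons, consistent with a 341 aa product plus one stop codon.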


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCTGA GCGTGGTTAT CATCACCCGG GATGCGACGC GAGACTTGCG TAGGCTGCTT 
CGCTCACTCG AGTCTCAGGT GCTGCGGGTC GGGGCCGAGG TTATCGTTGT GGACGACGGT
TCACGGCGTC CCGACGCTGT TGCCAACGTA TGTGCAACCT TTGCCGGCCC GATAAGACTG
CTGAGGCGAG CGCGGGGCGA GGGATCGAGG GCGGCAAGCC GCAACCGGGG TATCGATGCT
GCGCGGGGCC GGGTCGTGCT TTTTCTGGAC TCCGATCAAG TCGCTCCTGA GACGCTGCTC
TCTGAGCACC TGCGATACCA CCGGGCCTTC GACCGCGCTT GTGTCATCGG CTTCAGGCGG
CATGCCGGTC CGGGCCGGAC AGTCCAATGC CTTCGACCTG AGGTTCGAAC GCGTGTGACG
TCTCGGTGGT CGGAGAACAT CCGTGCCTTA GCGAGTGCGT GGTACCTGAC TTTCACCTGC
AACCTGTCTG TCACGCGCGA CGTGCTTGTT GACATTCATG GATTCGACGA AGGGTTCGTG
GGCTGGGGCC TGGAAGACAG CGAACTCGGG CTGAGAGCGT GGCAGCACGG CGCGGTCATC
GTTCACAACC CGTACGCATG GACCATCGAC TACGGGCACG TGGTGCGGAC GGACCCGCAG
CGCATGCAGG AATGGCAAGC CAATCGCTCG CATTTCCTTC ACAAACACGC AAAGGACGCC
GGGAGTGCAT TGGCGCTCTT AGACAATTAT CCACGCGGCC CGGGATCGCT GGGATTTCGG
TGGTTAGAGT CATACGAGCG TTTTGAAAGA CAGTGCCGCG TGAACCTCGG CAGACCGGAG
TCGACGCCTG CAGCGCCGGC GTCTGTCACC GTGATAAATC ACGACGATCT CAACGAGGTG
AGGGAACGGA TCTCGCACGG CGAGTCCTTG GACATTATTG ATTTCTTGCC CTCGAGCGGA
CTCGATCTTG AGATCCAGTT GTCGCCTGCC GAACGCATCC GCTATTGGGT GGGTGGCCGA
TGGTGA
 
Protein sequence
MDLSVVIITR DATRDLRRLL RSLESQVLRV GAEVIVVDDG SRRPDAVANV CATFAGPIRL 
LRRARGEGSR AASRNRGIDA ARGRVVLFLD SDQVAPETLL SEHLRYHRAF DRACVIGFRR
HAGPGRTVQC LRPEVRTRVT SRWSENIRAL ASAWYLTFTC NLSVTRDVLV DIHGFDEGFV
GWGLEDSELG LRAWQHGAVI VHNPYAWTID YGHVVRTDPQ RMQEWQANRS HFLHKHAKDA
GSALALLDNY PRGPGSLGFR WLESYERFER QCRVNLGRPE STPAAPASVT VINHDDLNEV
RERISHGESL DIIDFLPSSG LDLEIQLSPA ERIRYWVGGR W