Gene Acel_1888 details

Gene Information

Locus tag: Acel_1888
Symbol:
ID: 4486145
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 2134072
End bp: 2135313
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 70%
IMG OID: 639730678
Product: threonine dehydratase
Protein accession: YP_873646
Protein GI: 117929095
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1171] Threonine dehydratase
TIGRFAM ID: [TIGR01127] threonine dehydratase, medium form
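
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not say so, but the numbers are consistent with it) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2134072
END_BP = 2135313
GENE_LENGTH_BP = 1242
PROTEIN_LENGTH_AA = 413


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1242
print(expected_protein_length(GENE_LENGTH_BP))  # 413
```

Both results agree with the Gene Length and Protein Length fields, which is a quick sanity check when copying a record like this into another tool.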


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.795284
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGGCA CGCCCGACGA CTCCATTGCG CCCTGGATCA CCGCTACTCC CACGGAGTGG 
TTGGAGAACG CCCGGGCGGC CGGCACGCTG CTGGACGGCG TTGCCCGCCG CACCCCGATT
GAGCTGTCCC GACGGGCGTT GATCGGCTCG CCGACTGCCC TGAAGTGCGA GAACATGCAG
TGCGGCGGTT CGTTCAAAAT CCGGGGCGCG TACGTGCGCA TTGCGCGGAT GCCGCCTGAC
CAGCGGGATC GCGGCGTCGT GGCCGCGAGC GCCGGGAATC ACGCGCAGGG GGTCGCCTTG
GCGGCCGGTC TACTCGGTAC CCGCGCCACC GTCTTCATGC CGCGCGACGC GTCCCTGCCG
AAAGTGGCGG CGACCCGCGG TTACGGCGCC GACGTGCGAC TTGTCGGGCA TTCCGTCGAT
GAGGCGCTCG TCGCGGCGCA GGAGTACGCC GCACAGACGG GAGCCGCCTT CATCCACCCG
TTCGACCACC CCGACGTGAT TGCGGGTCAG GCGACGGTCG GCATCGAAAT TCTCGAGCAA
TACCCCGACG TCCGCACGAT CGTGGTCAGT GCCGGCGGAG GCGGCCTTGC CGCAGGCATT
GCCGTCGTGG TCAGAGCACT TCGGCCGGAT GTGCGGGTGG TCGCGGTGCA GGCCCAACAG
GCCGCGGCCC TCGCCGCCTC GGTCCGCGCC GGGCATCCGA TAGCGCTTTC CGACGCCTCG
ACGATGGCCG ACGGCATCGC GGTCCGACGT CCCGGCTCGC TCACCCTGCC GCTGGTCAGC
CGGTACGTCG ACGAGATCCG CACGGTGAGC GAAGAGGCCA TTGCGCATGC CGTGTTGGCC
TGTTTGGAGC GGGCCAAACT CGTCGTTGAA CCGGCGGGGG CGGCAGCGTT GGCTGCGGTG
ATGGATGATT CCAGCGCCTT TCCATCGCCG GTGGTTGCCG TCCTCTCCGG CGGGAACGTC
GATCCGATCG TTCTGCTTCG CATCCTGCGC CATGGGTTGG CGGCGGCCGG CCGCTACCTG
ACCTTTCACG TCACGTTGCC CGATCGTCCC GGCGCGCTCG CCGCTCTGCT CACGGCCCTT
GCCGCGGTTG AGGCGAACGT GCTGGACGTC GTCCACGAGC GGACGCAGGC CGCGCTCGCT
GTCGACGAGG TGGAGGTCGC GCTGCAGGTG GAGACTCGCG GTCAGGAGCA CTCGCAGGAG
GTGCTCGAGA CCGTTGCGCG GCTGGGGTAC GCCGTGTCTT GA
 
Protein sequence
MAGTPDDSIA PWITATPTEW LENARAAGTL LDGVARRTPI ELSRRALIGS PTALKCENMQ 
CGGSFKIRGA YVRIARMPPD QRDRGVVAAS AGNHAQGVAL AAGLLGTRAT VFMPRDASLP
KVAATRGYGA DVRLVGHSVD EALVAAQEYA AQTGAAFIHP FDHPDVIAGQ ATVGIEILEQ
YPDVRTIVVS AGGGGLAAGI AVVVRALRPD VRVVAVQAQQ AAALAASVRA GHPIALSDAS
TMADGIAVRR PGSLTLPLVS RYVDEIRTVS EEAIAHAVLA CLERAKLVVE PAGAAALAAV
MDDSSAFPSP VVAVLSGGNV DPIVLLRILR HGLAAAGRYL TFHVTLPDRP GALAALLTAL
AAVEANVLDV VHERTQAALA VDEVEVALQV ETRGQEHSQE VLETVARLGY AVS
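
The gene and protein sequences above are printed in space-separated blocks of ten, so they need cleaning before use. The following is a minimal sketch, not the database's own method: it strips the whitespace, translates the DNA codon by codon, and computes a GC fraction. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon assignments in TCAG order; shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from this record.
first_line = "ATGGCCGGCA CGCCCGACGA CTCCATTGCG CCCTGGATCA CCGCTACTCC CACGGAGTGG"
print(translate(first_line))  # MAGTPDDSIAPWITATPTEW
```

The output matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.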