Gene Acel_1800 details


Gene Information

Locus tag: Acel_1800
Symbol:
ID: 4485699
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 2040611
End bp: 2042491
Gene Length: 1881 bp
Protein Length: 626 aa
Translation table: 11
GC content: 65%
IMG OID: 639730590
Product: thiamine pyrophosphate binding domain-containing protein
Protein accession: YP_873558
Protein GI: 117929007
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3962] Acetolactate synthase
TIGRFAM ID:
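
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a protein length that excludes a single terminal stop codon; both are assumptions about how the record is reported, and the function names are illustrative only.

```python
# Values copied from the Gene Information record.
START_BP = 2040611
END_BP = 2042491
GENE_LENGTH_BP = 1881
PROTEIN_LENGTH_AA = 626


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# 2042491 - 2040611 + 1 = 1881 bp; 1881 / 3 - 1 = 626 aa
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions the start, end, gene length, and protein length fields agree with one another.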


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.749934
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAAGA CCGTTCGGCT CACCGTCGCC CAAGCACTCA TCCGGTTTCT GCGCGCCCAG 
TACAGCGAAC GCGACGGTTC CCGCCAGCGA CTCATTGCCG GGTGTTTCGG CATTTTCGGT
CACGGCAACG TTGCCGGAAT CGGTGAAGCG CTGCTTGAGG AAGAAATCGC GAACCCGGGC
CTGTTCCCGT ATTACCAGGC ACGCAATGAG CAGGCGATGG TGCACATCGC CGCCGAATAT
GCGCGGATGA CGAATCGTCT CTCCACCTTG GCCTGCACGA CGTCGATTGG ACCGGGTGCG
ACCAACCTTG TGACCGGGGC GGCACTCGCC ACGATCAATC GGCTTCCGGT GCTGCTGTTG
CCCGGAGACA TTTTTGCCAC CAGGGTCGCT GATCCGGTTC TTCAGCAGCT GGAAACCCGC
TGGGGTTTTG ACGTCTCCGT CAACGACACG CTGCGACCCG TCTCGCGGTA CTTCGATCGC
ATCTGGCGTC CGGAGCAGCT TCCCCGCAGC CTGCTCGCCG CGATGCAGGT GCTCACCGAT
CCGGCGGAAA CCGGCGCGGT GACGCTCGCA CTTCCGCAAG ACGTTCAGGC CCAGGCTTAC
GACTGGCCCG TTGAACTCTT CGCCGAACGC GTCTGGCCGA TACGCCGGAC GCCGCCGGAT
CCGGCTGCGC TTCGCCGTGC CGTTGAGGCT GTCACGGCGT CGTCCCGTCC GCTCATAGTC
GCCGGCGGAG GCGTCATTTA TTCCGACGCG ACCGACGCGC TCCGTGCTTT CGTCGATGCC
ACCGGCATTC CGGTCGCCGT GACGCAAGCC GGAAAGCCGG CTATCCGTTA CGACCATCCC
CTGTGCCTTG GGGCAATCGG CTCGACCGGA ACGCCTGCTG CCAATGCGAT GGCGTCGACC
GCGGATCTCG TGCTCGGGAT CGGCACCCGG TTTGGCGATT TCACCACCGC GTCCTCGACC
ATCTTCGGCG CGCCGGACGT CCGCTTCGTC CACCTCAATA TCACGCCATC GGACGCCGCG
AAGCTTGCCG CGTTACCGGT CGTCGCCGAT GCCCGGGCCG GCCTCGACGC GTTGCGTGAG
GCGCTTGCCG GTTGGCGGGT GCCCGAGGAA TACAGCGCCG AAGCGCGCCG GCAGGCCGAC
GCATGGCGGG CAACGGTCTT TGACGTGTAT CGGCAACGCA GTTCCGGGCT GCCCCGGCAG
AGTGAGGTCA TTGGTGCCGT CGGTGAGGCT GCTGGTGAGA CAGGCACGGT GATTTGTGCC
GCAGGGTCTC TGCCGGGCGA CCTTCATAAG CTGTGGCGGG CCAGGGATCC CCGCAGCTAT
CACGTGGAGT ACGGCTACTC CTGCATGGGC TTTGAGATTC CGGCCGGCAT CGGGGCGAAA
TTGGCGTGCC CGGACCGGGA GATTTTCGTA ATGGTTGGTG ACGGCTCCTA TCTCATGCTT
CCCGGTGAGC TGGTCACCGC GGTGCAGGAG CAGATCAAAA TCATCGTTGT TCTTCTGGAC
AACCACGGTT TTGCATCGAT CGGCTCGTTG TCCGAAAGCG TGGGGGGTCA ACGCTTCGGC
ACGAATTATC GCTACCGCAG CCGTCTCACC GGCCGGCTGG ACGGTGACAT CCTGCCGATC
GATTTCACCG CGAATGCCGC CAGTCTCGGC ATGGTGGCCT GGCGGGCGGA GACGGTCGAC
GAACTCCGCG GGCTCCTCGA GAAAGCGAAA GCCGAGACAC GTCCGGTGCT CATCTGCATC
GAGACCGATC CGACGCCCTC CGCGCCCGAC AGCGAGGCGT GGTGGGACGT GCCGGTCGCC
GAGGTCTCGG ACCTGGAGTC CACGCGTGCC GCCTACGCCG AATACCAAGC GGCAAAGGCC
CGGCAGCGCC CATACCTGTG A
 
Protein sequence
MAKTVRLTVA QALIRFLRAQ YSERDGSRQR LIAGCFGIFG HGNVAGIGEA LLEEEIANPG 
LFPYYQARNE QAMVHIAAEY ARMTNRLSTL ACTTSIGPGA TNLVTGAALA TINRLPVLLL
PGDIFATRVA DPVLQQLETR WGFDVSVNDT LRPVSRYFDR IWRPEQLPRS LLAAMQVLTD
PAETGAVTLA LPQDVQAQAY DWPVELFAER VWPIRRTPPD PAALRRAVEA VTASSRPLIV
AGGGVIYSDA TDALRAFVDA TGIPVAVTQA GKPAIRYDHP LCLGAIGSTG TPAANAMAST
ADLVLGIGTR FGDFTTASST IFGAPDVRFV HLNITPSDAA KLAALPVVAD ARAGLDALRE
ALAGWRVPEE YSAEARRQAD AWRATVFDVY RQRSSGLPRQ SEVIGAVGEA AGETGTVICA
AGSLPGDLHK LWRARDPRSY HVEYGYSCMG FEIPAGIGAK LACPDREIFV MVGDGSYLML
PGELVTAVQE QIKIIVVLLD NHGFASIGSL SESVGGQRFG TNYRYRSRLT GRLDGDILPI
DFTANAASLG MVAWRAETVD ELRGLLEKAK AETRPVLICI ETDPTPSAPD SEAWWDVPVA
EVSDLESTRA AYAEYQAAKA RQRPYL
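
As a spot check that the protein sequence corresponds to the gene sequence, the first and last blocks of the gene can be translated by hand. This is a minimal sketch, not the database's own method: it builds the codon table from the NCBI-style 64-letter amino acid string (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in permitted start codons), and the helper name `translate` is hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring spaces and newlines."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 nt of the gene sequence (10 codons).
print(translate("ATGGCTAAGA CCGTTCGGCT CACCGTCGCC"))  # MAKTVRLTVA
# Last 21 nt of the gene sequence, ending in the TGA stop codon.
print(translate("CGGCAGCGCC CATACCTGTG A"))           # RQRPYL*
```

Both fragments match the corresponding ends of the protein sequence (`MAKTVRLTVA` at the N-terminus, `RQRPYL` at the C-terminus).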