Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_1800 |
Symbol | |
ID | 4485699 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | - |
Start bp | 2040611 |
End bp | 2042491 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639730590 |
Product | thiamine pyrophosphate binding domain-containing protein |
Protein accession | YP_873558 |
Protein GI | 117929007 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3962] Acetolactate synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.749934 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTAAGA CCGTTCGGCT CACCGTCGCC CAAGCACTCA TCCGGTTTCT GCGCGCCCAG TACAGCGAAC GCGACGGTTC CCGCCAGCGA CTCATTGCCG GGTGTTTCGG CATTTTCGGT CACGGCAACG TTGCCGGAAT CGGTGAAGCG CTGCTTGAGG AAGAAATCGC GAACCCGGGC CTGTTCCCGT ATTACCAGGC ACGCAATGAG CAGGCGATGG TGCACATCGC CGCCGAATAT GCGCGGATGA CGAATCGTCT CTCCACCTTG GCCTGCACGA CGTCGATTGG ACCGGGTGCG ACCAACCTTG TGACCGGGGC GGCACTCGCC ACGATCAATC GGCTTCCGGT GCTGCTGTTG CCCGGAGACA TTTTTGCCAC CAGGGTCGCT GATCCGGTTC TTCAGCAGCT GGAAACCCGC TGGGGTTTTG ACGTCTCCGT CAACGACACG CTGCGACCCG TCTCGCGGTA CTTCGATCGC ATCTGGCGTC CGGAGCAGCT TCCCCGCAGC CTGCTCGCCG CGATGCAGGT GCTCACCGAT CCGGCGGAAA CCGGCGCGGT GACGCTCGCA CTTCCGCAAG ACGTTCAGGC CCAGGCTTAC GACTGGCCCG TTGAACTCTT CGCCGAACGC GTCTGGCCGA TACGCCGGAC GCCGCCGGAT CCGGCTGCGC TTCGCCGTGC CGTTGAGGCT GTCACGGCGT CGTCCCGTCC GCTCATAGTC GCCGGCGGAG GCGTCATTTA TTCCGACGCG ACCGACGCGC TCCGTGCTTT CGTCGATGCC ACCGGCATTC CGGTCGCCGT GACGCAAGCC GGAAAGCCGG CTATCCGTTA CGACCATCCC CTGTGCCTTG GGGCAATCGG CTCGACCGGA ACGCCTGCTG CCAATGCGAT GGCGTCGACC GCGGATCTCG TGCTCGGGAT CGGCACCCGG TTTGGCGATT TCACCACCGC GTCCTCGACC ATCTTCGGCG CGCCGGACGT CCGCTTCGTC CACCTCAATA TCACGCCATC GGACGCCGCG AAGCTTGCCG CGTTACCGGT CGTCGCCGAT GCCCGGGCCG GCCTCGACGC GTTGCGTGAG GCGCTTGCCG GTTGGCGGGT GCCCGAGGAA TACAGCGCCG AAGCGCGCCG GCAGGCCGAC GCATGGCGGG CAACGGTCTT TGACGTGTAT CGGCAACGCA GTTCCGGGCT GCCCCGGCAG AGTGAGGTCA TTGGTGCCGT CGGTGAGGCT GCTGGTGAGA CAGGCACGGT GATTTGTGCC GCAGGGTCTC TGCCGGGCGA CCTTCATAAG CTGTGGCGGG CCAGGGATCC CCGCAGCTAT CACGTGGAGT ACGGCTACTC CTGCATGGGC TTTGAGATTC CGGCCGGCAT CGGGGCGAAA TTGGCGTGCC CGGACCGGGA GATTTTCGTA ATGGTTGGTG ACGGCTCCTA TCTCATGCTT CCCGGTGAGC TGGTCACCGC GGTGCAGGAG CAGATCAAAA TCATCGTTGT TCTTCTGGAC AACCACGGTT TTGCATCGAT CGGCTCGTTG TCCGAAAGCG TGGGGGGTCA ACGCTTCGGC ACGAATTATC GCTACCGCAG CCGTCTCACC GGCCGGCTGG ACGGTGACAT CCTGCCGATC GATTTCACCG CGAATGCCGC CAGTCTCGGC ATGGTGGCCT GGCGGGCGGA GACGGTCGAC GAACTCCGCG GGCTCCTCGA GAAAGCGAAA GCCGAGACAC GTCCGGTGCT CATCTGCATC GAGACCGATC CGACGCCCTC CGCGCCCGAC AGCGAGGCGT GGTGGGACGT GCCGGTCGCC GAGGTCTCGG ACCTGGAGTC CACGCGTGCC GCCTACGCCG AATACCAAGC GGCAAAGGCC CGGCAGCGCC CATACCTGTG A
|
Protein sequence | MAKTVRLTVA QALIRFLRAQ YSERDGSRQR LIAGCFGIFG HGNVAGIGEA LLEEEIANPG LFPYYQARNE QAMVHIAAEY ARMTNRLSTL ACTTSIGPGA TNLVTGAALA TINRLPVLLL PGDIFATRVA DPVLQQLETR WGFDVSVNDT LRPVSRYFDR IWRPEQLPRS LLAAMQVLTD PAETGAVTLA LPQDVQAQAY DWPVELFAER VWPIRRTPPD PAALRRAVEA VTASSRPLIV AGGGVIYSDA TDALRAFVDA TGIPVAVTQA GKPAIRYDHP LCLGAIGSTG TPAANAMAST ADLVLGIGTR FGDFTTASST IFGAPDVRFV HLNITPSDAA KLAALPVVAD ARAGLDALRE ALAGWRVPEE YSAEARRQAD AWRATVFDVY RQRSSGLPRQ SEVIGAVGEA AGETGTVICA AGSLPGDLHK LWRARDPRSY HVEYGYSCMG FEIPAGIGAK LACPDREIFV MVGDGSYLML PGELVTAVQE QIKIIVVLLD NHGFASIGSL SESVGGQRFG TNYRYRSRLT GRLDGDILPI DFTANAASLG MVAWRAETVD ELRGLLEKAK AETRPVLICI ETDPTPSAPD SEAWWDVPVA EVSDLESTRA AYAEYQAAKA RQRPYL
|
| |