Gene Information |
Locus tag | Acel_0712 |
Symbol | |
ID | 4485132 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 779201 |
End bp | 780865 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639729480 |
Product | putative alpha-isopropylmalate/homocitrate synthase family transferase |
Protein accession | YP_872471 |
Protein GI | 117927920 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein |
|
|
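The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 779201
end_bp = 780865
gene_length_bp = 1665
protein_length_aa = 554

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus one terminating stop codon.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

coordinates_match_length = span_bp == gene_length_bp
length_is_whole_codons = span_bp % 3 == 0
protein_length_matches = expected_protein_length == protein_length_aa

print(coordinates_match_length, length_is_whole_codons, protein_length_matches)
```

All three checks hold for this record: 780865 - 779201 + 1 = 1665 bp, and 1665 / 3 = 555 codons, that is, 554 residues plus the stop codon.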
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGCG AACCCACCGG GCGGCACTCG GCGGCTGCGC CGGCCGGTTT TCAGGTCTAC GACACCACCC TGCGTGACGG CGCCCAGGGT GAAGGAATGG CCCTCACGGT GGCGGACAAG CTCGCCATTG CCCGGCACCT CGACGACCTC GGTGTCGGCT TCATTGAAGG CGGCTGGCCG GGCGCCCTGC CGAAAGACAC CGAATTCTTC CGCCGGGCGC GGACCGAATT GACCCTCCGC AACGCGGTGT TGGTCGCGTT CGGCGCGACC CGTAAGCCCG AGAGCAGGGT GGAAGCAGAT CCGCAGGTTC TCGCCCTCCT CGAAGCGGAG ACCCCGGCCG TCTGCCTGGT GGCCAAGAGC CACGTGCGTC ACGTCAGCGA GGCGCTCCGG ACCTCGTTGG ACGAGAACCT CGCGATGATC CGCGATACTG TCGCGTTCTT CCGCCGGGAA GGACGACGGG TTTTCCTCGA CGCCGAGCAC TTCTTCGACG GGTACGCCGC TGATCCGCAG TACGCCGTCG AGTGTGTCCG GGTCGCTGCG GAGTCCGGAG CCGAGGTCGT CGTCCTGTGT GACACCAACG GGGGAATGCT CCCGCCGCGG ATCGCTGACG TCGTGCACGA GGTTGCCGAG CGGACCGGCG TGCCGCTGGG CATCCACTGC CACGATGACA CCGGCTGCGC GGTCGCGAAC ACGCTCGCGG CGGTCGACGC CGGTGTCGTG CAGGTGCAGG GTGTCGTCAA CGGGTACGGC GAGCGGTGCG GCAACGCCAA CCTGATCACT GTGGTGGCCA ACCTCGAGCT GAAGATGGGC CGCCGGGTGC TGCCGCCCGG CCGACTGGCC GAGCTCGGCC GGGTGAGCCA CGCGATCGCG GAGGTCGCGA ACCAGCCGCC GCGGGCCAAC CAGCCGTACG TGGGGCTCTC GGCGTTCGCG CACAAGGCCG GCCTCCACGC CTCGGCGATC AAGGTGTCAC CGGATCTCTA CCAGCACATC GACCCGGCCT TGGTCGGCAA CGACATGCGG ATGCTCGTCT CGGAGATGGC CGGCCGGGCC AGTGTCGAGT TGAAGAGCCG CCAGCTCGGC TTTGACCTGT CCGGTCAGCG TGATGCGGTG AGCCGGATCG TCGAGCGGGT GAAGAATCTC GAAGCACGCG GGTTCATGTT CGAAGCCGCC GATGCCTCCT TCGAATTGCT GCTCCGTGAA GAGCTTGACG GCGTCCCGAC CCGGTTCTTC GACCTGGAGT CCTGGCGGGT CATCGTTGAG CGACGCGCGG ACGGCGAGGT GGTCTCGGAG GCCACCGTGA AGGTGGTGGT CAAGGGGGAG CGGATCGTCG CGACCGCGGA AGGCAACGGC CCGGTGAACG CGCTGGACCG GGCGTTGCGG CAGGCGCTGG AGCGGCTGTA CCCGCAGCTT GCCGAGCTCG AACTCGTCGA CTACAAGGTC CGCATCCTGG ACGGCTCGCA CGGCACCGGC GCGGTGACCC GGGTGCTGAT TGAGACCAGT GACGGCGAGA CCGAGTGGAC GACGATCGGC GTCGACGGCA ATGTGATCTC CGCTTCCTGG CAGGCCTTGG ACGACGCGTA CATGTACGGC TTGCTGCGTC AGCACGCCGG CTCGGGCGAC CCGGCTGCCC AGGGGGTGTC AACCGCCGGT CCGCGGACTC GATAG
|
Protein sequence | MSSEPTGRHS AAAPAGFQVY DTTLRDGAQG EGMALTVADK LAIARHLDDL GVGFIEGGWP GALPKDTEFF RRARTELTLR NAVLVAFGAT RKPESRVEAD PQVLALLEAE TPAVCLVAKS HVRHVSEALR TSLDENLAMI RDTVAFFRRE GRRVFLDAEH FFDGYAADPQ YAVECVRVAA ESGAEVVVLC DTNGGMLPPR IADVVHEVAE RTGVPLGIHC HDDTGCAVAN TLAAVDAGVV QVQGVVNGYG ERCGNANLIT VVANLELKMG RRVLPPGRLA ELGRVSHAIA EVANQPPRAN QPYVGLSAFA HKAGLHASAI KVSPDLYQHI DPALVGNDMR MLVSEMAGRA SVELKSRQLG FDLSGQRDAV SRIVERVKNL EARGFMFEAA DASFELLLRE ELDGVPTRFF DLESWRVIVE RRADGEVVSE ATVKVVVKGE RIVATAEGNG PVNALDRALR QALERLYPQL AELELVDYKV RILDGSHGTG AVTRVLIETS DGETEWTTIG VDGNVISASW QALDDAYMYG LLRQHAGSGD PAAQGVSTAG PRTR
|
| |
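The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that clean-up, together with a GC percentage and a stop-codon check. The function names are hypothetical helpers, not part of any database tool, and the stop codon set (TAA, TAG, TGA) is the one used by translation table 11.

```python
# Stop codons under translation table 11 (same as the standard code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Join the whitespace-separated blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in seq)
    return 100.0 * gc_count / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a whole number of codons and the last one is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Illustration on the first two blocks of the gene sequence only;
# this fragment is not the full gene, so its GC value differs from the record.
fragment = clean_sequence("ATGAGCAGCG AACCCACCGG")
print(len(fragment), gc_percent(fragment), ends_with_stop(fragment))
```

Applied to the full gene sequence, the same helpers could be used to compare the joined length with the Gene Length field and the GC percentage with the GC content field.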