Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0665 |
Symbol | |
ID | 7407089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 749942 |
End bp | 751600 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643715046 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_002572562 |
Protein GI | 222528680 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000174611 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAGTG ATACTGTCAA GAAAGGGTTT GAAAAGGCTC CTCAGCGTTC GCTTTTCAAA GCAATGGGAT ACACTGATGA AGAGATAAGA AGACCACTTA TTGCAGTTGT GAATTCATGG AATGAAGTTG TACCTGGACA CATTCATCTT GACAGAATTG CAGAGGCAGT GAAAGCTGGT ATCAGGCTTG CTGGTGCAAC TCCAATGGAG TTTAATGTCA TAGGTGTATG TGATGGTATC GCTATGGGTC ACATTGGCAT GAAGTATTCG CTCATCACAA GAGAGCTCAT TGCAGATTCA ATCGAGGCAA TGGTAATGGC ACACCAGTTT GATGGCATGG TCTTGATTCC AAACTGTGAC AAAATAGTCC CTGGAATGCT AATAGCAGCA GCAAGAGTAA ACATCCCTGC CATTTTAATA AGTGGTGGAC CTATGCTTGC GGGTAAAATT GGTGATAAGG TATGTGACCT TAACTCTGTA TTTGAAGGTG TAGGTGCATA CTCTGCAGGC AAGATTTCTG AAGAAGATTT ATATGCCTTA GAAGAAAATG CATGTCCTGG ATGTGGTTCA TGTTCTGGAA TGTTTACAGC AAACACCATG AACTGTTTGA GCGAGGTTTT GGGGCTTGCT CTTCCTGGAA ATGGAACAAT TCCGGCTGTA ATGGCAGCAC GCATCCGTCT TGCTAAAATG GCAGGTATGA AGATTGTTGA GCTTGTTGAA AAGGACATAA AACCGTCTGA TATTTTGACA GTTGAAGCAT TTGAAAATGC CTTAGCAGTT GACATGGCGC TTGGTGGGTC AACAAACACT ATCTTGCATC TTCCTGCTAT TGCAAATGAA GTTGGAATAA AGTTAAATCT TGATATAATA AACGCTATAA GTGATAGAAC ACCAAATCTT TGTAAGCTCT CACCGGCAGG ACAACATCAT ATTGAGGACC TTTACTTTGC AGGCGGCGTT CAGGCTGTTA TGAATGAGCT TTCTAAAAAA GGTTTGCTTC ATTTAAATCT TATGACAGTT ACAGGTAAAA CAGTTGGTGA GAATATTAAA GATGCAAATG TTAAGAATTA CAATGTCATA AGACCAATTG ACAATCCATA TTCTGAAACA GGCGGGCTTG TAATTGTGAG GGGTAACCTT GCACCAGATG GTGCTGTTGT CAAAAAAAGT GCTGTGCCAC CAAAGCTAAT GAAGCACAGA GGACCTGCGC GTGTGTTTGA AAGCGGTGAA GAGGTGTTTG AGGCAATCTT GAAAGGGAAA ATCCAAAAAG GAGATGTTAT TGTCATAAGA TATGAAGGGC CAAAAGGCGG ACCTGGTATG AGAGAGATGC TCTCTCCTAC ATCAGCACTG GCAGGAGTTG GGCTAATTGA AGATGTTGCG CTGATAACTG ATGGAAGGTT TTCAGGTGCA ACAAGAGGTG CATGTTTTGG TCATGTATCG CCGGAGGCAG CAGAAAGAGG ACCAATTGCA GCAGTTCAGG ATGGAGATAT GATTTCAATT GACATAGAAA ACAAGACTCT TACGTTAGAA GTACCAGAAG AAGAAATCAA AAGAAGACTT GAAATCTTAC CACCGTTTGA GCCAAAGGTG AAAAAAGGGT ATCTTTACAG ATACTCAAAA CTTGTCAGGT CTGCGTCAAC TGGTGCTATA CTTGAGTAA
|
Protein sequence | MRSDTVKKGF EKAPQRSLFK AMGYTDEEIR RPLIAVVNSW NEVVPGHIHL DRIAEAVKAG IRLAGATPME FNVIGVCDGI AMGHIGMKYS LITRELIADS IEAMVMAHQF DGMVLIPNCD KIVPGMLIAA ARVNIPAILI SGGPMLAGKI GDKVCDLNSV FEGVGAYSAG KISEEDLYAL EENACPGCGS CSGMFTANTM NCLSEVLGLA LPGNGTIPAV MAARIRLAKM AGMKIVELVE KDIKPSDILT VEAFENALAV DMALGGSTNT ILHLPAIANE VGIKLNLDII NAISDRTPNL CKLSPAGQHH IEDLYFAGGV QAVMNELSKK GLLHLNLMTV TGKTVGENIK DANVKNYNVI RPIDNPYSET GGLVIVRGNL APDGAVVKKS AVPPKLMKHR GPARVFESGE EVFEAILKGK IQKGDVIVIR YEGPKGGPGM REMLSPTSAL AGVGLIEDVA LITDGRFSGA TRGACFGHVS PEAAERGPIA AVQDGDMISI DIENKTLTLE VPEEEIKRRL EILPPFEPKV KKGYLYRYSK LVRSASTGAI LE
|
| |