Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0199 |
Symbol | |
ID | 7407190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 244822 |
End bp | 246072 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 643714600 |
Product | 2-hydroxyglutaryl-CoA dehydratase D-component |
Protein accession | YP_002572123 |
Protein GI | 222528241 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1775] Benzoyl-CoA reductase/2-hydroxyglutaryl-CoA dehydratase subunit, BcrC/BadD/HgdB |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000602439 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTATA TGGAACTTTA CAGCAAATTA TTAAATAAGC TAATTCAAAA CAACAAAAAC GGATATTGGC TTTTGAAAAG TGGGATATTG ATTGGGAAAA ACTATGTTAA GTATTTTCCA GACAAAAGAC TTCCAAGGTC GTTTCAGTAT CTGCACAAAA TAGTCTTTGA TTGGACTTAC AAAGGAATTG CAAGTAAAAG TAGTGTATGG GTAAATCTAT TTGCTCCAAG TGAAATTTTA CTTGCTTTTG GCTTCAATCC CATCTTTGTT GAGGCAATTT CTGCGTTTTT ATCAGGACTT GATGTGGAAG ACGAGCTCAT CTTGAAAGCA GAAAGCCAGG GAATAAGCGA AAGCTTTTGT ACTTTTCACA AAGCTTTTTT AGGTGCTGCA ATTTCAAATC TTTTAAAAAA GCCGAAATTC TTGGTTGCAA CTTCAAATAT CTGCGATGCC AACCTAAATA CATTTAGATT TTTGTCTGAG ACTTTAAAAC TGCCATTTTT CTTTTTAGAT GTTCCCTCTG AAGATTCCAA AGAGGCAATG CAGTATTTAA AAGCACAGCT GAGCAGTATT ATAAACTCCA TAGAAAAGCT GACAGGTCGA AAACGTAATC TTGACTATTT AGCTAAGATT ATAAAAAAAG AAAATGCGAC AAGAAAACTT ATAAAGGAAA GCTTACAGCT GAGATCAACA AAAAACATTA AAACCACACT TACATTTGAG ATGTTTATGC TCTATCCTTC GCATGTGTTC TGTGGCACTG ACCCGGCTTT GAGGTTTTAT CAGATGTTTG TTGATGACCT GAAAAACTCA GAAGAAAGAG GCGGTAAAAG TATTTTCTTT ATACACACAC TTCCTATATT TGAAGAAAAT TTTAAAGAGT ATTTCAATTT CAGTAGCAGA ATAAATGTCC TTGGGATGGA CCTAAATTTC GACTTTTTAG ATGAAATAAA CGAACAAGAT CCAATAGAAG CAATTTGCGA AAAACTTCTG AAAAATCCAT ACAATGGTGA TTTTAAAAGA AGATTTGAGC ACATCAAAAC ATTAATAGAA ATTTTAAAGC CAGATGGAGT TTTGCAGATA TGCCAGATGG GATGCAAACA GTCCATAGGA TGCTCAATGC TTTTAAAATC GAACATTGAA ACTCTGGGTA TTCCATTTAC CACTATAGAT GTAGATTGCG TTAACAAGAA GAACAATGGC AAAGAACAAA TAAGAACTCG GCTTGAAGCA TTTTTAGAAA GAGTCAAATG A
|
Protein sequence | MNYMELYSKL LNKLIQNNKN GYWLLKSGIL IGKNYVKYFP DKRLPRSFQY LHKIVFDWTY KGIASKSSVW VNLFAPSEIL LAFGFNPIFV EAISAFLSGL DVEDELILKA ESQGISESFC TFHKAFLGAA ISNLLKKPKF LVATSNICDA NLNTFRFLSE TLKLPFFFLD VPSEDSKEAM QYLKAQLSSI INSIEKLTGR KRNLDYLAKI IKKENATRKL IKESLQLRST KNIKTTLTFE MFMLYPSHVF CGTDPALRFY QMFVDDLKNS EERGGKSIFF IHTLPIFEEN FKEYFNFSSR INVLGMDLNF DFLDEINEQD PIEAICEKLL KNPYNGDFKR RFEHIKTLIE ILKPDGVLQI CQMGCKQSIG CSMLLKSNIE TLGIPFTTID VDCVNKKNNG KEQIRTRLEA FLERVK
|
| |