Gene Athe_0199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0199 
Symbol 
ID7407190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp244822 
End bp246072 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content34% 
IMG OID643714600 
Product2-hydroxyglutaryl-CoA dehydratase D-component 
Protein accessionYP_002572123 
Protein GI222528241 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1775] Benzoyl-CoA reductase/2-hydroxyglutaryl-CoA dehydratase subunit, BcrC/BadD/HgdB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000602439 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTATA TGGAACTTTA CAGCAAATTA TTAAATAAGC TAATTCAAAA CAACAAAAAC 
GGATATTGGC TTTTGAAAAG TGGGATATTG ATTGGGAAAA ACTATGTTAA GTATTTTCCA
GACAAAAGAC TTCCAAGGTC GTTTCAGTAT CTGCACAAAA TAGTCTTTGA TTGGACTTAC
AAAGGAATTG CAAGTAAAAG TAGTGTATGG GTAAATCTAT TTGCTCCAAG TGAAATTTTA
CTTGCTTTTG GCTTCAATCC CATCTTTGTT GAGGCAATTT CTGCGTTTTT ATCAGGACTT
GATGTGGAAG ACGAGCTCAT CTTGAAAGCA GAAAGCCAGG GAATAAGCGA AAGCTTTTGT
ACTTTTCACA AAGCTTTTTT AGGTGCTGCA ATTTCAAATC TTTTAAAAAA GCCGAAATTC
TTGGTTGCAA CTTCAAATAT CTGCGATGCC AACCTAAATA CATTTAGATT TTTGTCTGAG
ACTTTAAAAC TGCCATTTTT CTTTTTAGAT GTTCCCTCTG AAGATTCCAA AGAGGCAATG
CAGTATTTAA AAGCACAGCT GAGCAGTATT ATAAACTCCA TAGAAAAGCT GACAGGTCGA
AAACGTAATC TTGACTATTT AGCTAAGATT ATAAAAAAAG AAAATGCGAC AAGAAAACTT
ATAAAGGAAA GCTTACAGCT GAGATCAACA AAAAACATTA AAACCACACT TACATTTGAG
ATGTTTATGC TCTATCCTTC GCATGTGTTC TGTGGCACTG ACCCGGCTTT GAGGTTTTAT
CAGATGTTTG TTGATGACCT GAAAAACTCA GAAGAAAGAG GCGGTAAAAG TATTTTCTTT
ATACACACAC TTCCTATATT TGAAGAAAAT TTTAAAGAGT ATTTCAATTT CAGTAGCAGA
ATAAATGTCC TTGGGATGGA CCTAAATTTC GACTTTTTAG ATGAAATAAA CGAACAAGAT
CCAATAGAAG CAATTTGCGA AAAACTTCTG AAAAATCCAT ACAATGGTGA TTTTAAAAGA
AGATTTGAGC ACATCAAAAC ATTAATAGAA ATTTTAAAGC CAGATGGAGT TTTGCAGATA
TGCCAGATGG GATGCAAACA GTCCATAGGA TGCTCAATGC TTTTAAAATC GAACATTGAA
ACTCTGGGTA TTCCATTTAC CACTATAGAT GTAGATTGCG TTAACAAGAA GAACAATGGC
AAAGAACAAA TAAGAACTCG GCTTGAAGCA TTTTTAGAAA GAGTCAAATG A
 
Protein sequence
MNYMELYSKL LNKLIQNNKN GYWLLKSGIL IGKNYVKYFP DKRLPRSFQY LHKIVFDWTY 
KGIASKSSVW VNLFAPSEIL LAFGFNPIFV EAISAFLSGL DVEDELILKA ESQGISESFC
TFHKAFLGAA ISNLLKKPKF LVATSNICDA NLNTFRFLSE TLKLPFFFLD VPSEDSKEAM
QYLKAQLSSI INSIEKLTGR KRNLDYLAKI IKKENATRKL IKESLQLRST KNIKTTLTFE
MFMLYPSHVF CGTDPALRFY QMFVDDLKNS EERGGKSIFF IHTLPIFEEN FKEYFNFSSR
INVLGMDLNF DFLDEINEQD PIEAICEKLL KNPYNGDFKR RFEHIKTLIE ILKPDGVLQI
CQMGCKQSIG CSMLLKSNIE TLGIPFTTID VDCVNKKNNG KEQIRTRLEA FLERVK