Gene Athe_1008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1008 
Symbol 
ID7407910 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1105075 
End bp1106871 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content36% 
IMG OID643715373 
Productindolepyruvate ferredoxin oxidoreductase, alpha subunit 
Protein accessionYP_002572882 
Protein GI222529000 
COG category[C] Energy production and conversion 
COG ID[COG4231] Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits 
TIGRFAM ID[TIGR03336] indolepyruvate ferredoxin oxidoreductase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.160683 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAG TCTTAATGGG CAACGAAGCT GTAGCATATT CTCTTTTTAT CAATGGCGTA 
AATGTTGCTG TTGGATATCC AGGTACGCCT TCAACAGAGG TCATAGAGAC GTTGAAAAAC
TTTCAAGATG ATGATTTTTA TGTAGAATGG TCGACAAACG AAAAAGTAGC ACTTGAGATA
GCAGCAGGTG CGAGCCTTGC AGGTGCAAGA ACTGTGGCTG CGATGAAACA AGTTGGTTTG
AACGTTGCAG CAGACCCTCT TTTATCACTT TCGGTTGTAG GTGTAGAAGG CGGACTTATA
GTGTTTGTAG CTGATGACCC TGGACCTCAT TCCTCTCAGA CAGAGCAAGA CACTCGAAAC
TTTGCAAGGT TTTGCAACCT GCCAGTTTTT GACCCTTCTT CTCCAAAAGA GGCATTTGAA
CTCATAAAAC CTGCTTTTGA GATTTCAGAA AAATACAAGC TGCCTGTGCT TTTTCGAATG
ACAACACGAG TTTGTCATTC GAACCAGTCA ATTGAGTTTG AATTCAAAAG AGAAAAGAGG
AAGATTAAAG GGTTCGAGAA AAAGCCAGAC TGGGTTATTT TACCAGCCCT TTCGTACAAA
AAACATATGG AACTTGAACA AAAGCTTCAG GATATGAAAA AAGAACTTTC TGTTTACAAC
AAAGTTGAAG GTAAAGGGAA AATAGGAATT GTAACGGGTG GAGTATCATA TTTTTACGTG
AAAGAAGCTA TAAAGGGTTT TGAGGATTTG TTTTCAATCT TAAAGATTAC AGTTGCCCAT
CCTTTAGATG AAGAACTTAT ATTGGAATTT GTAAAGGATA AAGAAATACT AATATTCATC
GAAGAGCTTG ACCCTGTTTT GGAAGAAGAA GTTAAACTAA TACTTTTTGA GAATGGTAGA
ATCATCTCAA CATATGGAAA ACGAAATGGT TATGTTCCAT TTGCAGGTGA ACTTGATGTT
GACAAAGTAA AGGACATTTT TTACAAGGTA CTTTCTGACA AGCGATTCTT AATGAAAAAT
ACATTTACGG ATGTTCTGCA AATTCCCAAA AGACAAGCAC AACTTTGTGC TGGATGTCCT
CACAGGAATT CTTTTTTGAT TGTCAAATAT GCAACTAAAG GGCAGGATGT AATCTTCACA
GGAGATATTG GTTGTTACAC TCTTGGATTT GCTAAACCCA TTTCAACAAC AGACACTTGT
CTTTGTATGG GTGCAAGTAT TACTATGGCA CAAGGTCTTA AAATTACAGA TGGTTCAAAA
AAGGTTATTG CCTTTATAGG GGATTCAACC TTTTTTCACA GCGGCATTAC AGGACTTGTA
AATAGCTACT ACAATAGACA CAACATAACC ATATGCATTT TAGACAATCT GACAACTGGT
ATGACAGGAT TTCAGCCTCA TCCTGGAACA GGAAAGAAGA TTTATGGGGA AGAAGGAAGA
AAGGTTAGTA TTGAAAGTAT TGTAAAAGGA ATAGGTGTTG AAAAAGTTTT AATGATTGAC
CCTTATTCAG ATTTTACACA AAATGTAGAC AAAGTAAGAG AATTTTTAAA TAACGACCAA
CTTGGTGTAA TAGTTTTTCG AAGAGAATGT GCCAACTTAA ATTCAAGAGA AGGCTACTTT
AAAATAAATC AGAACTGTTT AAAATGTAAG GTATGCTTGA ATGTGACAGG CTGCCCTGCA
ATAGATGAAG ATGAAAATGG GAATATTTTC ATAGATTCAG TTTTGTGTAA AGGCTGTGGG
CTTTGCAAAA ATTTTTGTCC TTATTATGCG ATTGAAAAGG TGATGGAAAA TGAATGA
 
Protein sequence
MKKVLMGNEA VAYSLFINGV NVAVGYPGTP STEVIETLKN FQDDDFYVEW STNEKVALEI 
AAGASLAGAR TVAAMKQVGL NVAADPLLSL SVVGVEGGLI VFVADDPGPH SSQTEQDTRN
FARFCNLPVF DPSSPKEAFE LIKPAFEISE KYKLPVLFRM TTRVCHSNQS IEFEFKREKR
KIKGFEKKPD WVILPALSYK KHMELEQKLQ DMKKELSVYN KVEGKGKIGI VTGGVSYFYV
KEAIKGFEDL FSILKITVAH PLDEELILEF VKDKEILIFI EELDPVLEEE VKLILFENGR
IISTYGKRNG YVPFAGELDV DKVKDIFYKV LSDKRFLMKN TFTDVLQIPK RQAQLCAGCP
HRNSFLIVKY ATKGQDVIFT GDIGCYTLGF AKPISTTDTC LCMGASITMA QGLKITDGSK
KVIAFIGDST FFHSGITGLV NSYYNRHNIT ICILDNLTTG MTGFQPHPGT GKKIYGEEGR
KVSIESIVKG IGVEKVLMID PYSDFTQNVD KVREFLNNDQ LGVIVFRREC ANLNSREGYF
KINQNCLKCK VCLNVTGCPA IDEDENGNIF IDSVLCKGCG LCKNFCPYYA IEKVMENE