Gene Athe_1299 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1299 
Symbol 
ID7408880 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1389002 
End bp1390741 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content40% 
IMG OID643715664 
Producthydrogenase, Fe-only 
Protein accessionYP_002573172 
Protein GI222529290 
COG category[R] General function prediction only 
COG ID[COG4624] Iron only hydrogenase large subunit, C-terminal domain 
TIGRFAM ID[TIGR02512] hydrogenases, Fe-only 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.130328 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAATGG TGAATATAAC AATAGATGGC AAGAAGATTC AGGTGCCAAA GGATTATACA 
GTGCTTCAAG CAGCACGCGA AGCAGGAGTT GAGATTCCAA CCCTTTGTTA TCTAAAAGGT
ATAAATGAAA TTGGTGCTTG CAGAATGTGC GTTGTTGAAG TAAAAGGGGC AAGAAGCTTG
CAGGCTGCTT GTGTTTATCC TGTGTCAGAA GGCATGGAGG TCATCACAAA CAGTGAAAGG
GTAAGAAGAG CAAGAAAGGT TAATCTTGAA CTTATTCTTT CAAATCATGA CAGGAGCTGC
TTGACATGTG TCAGAAGTGG AAACTGTGAA CTTCAAAAAC TCGCAGAAGA TTTAAATGTT
AAGCAGATTA GATATGAAGG TGAAAATATA AGAAGACCTC TCGATGATTT TTCACCTTCT
GTTGTAAGAG ATCCAAATAA ATGTATACTT TGCAAAAGAT GTATAAATGT TTGCAGGAAT
GTTCAAGAGG TTGGAGTTAT TAATGCAAAT TACAGAGGTT TCAGAACAGT TATATCCACC
GCATTTGACA GAAGTTTGAA TGATGTTGCA TGTACAATGT GCGGTCAATG TATTCAGGCT
TGCCCAGTTG GAGCTTTAAG AGAGAAAGAC TCAACAGACA TTGTATGGAA GGCTTTAGCA
GACAAGAACA AATATGTTGT TGTTCAAGCG GCCCCAGCTG TGAGAGTTGC ACTTGGTGAA
GAGTTTGGAC TACCAATTGG TACAAGAGTT ACCGGCAAGA TGGTAACTGC TCTCAAGATG
CTTGGGTTTG ACAAAGTATT TGATACAGAC ACAGGCGCAG ACCTTACCAT TATGGAAGAA
GGTACAGAGC TTATTAACCG AATTAAAAAC GGTGGTAAGT TACCGCTAAT AACCTCATGT
TCACCAGGCT GGATAAAGTT CTGTGAGCAC TACTTTCCAG AATTTTTAGA CAACTTATCA
ACTTGCAAAT CACCACATGA GATGTTTGGT GCTATTTTAA AGACATACTT TGCACAGAAG
ATGGGAATTG ACCCTGCGAA TATGTTTGTT GTATCTGTCA TGCCATGTAC CGCTAAGAAA
TTCGAAGCTC AAAGAGAAGA ACTTGCTGCA AGTGGATATC CAGATGTTGA TGCAGTATTG
ACGACAAGAG AGCTCGCAAG AATGATAAAA GAAGCGGGAA TTGACTTTGT GAACTTGCCA
GATAGCCACT TTGACGACCC AATGGGAGAT GCAACAGGTG CAGGTGTCAT CTTTGGAACA
ACAGGCGGTG TAATGGAAGC AGCACTGCGA ACTGTATATG AAGTGCTAAC AGGAAAAACA
CTTGAAAATG TCGAAATTAC TCAAGTTCGT GGCCTTGAAG GAATAAGAGA GGCTGAGATT
GATGTTGGCA CTATGAAGAT TAAAGCAGCT GTTGCACATG GTCTTGCAAA CGCTAAAAAA
CTTCTTGAGA TGGTCAAAAA CGGAGAAAAA GAGTATCATT TTATAGAAAT TATGGCATGT
CCCGGTGGCT GCATAATGGG TGGTGGACAG CCAATTGTTC CTGCAAAGGT AAAAGAAAAA
GTAGATGTTG CAAAACTCAG AGCAAGCGCA ATATACGACG AAGATAGGTC CCTGCCAATA
AGAAAGTCTC ATGAAAACCC TGCTGTAAAA AGATTATATG AAGAGTTTTT AGACCATCCA
AATAGTGAAA AAGCTCATCA TATTCTGCAT ACACACTATA AAAAAAGACC ACTATACTGA
 
Protein sequence
MEMVNITIDG KKIQVPKDYT VLQAAREAGV EIPTLCYLKG INEIGACRMC VVEVKGARSL 
QAACVYPVSE GMEVITNSER VRRARKVNLE LILSNHDRSC LTCVRSGNCE LQKLAEDLNV
KQIRYEGENI RRPLDDFSPS VVRDPNKCIL CKRCINVCRN VQEVGVINAN YRGFRTVIST
AFDRSLNDVA CTMCGQCIQA CPVGALREKD STDIVWKALA DKNKYVVVQA APAVRVALGE
EFGLPIGTRV TGKMVTALKM LGFDKVFDTD TGADLTIMEE GTELINRIKN GGKLPLITSC
SPGWIKFCEH YFPEFLDNLS TCKSPHEMFG AILKTYFAQK MGIDPANMFV VSVMPCTAKK
FEAQREELAA SGYPDVDAVL TTRELARMIK EAGIDFVNLP DSHFDDPMGD ATGAGVIFGT
TGGVMEAALR TVYEVLTGKT LENVEITQVR GLEGIREAEI DVGTMKIKAA VAHGLANAKK
LLEMVKNGEK EYHFIEIMAC PGGCIMGGGQ PIVPAKVKEK VDVAKLRASA IYDEDRSLPI
RKSHENPAVK RLYEEFLDHP NSEKAHHILH THYKKRPLY