Gene Athe_1292 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1292 
Symbol 
ID7408873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1383225 
End bp1384556 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content37% 
IMG OID643715657 
ProductFe-S cluster domain protein 
Protein accessionYP_002573165 
Protein GI222529283 
COG category[R] General function prediction only 
COG ID[COG4624] Iron only hydrogenase large subunit, C-terminal domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATT TACATTCTAT AATGCTTGAC AAAGAAAAAT GTAAAGGATG TACAAACTGT 
ATTAAAAGAT GTCCAACTGA AGCCATTAGA GTTCGAAACT CAAAAGCAAG GATTATTGAC
CAAAGGTGCA TAGACTGCGG AGAATGCATA AGGACATGCC CGTACCATGC GAAATATGCC
ATTACTGACA GTTTAGAGGA AATCAATAAA TTTCAATATA AAGTTGCACT GCCAGCCCCT
TCGTTTTACG CTCAGTTTGA GGTTGATGAT GTGAATAAGC TTCTGTATGC TTTGCTTAAC
CTTGGGTTTG ATGATATATT TGAGGTAGCA AAAGCAGCTG AGATAGTAAC CCACTTTACA
AAGCAGTTTA TTCTTTCTGA TAAAAACAAA AAACCAGTAA TTTCCTCTGC ATGCCCAGCA
GTTGTAAGGC TTATTCAAAC AAAATTTCCG GACTTAATCG AGAATATTCT GCCAATTGCT
TCACCTATGG AAGTTGCTGC ATATATTGCT AAGAAAAAGA TACATAAAGA AAAAGGAATT
GATGAAGACA AAATAGGCGC TTTTTTTATA TCTCCATGTG CAGCAAAGAT GACATATATA
AATAATCCTC TTGGTTTTGA GCGTTCATAC GTGGATGGAG TAATAGCGAT AAAAGATATT
TATGGACTTG TAAGAAGTAA GCTAAGAGAA ATAAAAGTTA TAAAGCCTCT TTCAATTACC
TCAGGCAAAG GTATTGGATG GGCAGCATCA GGTGGCGAAA GCTTGGCGTT GGAAATTGAA
GAGTATATAA ACGTTGATGG TATTCACAAC GTAGTTAAAG TTTTAGAAGA GATTGAAAAT
GGCAGGCTCA AAGACATCAC ATATTTTGAA GGTCTTGCTT GCACTGGTGG GTGTGTTGGA
GGGCCTCTTG CAGTAGAAAA TCCGTATGTT GCCAAAAATC GTATTAAAAG ATTGTCTTCC
AAATTAAAAG ACAAAGAAGA GAGTCTTTCA GCGTGGACAG CGGAAATTAT TAATAGTTTT
TCTCTCAGGC TTGAGGATGT TCTTTTTGAA AAAGAGTTGG AGACAAATCC TGTGCTTGAA
CTTGACTCTG ATATTGAAAG AGCTATAGAA AAGTTTGAAA AGGCAAATAA TATCCTAAGT
ATACTTCCGG GTTTGGACTG TGGTGCTTGT GGTTCACCTA CATGCAAAAC TCTTGCCGAG
GATATAGTGC GTGGTTTTGC CAATGATACA GATTGTATTT TTATTCTCAG AGAAAGCATA
AAAGAGCTCG CAAACAAGAT GGTTGAGCTT TCAAATAAAC TACCACCATC ACTTGAAAGG
AATGATGAGT AG
 
Protein sequence
MKNLHSIMLD KEKCKGCTNC IKRCPTEAIR VRNSKARIID QRCIDCGECI RTCPYHAKYA 
ITDSLEEINK FQYKVALPAP SFYAQFEVDD VNKLLYALLN LGFDDIFEVA KAAEIVTHFT
KQFILSDKNK KPVISSACPA VVRLIQTKFP DLIENILPIA SPMEVAAYIA KKKIHKEKGI
DEDKIGAFFI SPCAAKMTYI NNPLGFERSY VDGVIAIKDI YGLVRSKLRE IKVIKPLSIT
SGKGIGWAAS GGESLALEIE EYINVDGIHN VVKVLEEIEN GRLKDITYFE GLACTGGCVG
GPLAVENPYV AKNRIKRLSS KLKDKEESLS AWTAEIINSF SLRLEDVLFE KELETNPVLE
LDSDIERAIE KFEKANNILS ILPGLDCGAC GSPTCKTLAE DIVRGFANDT DCIFILRESI
KELANKMVEL SNKLPPSLER NDE