Gene Athe_2078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2078 
Symbol 
ID7408787 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2198955 
End bp2199944 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content35% 
IMG OID643716445 
Productoxidoreductase domain protein 
Protein accessionYP_002573928 
Protein GI222530046 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATTT GCATAGTAGG CAGTAGTGGA CACTATGTAT ATGCTTTAAG AGGAATAAAA 
GAAGACCCTC ATGCCCAAAT TGTGGGAATC TCTCCTGGAT GTGAAGGAGA GAATATTGAA
AGGTTACATT CTCAAGTAAA TGAAATGGGA TTCACACCTG TGGTTTATAG CAATCCTATA
AGGATGTTTG AAGATCTCAA ACCTGACATT GCTGTGATTA ATACATTTTT TTATAAAAAT
TCTGAGCTTG CAATTGAGGC TATGAAAAGA GGAATCCACG TATATATGGA AAAGCCTGTT
GCACTATCAA TAGAAAAACT TGAAGAACTA AAGAGTGTGT GGAGGCAAAC AAAAGTAAAA
CTCTCATCAA TGCTGGGATT GCGCTATACA CCCCATTTTT GGACTGCTTA TAAACTTATA
AATGAAAACA AGATAGGTAG AATAAGACTG ATACATGCCC AAAAATCTTA TAAACTTGGA
ACTCGACCTG ACTTTTATAA ACATAGAAGA ACATATGGCG GAACAATTCC CTGGGTTGGC
ATTCATGCTA TTGATTGGAT TTATTGGCTA AGTGGCAAGA AATTTAAATC GGTCTTTGCA
GGACATTCAA AACTTTATAA TAATGATCAT GGTGAGCTTG AATCTACTGC TTTTTGTAGT
TTTGTAATGG AAGATGAGAT TTTTGCAACG GTGAACATTG ACTATCTGCG TCCTGCTACT
GCCCCTACTC ATGATGATGA TAGAATTAGA ATTGTGGGAA CAAGAGGAAT TTTTGAAGTT
TTAAATGGAA AAGTTTTCTT GCTAAATGAT ACCACTAAAG AGATCTCAGA AGTCTCTTTA
GAAAAACCAC CTATTGTGTT TTTAGATTTC TTAAATGAGG TAAGAGGTAC AGATAAGTGC
TTAGTTAGTA GCGAGGATAG CTTTTATGTA ACCTTTGCTT CGCTTTTAGC AAGGCAGTCT
GCTGATGAGG ATAAGGTAAT TGAATTTTAA
 
Protein sequence
MKICIVGSSG HYVYALRGIK EDPHAQIVGI SPGCEGENIE RLHSQVNEMG FTPVVYSNPI 
RMFEDLKPDI AVINTFFYKN SELAIEAMKR GIHVYMEKPV ALSIEKLEEL KSVWRQTKVK
LSSMLGLRYT PHFWTAYKLI NENKIGRIRL IHAQKSYKLG TRPDFYKHRR TYGGTIPWVG
IHAIDWIYWL SGKKFKSVFA GHSKLYNNDH GELESTAFCS FVMEDEIFAT VNIDYLRPAT
APTHDDDRIR IVGTRGIFEV LNGKVFLLND TTKEISEVSL EKPPIVFLDF LNEVRGTDKC
LVSSEDSFYV TFASLLARQS ADEDKVIEF