Gene Athe_1968 details

Gene Information

Locus tag: Athe_1968
Symbol:
ID: 7407382
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 2080043
End bp: 2081419
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 29%
IMG OID: 643716340
Product: Radical SAM domain protein
Protein accession: YP_002573828
Protein GI: 222529946
COG category: [R] General function prediction only
COG ID: [COG0535] Predicted Fe-S oxidoreductases
TIGRFAM ID:
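
The start, end, and length fields above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch in Python, not part of the source record. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Coordinates taken from the record above.
length_bp = gene_length(2080043, 2081419)
print(length_bp, protein_length(length_bp))  # prints "1377 458"
```

Under those assumptions both values agree with the Gene Length and Protein Length fields of the record.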


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATGGAA AAATACAAAT TACAAATATC AGATGGGAAA TAGAACCAAA ATGTAATCTA 
AATTGTCGTC ATTGTTTTGT AGGATCAAAG TTAGATAAAT ATAAGGAAAT GGATTTTGAG
ACAGCTAAAG TTGTTGTTGA TATATTAAGT GATTGCGGAA TAAAAGAAAT AATTTTTTCA
TCGAAAGAAC CTTTAATGTA TAAAGATATT GATAAGCTTA TTTTTTATTG TAATGTGAAA
GGAATACATA CGGAATTGGT CACTAATGGA ACATTATTAC AAAATACTTT GTTTGCTGAA
AAAATTGTAT CTAGTGGTGT TGGGACTATC AGTATTAGCA TAGAAGGTAT TACGAAAGAT
TCAAACGATT ACATTAGGGG TGAAGGAAAC TTAGATCAGG TATTAAGAGC ATTAGGTAAT
CTGAAAAATA TAATGGAATA TAAAAGGCAA ATTATCATTG GAATTCAAAT GTCACTAAAT
AGAAAGAATA AAAATGAAGC TGCTCACGTA CCTGAATTTT TTAATCGTCT GCCAATTGAT
ATTCTTGTGA TTGGTGGACT TTCTTTAGAT GGGAATGCAA AAAACAATAG TGATTTATTA
CTATCTTCGG ATGAATTCAT ACAGATATGG GATGTAATAT TAGAAAATTA TTTAAAATTA
AAAGATAAAA AATATTATCT TACATCTAAA TCGTTATTTC CAACTGAGGC AGTGTACTAT
AATTGCTTGT TTGGCTCTGA CTTTTGTCCA GTAATACCAA AATGCGGTAT ATTAAAAGAA
CACTACTCTC TTTTGCCAAA TGGCGATATA GTTCCTTGTG TAGCTCTTTT AGACAAGATA
AATGATGTTG GAGAATTTCC ACGTATAAAT ATTCTTGATA AAATTTTGAA TGAAGAGAAA
ATCAACAAGT TGGAAAACTT TAAAATTCAG TTAGAAAATT TGATAAAAGA AAATAGGCCT
CATGTTTGTC GTGAATGTTA TTACAAAGAG GATTGTAGGC CTTGTCCAGT AAACATTATA
CAACAGCAAT TTAAGGAAGA AATCTCAATA AGGTGTTCAA GAGCTAAGGG AAAAATAAAA
GAAATATTTA ATTTGATTAG AAGACATTTT GATGAGTATT ATCTTTCAAT TAGAGATAAT
ACATTGTTGA CGATTCAAAA TCAAAAAGTA TCTTTAGCAA GGTATTATGA ACAAGGTGGA
GTATACAGAA AAGAATATGA ACTTGAGCCA TTGCAGATTA ATGCTTTGCA GAAAATTTTT
AATTTTAAAG AGATAAGCTT AAAGGAATTG TTATATGATT TTAAAAATTA CGAGATGTTA
CTAAATTTTA TTGAGCCGTT AGTGTTTGAT AATTTTATTA TGGTAAGGAG GAGCTAA
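
The gene sequence is displayed in space-separated groups of 10 bases, so it needs cleaning before any calculation such as GC content. The sketch below is an illustration in Python with hypothetical helper names (`clean_sequence`, `gc_percent`). The record does not state how its GC content figure was computed or rounded, so this is only one plausible way to recompute it.

```python
def clean_sequence(raw: str) -> str:
    """Remove display spaces and line breaks; return the bases in uppercase."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGGATGGAA AAATACAAAT TACAAATATC AGATGGGAAA TAGAACCAAA ATGTAATCTA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full 1377 bp sequence through the same two functions gives a value to compare against the GC content field.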
 
Protein sequence
MDGKIQITNI RWEIEPKCNL NCRHCFVGSK LDKYKEMDFE TAKVVVDILS DCGIKEIIFS 
SKEPLMYKDI DKLIFYCNVK GIHTELVTNG TLLQNTLFAE KIVSSGVGTI SISIEGITKD
SNDYIRGEGN LDQVLRALGN LKNIMEYKRQ IIIGIQMSLN RKNKNEAAHV PEFFNRLPID
ILVIGGLSLD GNAKNNSDLL LSSDEFIQIW DVILENYLKL KDKKYYLTSK SLFPTEAVYY
NCLFGSDFCP VIPKCGILKE HYSLLPNGDI VPCVALLDKI NDVGEFPRIN ILDKILNEEK
INKLENFKIQ LENLIKENRP HVCRECYYKE DCRPCPVNII QQQFKEEISI RCSRAKGKIK
EIFNLIRRHF DEYYLSIRDN TLLTIQNQKV SLARYYEQGG VYRKEYELEP LQINALQKIF
NFKEISLKEL LYDFKNYEML LNFIEPLVFD NFIMVRRS
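
The protein sequence can be related back to the gene sequence by translating codons under translation table 11, as listed in the record. The following is a minimal Python sketch, not the database's own method. It relies on the amino-acid assignments of NCBI table 11 being the same as those of the standard code (the tables differ in their permitted start codons), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order (table 11 assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop display spaces and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X for unknown codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in the record.
print(translate("ATGGATGGAA AAATACAAAT TACAAATATC"))  # prints "MDGKIQITNI"
```

The output matches the first ten residues of the protein sequence shown above. The function always reads from the first base in the forward direction, so it does not handle reverse-strand input or alternative start codons.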