Gene Athe_0623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0623 
Symbol 
ID7406964 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp707210 
End bp708571 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content37% 
IMG OID643715004 
ProductRadical SAM domain protein 
Protein accessionYP_002572520 
Protein GI222528638 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000646743 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTCATA CGTTTGAGAA GTTTGGACTC AAAATTGTAG TGGATGTGGC ATCAGGTTCA 
ATTTTCACAG TTGATAGTGT TGCTTATGAG GTAATAAAGT ATTACAAAGA AAATGGAAGT
TTTGATGGAG TAGAAGATGC ACTTGAGTTT GATAAACAGC AGATAGCTGA GGCAGTTTTT
GAAGTAAAAA GCTTGATTGA ACAAGGAGTT CTGTTTTCTG AGGATACTTA TAAGGATATG
AATTTGATTG AAAAAAGAAA TCTGGTTATA AAGGCAATGT GCCTTCATGT TGCTCACGAC
TGTGACCTTC GGTGCAGGTA CTGTTTTGCA TCAAGTGGTA GTTTCAAACA AGAAAGAAAA
CTTATGAGCT TTGATGTTGG CAAAAAGGCA ATAGGTTTTT TGCTTCAAAA TTCTGGTTCA
AGACAGAACT TAGAGGTTGA TTTTTTTGGT GGAGAGCCAC TTTTGAATTT CGATGTGGTA
AAAAAGATTG TTGAATATGC AAGAGAAGAA GAGAAGAAGT ATAACAAGAA AATATCTTTT
ACACTGACAA CAAATGCAAC AAACCTTTCA GACGATATAA TTGAATATCT AAACCAGAAC
ATGGAAAATG TTGTACTCAG CCATGATGGA AGACCTGAAG TCAATGACTT TATGAGGATT
GACAGAAACG GTAATGGCAC CTATAGCAAA ATCACAAACA ACATTTTGAG GTTTATCCAA
AAAAGAAATG GGAAAACTTA TTATGTGAGA GGAACATTTA CAGCAAAGAA CTTGGATTTT
TCAAAGGACG TTTTACACTT ATACAGCCTT GGGATAAAAG AAATTTCGGT TGAACCTGTT
GTGCTGGACA AAAGCAGTCC CTGGGCGATA CGTGAGAGTC ACATCGAAAG GATAAAAGAA
GAGTATGATA TCTTGGCTGA GGAGTATATA AATGCGAAAT TTAAAGGAGA AGGCTTTAGC
TTCTTCCATT TCAATATAGA CCTTACAGGT GGTCCATGTG TTTCAAAAAG ACTTTCAGGG
TGCGGTGCCG GGTTTGAGTA TGTAGCAGTT GACCCTGAGG GTAATATATT TCCGTGCCAC
CAGTTTGTTG ACAAACCAAG TTTCAAGTTA GGAAGCGTGT TTGAAGGCAT AAAAAGACTT
GATTTGGTTG AGGAGTTTAA GAAAAATACA GTGTATGAAA AAGATGAATG TTCAAAGTGC
TGGGCGAGGT TTTACTGCAG TGGCGGATGT GCTGCTGCAA ACTATAATAT GAACGGTGAT
GTGAAGAAAT CTTACGTTGT TGGATGCGAA CTTGAAAGAA AGAGGGTGGA AAATGCAATA
GCTATAAAGC TTTATCTTAT GGAAAAGGGG ATAAGAAGCT AA
 
Protein sequence
MVHTFEKFGL KIVVDVASGS IFTVDSVAYE VIKYYKENGS FDGVEDALEF DKQQIAEAVF 
EVKSLIEQGV LFSEDTYKDM NLIEKRNLVI KAMCLHVAHD CDLRCRYCFA SSGSFKQERK
LMSFDVGKKA IGFLLQNSGS RQNLEVDFFG GEPLLNFDVV KKIVEYAREE EKKYNKKISF
TLTTNATNLS DDIIEYLNQN MENVVLSHDG RPEVNDFMRI DRNGNGTYSK ITNNILRFIQ
KRNGKTYYVR GTFTAKNLDF SKDVLHLYSL GIKEISVEPV VLDKSSPWAI RESHIERIKE
EYDILAEEYI NAKFKGEGFS FFHFNIDLTG GPCVSKRLSG CGAGFEYVAV DPEGNIFPCH
QFVDKPSFKL GSVFEGIKRL DLVEEFKKNT VYEKDECSKC WARFYCSGGC AAANYNMNGD
VKKSYVVGCE LERKRVENAI AIKLYLMEKG IRS