Gene Athe_2228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2228 
Symbol 
ID7407647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2361071 
End bp2362183 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content35% 
IMG OID643716594 
Productregulatory protein GntR HTH 
Protein accessionYP_002574073 
Protein GI222530191 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.916284 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTAAAA TAAACTTGTA TATACAAGAT GATGGTAAAA TGAGTAAAAC AAAATATGAA 
TTGATAAAGG ATTTCATAAT TGAGGGTATT AATTCTGGTA AATTTAAAGA GGGAGAAAAG
ATTTATTCAG AGAATATGCT TGCAAGGAAG TTCAAAGTTT CGCGGCACAC TGTAAGAAGA
GCTATAATGG AGCTTGAATT TGAAGGGCTT TTGGTATCGC TAAAAGGAAG AGGCACGTTT
GTTGCAAAAA AGAGTGCAGA CAGTTCTAAA TGCATTGCAG TTCTTACAAC ATTTATCTCT
GACTATATCT TTCCACTAAT TATAAGAGGA ATTGAAAAGG TGATAGCACA TGAAGGTTAC
GGACTTTTAC TATTTTCAAC AGACAATAGT TACGAGTTTG AAAGATACCA CTTGGAAAGT
ATAATCAACA ACCCCAATAT TGATGCTGTT ATAATAGAAC CCACAAAAAG TGCGCTGCCT
TCCAAAAATC TGGAACTTTA CAAAAAGCTT ATACAAAAAG ACATACCTGT TATATTCATA
AACACCATCT TAGAAGAAGT GGCTCAAAAC TATATAATAA CAAAAGACCA ATCGGCTGTA
TATAAACTTA CAACCAACCT TATAAAAAAT GGGTGTAAAA GACTTTTTGG TGTGTTCAAA
GGTGATGACC TGCAAGGAAT AAAAAGGTAC CGTGGGTTTG AAAAAGCCTG CAAAGAAGCC
GATGTAGAAT TTGATGTAAT CTTTTTCACA AGTGAGGAAT ACAATTTTGT TCACAAAAGG
GCAGCAGAGG TTATATCAAG AGGAAAGTTT GATGCAGCTG TCTGTTACAA CGACAAGATT
GCTCTTCCTC TTTGTGTAAA GCTAAAGGAA ATGGGTTTTA GGATTCCCCA TGATATATCT
GTTACAGGGT TTGACAATTC ACTTTTAGCA ACATTAACAG ATATAAAGCT CACAACAGTT
GAACATCCTA AGGAAAAGCT TGGTGAGATG GCAGCAAAAG CCACTATTGC AATGATAAAG
AAAAATAAAG TGCATGTAAG TGAAGAAATA GAGTGTGAGA TTATTTACAG GAATTCCACA
AGAGGAGGAA TACAGGAATG CTTGAAAGCT TAA
 
Protein sequence
MIKINLYIQD DGKMSKTKYE LIKDFIIEGI NSGKFKEGEK IYSENMLARK FKVSRHTVRR 
AIMELEFEGL LVSLKGRGTF VAKKSADSSK CIAVLTTFIS DYIFPLIIRG IEKVIAHEGY
GLLLFSTDNS YEFERYHLES IINNPNIDAV IIEPTKSALP SKNLELYKKL IQKDIPVIFI
NTILEEVAQN YIITKDQSAV YKLTTNLIKN GCKRLFGVFK GDDLQGIKRY RGFEKACKEA
DVEFDVIFFT SEEYNFVHKR AAEVISRGKF DAAVCYNDKI ALPLCVKLKE MGFRIPHDIS
VTGFDNSLLA TLTDIKLTTV EHPKEKLGEM AAKATIAMIK KNKVHVSEEI ECEIIYRNST
RGGIQECLKA