Gene Athe_0102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0102 
Symbol 
ID7408464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp123769 
End bp125328 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content30% 
IMG OID643714510 
Producttwo component transcriptional regulator, AraC family 
Protein accessionYP_002572033 
Protein GI222528151 
COG category[T] Signal transduction mechanisms 
COG ID[COG4753] Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGTATA AAGTTTTAAT TGCTGACGAT GAAAAAATAG TTGTGGATTC TATTAAATTT 
ATACTTGAAA ATAATCTGAA TACAGATTTT GAAATTTCTA TATTTACATC AGGTAGAGAA
GCACTTGAAA ACTTGCTTTT TTATTCTTAC CATATAGCTT TTATAGACAT TAAAATGCCA
GACTTAGATG GACTTGAACT TATAGAAGAG TACAGAAAAA TGAAAAATTC AGAGTTTCCA
ATTTTTATTA TTGTTTCTGC ATACGACAGG TTTGAGTTTG CCAAAAAAGC AATAAAGGAA
AAGGCATTTG CATATATCCT AAAACCGTAT TCAATAGAAG ATATAATCTC AACTATGCAC
TCTGCAATAG CTCAAGTAGA TAGCATTTTG GCAAGGACAA AAGAGAACAT AGAAAAAAAT
GCACAGCTGA TTGTGATGAG AAACTTGCTT GAAAACAGTT TTATACCTAC CTTAATATTC
AAAAATGCAT TCGATATTGT TGATGTAAAT CAATATGAAA AAATCTTTGG AATAAATCTT
AAAAGCGGGT TTTTAATGGT TTTGACACTC AAAGACAAAA GTGATTTGGT CTCAAGCTTC
AAAGAACTTG ACAATATCCG AAAAGACATA AAAATCTCAT TTGAACACAA AGCTTTAACA
TCAATTGGCA TGGGTGAGTA TCTTATTTGT TTTTTCCCGT CTCAATCACA GAAAGAAGCA
GAGGTTTTGC AAGAAAAAAT TCAAGAAATT CTAAAACAAA AACCTTACTG GAATAGTATC
AAAATTGGAT TTAGTGACCT TTATTACTTA GAAGAAGGAT ATGAAAATGC ATTCTGGGAG
GCATACTATT CAACCCTTGA CTTGGAATTC CCAGAAGAAA ATGAAGAAAA TGAGCATCTC
CTTCTATTGA CAGAAAATTT AGAAGCAAAA CTGATTCATT CTATCAACAA CCCAACACAA
ATTCCAATGA TAGAAAACTA TATAACCCAG CTTTGCAAAT TATACATTGA ACTTTTTGGG
GAAAATAACC TAAAATACAA AGTGATAAAA CTTATTATAA TGTTATTGCT TGAAACTGGA
ATAGCAACAA GCGATGAGTC TATTGATGTA GAAAAATTAA TTTCGCAAAT ACTTAATTCT
TCTTATGAAC AGATTGTAGA AATATTTAAA AAAGCTGTGC TTTCACTTTT TAGCAAGGCA
AAAACCAAGC ATGAACAGAT TATAAACAAT GATTCGATTA ACAAAGCAAT AGAATTTATA
AACCAAAACT ACAGTGAGGA AATTACACTT TCACAGATAA GCTCAACTTT TAACTTTAAC
CCATATTATT TCAGTAAATT GTTTAAAAAA TACACAGGTG TAAGTTTTAA GACATACCTT
ACAAAGCTTA GAATTCAAAA GGCTTGTCAG CTTCTGAAAA ATACATCAAA GAGTATAAAG
GAAATATCAT TTGCTGTTGG TTTTTCTGAC CCGAACTATT TTATCAAGGC TTTCAAAAAG
TTCACTGGAA TGACACCCTC TGCATTTAGA AGCTCATCAG TAGATATAAA TTCAATATAA
 
Protein sequence
MTYKVLIADD EKIVVDSIKF ILENNLNTDF EISIFTSGRE ALENLLFYSY HIAFIDIKMP 
DLDGLELIEE YRKMKNSEFP IFIIVSAYDR FEFAKKAIKE KAFAYILKPY SIEDIISTMH
SAIAQVDSIL ARTKENIEKN AQLIVMRNLL ENSFIPTLIF KNAFDIVDVN QYEKIFGINL
KSGFLMVLTL KDKSDLVSSF KELDNIRKDI KISFEHKALT SIGMGEYLIC FFPSQSQKEA
EVLQEKIQEI LKQKPYWNSI KIGFSDLYYL EEGYENAFWE AYYSTLDLEF PEENEENEHL
LLLTENLEAK LIHSINNPTQ IPMIENYITQ LCKLYIELFG ENNLKYKVIK LIIMLLLETG
IATSDESIDV EKLISQILNS SYEQIVEIFK KAVLSLFSKA KTKHEQIINN DSINKAIEFI
NQNYSEEITL SQISSTFNFN PYYFSKLFKK YTGVSFKTYL TKLRIQKACQ LLKNTSKSIK
EISFAVGFSD PNYFIKAFKK FTGMTPSAFR SSSVDINSI