Gene Athe_0600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0600 
Symbol 
ID7406941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp677989 
End bp679629 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content32% 
IMG OID643714983 
Producttwo component transcriptional regulator, AraC family 
Protein accessionYP_002572499 
Protein GI222528617 
COG category[T] Signal transduction mechanisms 
COG ID[COG4753] Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATAAAA TTGTAATAGT TGACGATGAC ATGCTTATTC GAAAAGGAAT TAGAAATGTA 
ATAAGATGGA AAGATTTAGA TTGTGAAATT AGCGGTGAAG CTTCAAGCGG GGATGATGCT
CTAAAGGTGA TCGAGAAAGT AAAACCTGAC ATTGTCATTA CAGACATTAA AATGCCCAAT
ATGGATGGTA TTGAACTCAT AGAAAGAATT AAAAAGATTA TACCACATTG CAAGATTGTA
ATTTTAACAG CTTACAGAGA GTTTGAATAT GCACAGAGAG CTATAAAATG TGGTGCTTTT
GATTTTCTTC TAAAACCAAC CAAGGTAGAA GATATAATAA ATGTTGTGCA AAAAGCTATA
GACGAGATTA AAAAAGAAAA AACAATATTA GAAGAGGTAG AAAAGATAAA TAACATTTTG
AAAGAAAAAT TACCAATCTT GAGAGAAAAT TTTTTGTTCA ATGTCATGTT TGAGATGATT
TCAAATGAGG ACGAGATTCT TCAGATGGCA TCTCTTTATG AGATTGAAAT TGATATATTT
TTAATGATTT TAGCTGAGTG CAGTACAAAA GAAGAAAGAG GAAGACAGAA TACTCATTTG
TACCTTTTGG GAATCTCAAA TATGCTAAAT GATTTTTTGA GCAATAATTT TACCATCTAT
ACTATTTATT TAAATAACTC TCAAGCAGTG TATATTGTAA ACTCAAAGAA GGAACTTGAT
AAAGAAGAAG AGAAGAAATT CTTTGAACTT TTGAGCCAGC TAAAAAAAGC AGCAATGGAA
TGTTTTAATA TTGATTTGAC ATTTGCAGTA AGTACGTGGG GCAGGGGATT AGTGCAGCTT
CCAGACAAAT ATAAAGAATG CATAGATGCA ATAAATTACA GATTTTATTT TGATGAAGAA
GATATCATAT ATTACAAGGA CCTTTCTCAC TTTTTCATGT ATGTGGATGA TAGAAAGTTG
AAAAATCTGA AAAATGAAAT TTTGTTGAGT GTCAGATATG GAAATTATTC AAACATAAAT
AATATCTTGC TTGAGCTCGA AGAGACCTTG AAAAAGTCAA AAGCAGATAA ACAGTATATC
TTCAATTTTT ACTATCTTCT GCTCATTGAA ATAAATATGA TAAAAGCTCA GCTTTCTTCG
GCCATAAATA AACCTGATAA TCCAGAGATG TTTGAAAATT TTGATTATTT TAATGAGATT
ATAAAATGCA AAAGTTTGTC AGAACTCAGC AACATCTTGC GAATATCTAT CCAGCGTACA
ATTGAAGAAG TCCAAAAACA CAACCTAAAC AAAATGGGCA GTTTAATAAA AAAGGTGATA
GATTACATAA AAGAAAACTA CCACTCAAGC GAGATTTCCC TCAGCGATAT TTCAGAGAAA
TTTTTTGTAA GTCCTTCATA TTTGAGCAGA CTGTTCAAGA AAGAGACAGG AAAAAACCTT
TCCGACTTTA TAAATGAGTA TAGAATAGAA AAGGCAAAGC AGCTTTTGCT CACAACCGAC
ATTAAAACAT ACGAGGTTGC AGATAAGGTT GGTATTCCAG ACCCACACTA CTTTTCAAGA
CTTTTTAAAC GCTACACTGG CTACAGCCCT TCTGAATACA AAGAAGGTGC AAAACTAAAA
GGTGAAAAAG TAAGCGAATA A
 
Protein sequence
MYKIVIVDDD MLIRKGIRNV IRWKDLDCEI SGEASSGDDA LKVIEKVKPD IVITDIKMPN 
MDGIELIERI KKIIPHCKIV ILTAYREFEY AQRAIKCGAF DFLLKPTKVE DIINVVQKAI
DEIKKEKTIL EEVEKINNIL KEKLPILREN FLFNVMFEMI SNEDEILQMA SLYEIEIDIF
LMILAECSTK EERGRQNTHL YLLGISNMLN DFLSNNFTIY TIYLNNSQAV YIVNSKKELD
KEEEKKFFEL LSQLKKAAME CFNIDLTFAV STWGRGLVQL PDKYKECIDA INYRFYFDEE
DIIYYKDLSH FFMYVDDRKL KNLKNEILLS VRYGNYSNIN NILLELEETL KKSKADKQYI
FNFYYLLLIE INMIKAQLSS AINKPDNPEM FENFDYFNEI IKCKSLSELS NILRISIQRT
IEEVQKHNLN KMGSLIKKVI DYIKENYHSS EISLSDISEK FFVSPSYLSR LFKKETGKNL
SDFINEYRIE KAKQLLLTTD IKTYEVADKV GIPDPHYFSR LFKRYTGYSP SEYKEGAKLK
GEKVSE