Gene Athe_2749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2749 
Symbol 
ID7408319 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2899221 
End bp2900303 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content37% 
IMG OID643717105 
Productputative PAS/PAC sensor protein 
Protein accessionYP_002574574 
Protein GI222530692 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGTTTT CGAAAAAATT TTATGAGAGT ATTTTAAATG GAATGCTTGA CCTTGTTAGG 
GTCATTGATA TAGATGGAGT TGTTGTTTTT TGCAATACAA AGATGAAAGA AGAATTTGGC
GACCAGACAG GCAAAAAGTG CTATGAACTT TTTTGCAAAG ATTCACGGTG TGAAGACTGC
ATTGCAATAA GGTCTATAAG AGAAAATACG CGGTTTATGA AGTATACACA TTATAAAGAC
AAGACATACT ATGTCATAAG CTCACCGGTT TGTGGCCAGG ATGGAAAAGT TGTTGGAACT
GTTGAGGTGT TCAGAGATAT TACAGAACAG AAAAAGATTG AAGAGAGGCT TAGACGCCAG
AATGAGATTT TGAGACGTGA CTTGGAGTTT GCAAAGAGGC TTCAGCAATC GCTTCTGCCA
GTCATACCAA GAATTGAAGG GTATAGAATT ACATATACAT ACAAGCCGTG TGAAAGGCTT
GGCGGAGATT TTTTGGATGT CATCAATATA GATGACAAGA TAGTTTTCTA TGTTGCAGAT
GTGGCAGGGC ACGGCCTTTT GGCTTCAATG GTAACAATAT TTGTCAAACA AAGCATCATC
AAAAATGCTC ACACTTACAT AAACTCAAGT GCCCAGGAGA TAATAAAGGG AGTTCTCTTG
GATTTTATAG AGATGAATTT TCCAAGCGAG ATATACATCA CCGTAGTGCT TGGCATCCTG
GAAAAACAAA GTGGAAAAGT TACTATGATT TCTGCCGGGC ATGTGACAGA GCCCATTTTG
GTCAAAGCGA ATAAGAAGGT TAAAATGTTT TCGATGAGAG GGCAGCCTAT TGCATCAATT
GACCTTGAAC AAGGGTTTGA GATGAAAGAG ACAATTCTTG AGAAAAATGA CAAGCTGATA
TTTTACTCAG ATGGTCTTAT TGAAAGCAAG AACAAACAAG GTGAGATGTA CGGTAAAAAG
AGGCTTATAA AAAGAATACT TAGTATTAAG AACATCAACA CAGAGCTTTT AATAAGAGAT
GTGAGAAATT TTGTCTCAGA TATAGACGAT GACATAGCTG TTTTGATGGT TGAGAAGATT
TAA
 
Protein sequence
MEFSKKFYES ILNGMLDLVR VIDIDGVVVF CNTKMKEEFG DQTGKKCYEL FCKDSRCEDC 
IAIRSIRENT RFMKYTHYKD KTYYVISSPV CGQDGKVVGT VEVFRDITEQ KKIEERLRRQ
NEILRRDLEF AKRLQQSLLP VIPRIEGYRI TYTYKPCERL GGDFLDVINI DDKIVFYVAD
VAGHGLLASM VTIFVKQSII KNAHTYINSS AQEIIKGVLL DFIEMNFPSE IYITVVLGIL
EKQSGKVTMI SAGHVTEPIL VKANKKVKMF SMRGQPIASI DLEQGFEMKE TILEKNDKLI
FYSDGLIESK NKQGEMYGKK RLIKRILSIK NINTELLIRD VRNFVSDIDD DIAVLMVEKI