Gene Athe_0217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0217 
Symbol 
ID7407208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp263481 
End bp265775 
Gene Length2295 bp 
Protein Length764 aa 
Translation table11 
GC content36% 
IMG OID643714618 
Productprotein serine/threonine phosphatase 
Protein accessionYP_002572141 
Protein GI222528259 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID[TIGR02865] stage II sporulation protein E 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGAAAGAC TTGAGGTTCT GCAGTTTAAA AGACAGCAGA CAGAAAGACC AAAGAATACT 
ATATCAGACC GGTTTTTGGT TTTTAGTGCA CAGGAGTTGT TTATTCTGAT TTTAGCCTTT
TTGCTTGGAA GATGTAGCCT GTTTTCAAGA AGCTTTTTTT CATCTGCGTA TGTTGCAAGT
TTCAAAAAAA GGGATTACAT GTACTATTTA GCCTCGCTTT TTTCCATCTT CGGGATAATT
TCGTGCTCAG ATAAAAGCTC AATATTAAAA TATGTGCTTT CCATTCTTTT GATAACTACA
ATCAACCATT TTTTTGACCT AAACCTTTAT TCAAAGGCCC TGCTCTGCGC TTTGAGTGTA
GGAAGCAGTG GCAGCATTTC TATTTTTCTG TTTTCTAAAG CACCGATAGA GTTTTTGTAT
CTTGTACTTG AGATGGTTGG GTGTTTTTGG GCTGTAATTA TGTTTGAAAG GTTTTTTGCT
GCAATTTATC CCAAAAAAGC ATATACAGCT GATCAGACGG TGGTTGTGGT TGTAGTTCTG
GCTTTGAGCT TTTTGGGGCT TTCAAACAGC TTGGATTCGG TACTGGATAT TGAAAATATT
TTGTTTTTCA TTCTCCTTTT TGCGGTATCG CTTTTTCACG GAATGATAAT GTCGACTGCA
ATGGGTTTTG TAATTGGGCT TTTAGAAAGC ATAAAAGAGT GCAGAAGCAT TGAAATGGCA
TGCGTTTTTG CATTCTCAAG CCTTCTGGCA GGTCTAATGA AAGGGTTTGG CAAGCTTGGA
ATTGCTTTGG GCGGGTTTTG TGGATATATC ATATCAATGT TTTACATATC GTCAAACCCG
ACACTTAGAT TTCGTGAGAT TTTGATATCT GCCGTGTTGT TCTGCCTGTT CCCGTTAGAA
AAGATTGTAA AATTACAAAG TACTGATGAG AGGGAGGTAC AGAGAATGAT AAAAGAAAAG
ATTTTTGGAG TTGCATCTAT AATTGAAAAT ATACAACAGA ATGTTTGCAG CAAACCAGCA
GTTTTAATTT GCAAGGATGA AGCAAAGAAT ATTGTTCAAA GTGCGTGTCA AAAACTTTGT
TTGGATTGCG GAAATTCAAA TGTGTGCTGG AATATAGATT ATCATAGGAC AAATCATAGT
TTGAACGAGA TAAAAAATAT AATATTGAAG AAGGGGAAGC TTTCTCAAGA AGATTTAAAA
GAATTTAGAT TTTTGTGTGA AAAAGCAAAA GAGTTTGAAA TAATTATAAA TGGTTTTTTG
GAATCTTTGA AATATTCAAA GCTTGTTCAA GAGGCATCAA GCCCCAAAGA AAACATGTTC
AAAACACATA TAGAAATTTT AAAAGATATT GTAATTGATG CTGCAAGTAT GGCTGAAAAT
GAAGCAAAAA AAGACATGGG AACATCAAGA GAGATTGAAC TTGAACTTGT GCGGTTTGGC
TATGAAGTTG AGAAGGTAGA CTATGTTGGA TACGATCACT ATTTCCAAAT AGATATAGAT
CTCAAAGATG GGTTTAAAGC TCCGAGAAAA ATGGAGATAG AAGAGATTGT GAAAGGAGTT
GTAGGTTGCA GTGTAGAAAT TATATCAGAG GTGCCAAAAA TCTCAGGGGG ATATACAGTT
TCCATTATCA AAAAACCAAA CGTGCACATA GATTATTCTA TATATTCAAA GAGCAAAGAA
AACATAAACG GTGACAGGGT GTGCTTTTTG CAGCTTAAAA ACGGGAAATT TTTAGCCTGC
ATATCAGACG GTATGGGCAC CGGAAAGACA GCTTCGGAAA ACAGCTTTAT TGTGATAGAT
GCTCTCAAAA AATTCTCATC ACTTGGGTTT GACAGAAAAA TCGCAATCAG GTTTATAAAC
TCACTTCTTA GTATAAAAAA CGCTGAAGAA TTTGCATCTG TTGACGTTGT GTGTATAGAC
AGGTTTACTC TTACGTGTGA GTTTTTAAAA GCTGGGGCAA TGCCAACCTT TATCAAAAGA
GGAAGTGAGG TTTTGACGGT TGAGTCAAAC TCTCTTCCGG TTGGAATAGA AGCCGAAAGT
CAGTTTGATT TTTCAACCTG CAAGCTTCAA AAGGGTGATA TGATATTTAT GTTCTCTGAC
GGGCTTTTTG AGCTCTTGGG TGAGGATGGT GATAGGATTT TGAAAGAGTT CATTGCCAAA
AACCAGTTTG TCTCAACCCA GAGCAGTGCC AAACAGATTT TTGAATGGGC AATTTCTAAT
TCGTTTTTGA TAAAAGATGA TGTAACCATA ATTGTCTTGA AAGTTGGAGG TGGACTTGAA
AAAAGAGGTG AGTAA
 
Protein sequence
MERLEVLQFK RQQTERPKNT ISDRFLVFSA QELFILILAF LLGRCSLFSR SFFSSAYVAS 
FKKRDYMYYL ASLFSIFGII SCSDKSSILK YVLSILLITT INHFFDLNLY SKALLCALSV
GSSGSISIFL FSKAPIEFLY LVLEMVGCFW AVIMFERFFA AIYPKKAYTA DQTVVVVVVL
ALSFLGLSNS LDSVLDIENI LFFILLFAVS LFHGMIMSTA MGFVIGLLES IKECRSIEMA
CVFAFSSLLA GLMKGFGKLG IALGGFCGYI ISMFYISSNP TLRFREILIS AVLFCLFPLE
KIVKLQSTDE REVQRMIKEK IFGVASIIEN IQQNVCSKPA VLICKDEAKN IVQSACQKLC
LDCGNSNVCW NIDYHRTNHS LNEIKNIILK KGKLSQEDLK EFRFLCEKAK EFEIIINGFL
ESLKYSKLVQ EASSPKENMF KTHIEILKDI VIDAASMAEN EAKKDMGTSR EIELELVRFG
YEVEKVDYVG YDHYFQIDID LKDGFKAPRK MEIEEIVKGV VGCSVEIISE VPKISGGYTV
SIIKKPNVHI DYSIYSKSKE NINGDRVCFL QLKNGKFLAC ISDGMGTGKT ASENSFIVID
ALKKFSSLGF DRKIAIRFIN SLLSIKNAEE FASVDVVCID RFTLTCEFLK AGAMPTFIKR
GSEVLTVESN SLPVGIEAES QFDFSTCKLQ KGDMIFMFSD GLFELLGEDG DRILKEFIAK
NQFVSTQSSA KQIFEWAISN SFLIKDDVTI IVLKVGGGLE KRGE