Gene Mesil_0243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMesil_0243 
Symbol 
ID9249721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMeiothermus silvanus DSM 9946 
KingdomBacteria 
Replicon accessionNC_014212 
Strand
Start bp250641 
End bp253892 
Gene Length3252 bp 
Protein Length1083 aa 
Translation table11 
GC content68% 
IMG OID 
Producttranscriptional activator domain protein 
Protein accessionYP_003683695 
Protein GI297564723 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCAAC TCTTACGCCG CTTGCGGATG GCGGCAGGCG AGGTGGTGCT GGGCGAGGAC 
CTGATCGAGC TAGCCCCCGA GGTGGAGGCC GACGTGGGGC GCTGGTCCTA CCTAGAATCG
CCCTTCCTGA GCCAGCTGCG CCAGGAAGAG GCTTTTCTGA AGGGAAGCGA CCACGATGAC
CTTCCCGAGC TGGCCGAGTG GGTAGAGAGC ACCCGCGCCG AACTCCGTGA GCTGCGGGCC
CAGGCCGCCG AAGCCGAGGC CACCCGGCTG GAGCGGGCGG AGAACTTTAA GGGGGCCCTC
GAGTACGCTC AGATTCGCTT GCGCATGGAG CCGCTCTCCG AGGATAGCTA CCGGCAAGTG
GCGCGGTTAC AGTACCTGTT GGGTGACCGG GCTGCGGCGC TCGCCACGCT CGAGCGCGGC
CGGGCGATGC TCGAGCAGGA GCTAGGGGCC GAGCCTCTGC CGGAGACTCT GCGGCTGCTG
CGCATGATCG AGTCCGGCGC CCACCTGCCC GACGCCCCCC CAAAGCCCAA GAGCCCGCTG
ATCCCCGCCA CCACCCTGCG GCCCCCGGTG CTGGCCGGGC GTGAACGGGA ATGGGCCATG
ATGGAAGAGG CCTGGGAAGC GGGGAAGCTG ATCTTTCTCA AGGGCCAGCC GGGGGTGGGC
AAGAGCCGCC TGGCCGCGGA TTTTCTCGAG CACAAAGGCT CCTATATCCG GCTCGAGGCC
CGCCCCGGTG ACCCGCATGT GCCCTATTCC TCCAACTTCC GCCACCTGCG GGCGATTCTG
GCCAAATATC CCACCGAGCC CCTACCGGAC TGGGTGCGGC AGGCCCTGGC CCCCTGGATG
CCCGAGCTGG GGCCGAGCCC TGCCCCCGTC GAGCCGAGCC CGCTCCAGCA GGCCCGCTTT
TTCGAGGCTC ACCGAGAGAT CTTCAGCATC CTAGTACGCC ACCACGAGGC TATGCTGCTC
GACGACCTCC AGTTTCAGGA CGCCTCCAGC AACCAAGTGG GGGCCTATCT GATGAGTTCG
GTGCTGCCAC TGGGGCCTAG CGGGCTGCGC GGTTATATCG GCTGTTACCG CAGTGGGGAC
AACTCCGAGG GCATGGACCG CTTCGCCGCC CAGTTCGTGG AGACAGGAGA GGGCGTGTTG
ATCGAACTCG AGCCGCTCGA CCCGGACTCG GTGTTGACAC TGTTGCAAGG TCTCGAGCTG
CCCGGAGCCG ACCGGCTGGC GGCGGGGCTG TCACGCTACA CTGGGGGAAA CCCCTTATTC
ATCCTCGAGA CCCTCAAGCA CCTCATCGAG ACCGATACCT TAGAGCGCGG CTTACCGGCC
CGGCTGGCCC CGCCTGGTCG GGTGGCCCCG CTGATCCAGC GCCGCTTGCA ACGGCTCACC
CCGGCGGCTT TGAACCTGGC CCGGGCCGCC GCGGTAGCCG AGACCGAGTT TGGGCTGGGG
CTGGCCCAGG AGGTGCTGGA GCGCTCGGGC TTGGAGCTGG CCGAGAGCCA CGCCGAACTC
GAGGCCGCCC AGATCCTGCG CGGCAACGCC TTCACCCACG ATCTGGTGTT CGAAGCGGTC
TTAGCAGGCA TCCCTACCGC GGTCAAGCAG GTGTTGCATA GCCGCACCGC CAAATACCTA
GAGATGATCG GGGCTGATCC CGCCCTCATC GCCCAGCACT GGTTGGAAGC CGACGAGCGC
AAGGCCGTCC CCTTCCTGCT GGAAGCGGCC AAAGCGGCCC GCTCCACCTA CCACCTCTTC
GACGCGGCGG ATTTTTATGA ACGAGCTGCG GCCCTGCTGG AACGCCAGGA GCGCCCCACT
GAGGCCGCCG AAGCGCTCCT CAACGTCTGC GAGTTCATCC TTGACTTCGA TACCGGGGCC
CGCGCCGAGC GTTTAGCCCA GAAGATACTC GAGCTGGCCC GTGACCCCCG CTCGAGTTCG
CGGGCCTGGC TCTACCAGGC CACCCTGCAC CTGCACCGCG GGCAGACCCC CGAGGGCGAA
CGGGCAGCGC AGCAGGCCCT GGAAAACGCG CTGCGCTCCG GCGACCGGGG GCTCGAGGTG
GATCCTCTAA ACCTCCTAGG GATCGTGTGG CGCCGCCAGG GCCGCTTTGA GGAGTCCCGA
GCCGCCCTCG AGCAAGCCCG CGAGCTGTGC TTGGAGACCC ACAACGAGAC CCTGCTGGCC
GCCGTGCTGA GCAACCTGGG GCTGGCCCTC CAGCAGCTCA ACCGCTACGC CGAAGCCGCC
CAGCGCTTCC AGGAAGCTTT CGCGCTGCAA AAAGACCGCA CCACCCGGGG GCGGGTGCTC
AACAACCTGG CGATCTGCTT AGGACAACTG GGGCGGAGCC GCGAAGCCCT GGAAACCCTG
GAGCGGGCCC GAGAGATGCT GGCCGAGACC GAGGGGGCCA CCGGGGCTCA TCTGGTGGTG
CTGACCTCCC TGGCTAACCA TCACCGCCTC CTGCTGGAGT ACCGGCGCTC GTTGGAGTAC
CTTGAGCAAG CCCGAGCCAT GGTGGAGGGA TACCAGCACT GGAAGCTGGA GGACCTCTAC
CGCAACTTCG CTCGGACATT AAACCGAGCT AGGCCAGTTC GAGCAGGCCC AGAGCTATCT
CAACCGGGCG CTGGAGGAGT TCACCGAGGC CTACCAGGAG GCCGGGTTGG TCTGGCTAGA
GCAGATCCGC CTCTCGAGCT GGAGCGGGCG GAACCCGCAG TCCCCGCTAA ACCGGGCCCG
ACCCATGCTG GGGGAAGCGG CCAACTTGTC GGGGTATCGC TTCCGCTTGG AAGAGGCCCG
GCTGCTTCCC CCCGCCCAGG CCCTCAAGGC CCACCGGCAA AGCCTGGTCT TCGCCCAGCA
ATACGGCCTC AAGGGGCTCG AGATCGCGGC GCACACCCTC GCCGCCCAAG CCCTGCTGCA
CCTGGATCAC CCCGCCGAGG CGCTCGAGCA CACCCGCAAC GCCCTTGCTC TCCTGGAGAC
CTACACCCCC GATCTCTACG CCGGGGAGGT GCGGCTCACC CACTACCAAG CGCTCCAGGC
CAACGGCGAT CCGCAGGCAG CGGCCTACCT CACCGAGACT GCCGCCTGGC TCCGCGAGAT
CGCCCAAGAG AGGGTGCCCC CCCAGTACCG CTCGAGCTTC CTCGAGCACA ACCCCTTCAA
CCGCGCCATC CTGCAAGCCG CCGCGCAACC CCGCTAAGCG GCCCCGACCT GCCGGGCGTG
GCGCTCGAGC CCTCCGAGGA GGCCTCGCAT CGCATGGCGG CTGTGAATCG AGGCCGTGAC
CGGCCAGGGT AG
 
Protein sequence
MRQLLRRLRM AAGEVVLGED LIELAPEVEA DVGRWSYLES PFLSQLRQEE AFLKGSDHDD 
LPELAEWVES TRAELRELRA QAAEAEATRL ERAENFKGAL EYAQIRLRME PLSEDSYRQV
ARLQYLLGDR AAALATLERG RAMLEQELGA EPLPETLRLL RMIESGAHLP DAPPKPKSPL
IPATTLRPPV LAGREREWAM MEEAWEAGKL IFLKGQPGVG KSRLAADFLE HKGSYIRLEA
RPGDPHVPYS SNFRHLRAIL AKYPTEPLPD WVRQALAPWM PELGPSPAPV EPSPLQQARF
FEAHREIFSI LVRHHEAMLL DDLQFQDASS NQVGAYLMSS VLPLGPSGLR GYIGCYRSGD
NSEGMDRFAA QFVETGEGVL IELEPLDPDS VLTLLQGLEL PGADRLAAGL SRYTGGNPLF
ILETLKHLIE TDTLERGLPA RLAPPGRVAP LIQRRLQRLT PAALNLARAA AVAETEFGLG
LAQEVLERSG LELAESHAEL EAAQILRGNA FTHDLVFEAV LAGIPTAVKQ VLHSRTAKYL
EMIGADPALI AQHWLEADER KAVPFLLEAA KAARSTYHLF DAADFYERAA ALLERQERPT
EAAEALLNVC EFILDFDTGA RAERLAQKIL ELARDPRSSS RAWLYQATLH LHRGQTPEGE
RAAQQALENA LRSGDRGLEV DPLNLLGIVW RRQGRFEESR AALEQARELC LETHNETLLA
AVLSNLGLAL QQLNRYAEAA QRFQEAFALQ KDRTTRGRVL NNLAICLGQL GRSREALETL
ERAREMLAET EGATGAHLVV LTSLANHHRL LLEYRRSLEY LEQARAMVEG YQHWKLEDLY
RNFARTLNRA RPVRAGPELS QPGAGGVHRG LPGGRVGLAR ADPPLELERA EPAVPAKPGP
THAGGSGQLV GVSLPLGRGP AASPRPGPQG PPAKPGLRPA IRPQGARDRG AHPRRPSPAA
PGSPRRGARA HPQRPCSPGD LHPRSLRRGG AAHPLPSAPG QRRSAGSGLP HRDCRLAPRD
RPREGAPPVP LELPRAQPLQ PRHPASRRAT PLSGPDLPGV ALEPSEEASH RMAAVNRGRD
RPG