Gene Mesil_2154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMesil_2154 
Symbol 
ID9251668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMeiothermus silvanus DSM 9946 
KingdomBacteria 
Replicon accessionNC_014212 
Strand
Start bp2167842 
End bp2171261 
Gene Length3420 bp 
Protein Length1139 aa 
Translation table11 
GC content65% 
IMG OID 
Producttranscriptional activator domain protein 
Protein accessionYP_003685533 
Protein GI297566561 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0522112 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCCCT ACCTCAGCCT ACTCGGCCCG CCAACCCTCA CTGTCAGGGG AAAGGTCACT 
GAACTACCCC AGCGCAAGGC AGTGGCCTTG GCGGCCTACC TGGCTACCCG GCGGGAGCCG
GTGGAGCGTT CGGTCTTGGC TGGGCTTTTG TGGGAGGGGG ACGAGGAGGC GGCACGGAGA
AATCTGCGGC AGGAACTCTT CCGGCTCAAG GGTTCTGCGC TCGAGCCCTT GCTCGAACAG
ACCCCTCAAA CCCTCGCTTT GGGCGAGGTA GATACCGACC TCGAGGCTTT TTTGGGTCAC
CTGGCGCGGG GTGAGTGGGC GAAGGCGGCG GGAATCTGGC GAGGAGGGTT CTTACCGGGA
TTCGAAGTGC GGGGGGGTGA GGCTTTTTTG GACTGGCTCT TGCCCGAACA GGAACGCTGG
CAGAGTCTGT ACCGCGAGGC CATGCTGGGC TGGGCCCGTA GCCGGGAGGC TGCCGGAGCC
TACCAGGAAG CGCTGGAGAT CTATCAGCGG ATGCTGGAGG CCGATCCTTA CCAAGAGCAA
GAGCAGCAGG CAGTGATGCG GCTCTACGCC CTTTTGGGCG AGACTCCGGC GGCGCTGCGG
CAGTACGAGG GTTACCGCGA GGTACTGCGC CGGGAGTTCG GAGTAGAACC CTCCTTACAG
ACGCAGGCCC TGTATCGTCG CTTGCGCCAA GGCAAACCCC TGGCCGAGGC GGAGGGTTGG
CTACTGCCCC GCGGGTTGAG CGAGCCGCCG CTGGTGGGCC GGGCCGAGGA CTGGATGTGG
CTCGAGGCTA ACCTGCGCTC AGGGGTGTTG CTTTTAGTGA CCGGTGAACC CGGGGTGGGG
AAGAGCCGCT TGACCCAGGA GTTTTCCCGG CGGCGAGGCC GAATGCTCAC TGTGCGGCAG
CGCGAGAGCG GCCGGGGGCT TGGCTTTAGC GGTTTCATTG AGGCGATACG GGTAGCGCTC
GAGCAGGGCT GGAGTCCGGC CGGGCTGGAT GCCGCTTGGC GGGACCAGCT GACCTGGCTG
GTCCCCGAGT TGCTCAGCAA ACCCGAAGCG TCCTTTCGCC GAGAGAGCCG GGAGGGGTTG
CTTCGCGGTG CTGGCCAGGT GAGGTACCTC AAAACTCGTC CCCTTACCTC CCGCTCAGCC
AAGTCGCATC TCTTCGAGGC CTTGGCCCGC TTCGTGCAGG ACTGGGTGGG GCCGGGTGGG
ATCTTGCTGT GGGAAGATGT ACACTGGGCC GACGAGTCCA GCCTGGAGTT TTTACCCTAC
CTGGTGCAGC GTTCGGGGGG GTTAGGGTAT ATGGTGCTGG CGACGGCTCG CCCCGAAGAG
CAGTACCCGG CCCAACTGCG GGCTGTCCTC CAGGAACTCA AGGCCGAGCA GGCGGTGAAG
ACCCACCCCT TGCGTAACCT CGAGATGGCC GAGGTGCTGG AGCTGATCCG CCAACTCTCG
GGCCAAGAAT CGGGGGGAAG CCTCTTCTCC GAGCGGCTGT ACCAGAGCAC CGGGGGAAAC
CCTTTTTTCC TGCTCGAGAC CTTACGTTTT TTGTTTGAGC AGGGGCTCTT GCGGGTAGGA
GAGGGCAGCT GGCACACCCC TTACGACGCC TTCACCGCCG ACTACCGCGA ACTCCCGGTC
CCGCCCAGCG TACAAGGGGC GGTGTTGAGC CGGTTTCAGC GCCTTACGGA GGAGGTCGCA
GGGACTCTGT CAACGCGTGA ACAGGCCCTA TCGGTCGCTC AGGCGTTGGC CCTAGCTGAC
CAGCCGCTTA GCCTCGAGGC CCTGCGGGAT TTGCTTGCGG AGGAGTCGCC ATCGGACAAC
GCTCTGCTGG CAGCGCTGGA CCAGCTGGTG CAGGCCGGAT TGGTTCGGGC CGGATCGGAT
TCGCCAGGCG GGGTGAGATT CAGCCTAAGC CACGAGCTTT CCCGCCAAGC CATTCTGGCT
GCGATGCCCG AGACCACCCG GATGCGCTTT CATGCACGCT TCGCCGAGCT ATTGCGGATC
TCCGGAGCAG CTCCTGAGCG CTTAGCTCCC CACCTGCACC TGGCAGGCCG ACCGCGTGAG
GCGGGCCAGG CCTATCTCCA AGCGGCGCGC GCGGCTCGCT CGGGGCCGCT GGCCGCGCAG
GCCCTTAGCT ACTATGCCGA AGCCCGGAGC CTGCTTGGGG AGAGCCTCAA CCCGGCAGAC
TCCTTCGCAT TGCTGGCCGA GATCGCCGAA CTTAAGCTGA CCTTGGGGGA GAATCCTCGG
CGCGAAGTAC GCCAGATGAA CCCGCTGGCC GCCGAGCTGG GCAAAGCGGC CCAGTACCGG
CTGCGCCTTT TGGAAACGAA CGCAGCGGTG TTGACCGGGG TCGTGGAGGA GGGTATCGTC
TCGGCTCGGC AGGCCCTCGA GCTAGCCGAG AACCCCCTCG AGCGAGGTCA GGTGCTGTTT
CGGCTGGCCT GGCTGGAGTA CCGGGGCGGC GACCCCGACG CTCAGCTCGA GCCCTTAGAA
CAAGCCATTC AAGCTTTTCA TGAGGCGGGC GAAAGGAGCC TAGAGGCCCA AGCCATACGA
AACCTTTCCG GTTACTGGTT GCGGCTGGGC CAGATTGAGG CCTTCGAAGC AGCTTACGGC
GAGGCGTTGC GGTTGGCCGA GGCGACCCAG GATCGCTATC TGCTGCGTCG GCTGATGGCT
GACCGGGCCA ATGTGGACTG GGTGATGGGG CGTTACGGCC AGAGCCTGGC GGAGGGTGAG
CGCCTTTTGG CCGAAGCCCG CCAGACCGGA GACCTCTGGG CGGTGTGGGA TGCCTTGCAG
GTGCAAGCCC TCAATGCCAG CGTGTTGGGG CTCGATGAGG GGCTCGAGCA GGCCCTTCGG
GATTCACTTA AGGAGGCCGA GGCAGCCGGG GCCTGGCGCG ACCGGGCGGT GCTGCGGAGT
GACCTGGGGG CTGGCCTGAT GGCGATCAAC CGCCTGGCCG AAGCCCGCGA ACACCTCTCC
ATCGCCCTGC GCGAACTTCA GGAACTCGGC GAACGAGCCC GCTTGGGGCA TACCCTTTTC
GCCTTAGGGT GGACCCTCCT GGATTCGGGA GAGCCCGAAT TGGCCGAGGC TTACCTAAGC
GAAGCCGCCG ACTTGTGGAG GGAGCGCAAA GAGTGGCGCC ACCGGGCCCG CTCGCTAGCG
GCTTTGGCGT TGGCCCGCTT GCGCTACGGG AACCGCGCCA AAGCCCGCGA GGCCGCCCAG
GAGGCCATGC ACTACCTCGA GGACTGGGCC AAGGGGCTTT TCGACCTGCC GCTGGTGCTC
TACGCCTACG CCCGCGCCCT GGGCGACCGC GAGGGGCGGC CCTACTTGGT CCGCAGTCAG
AAGGTGATGC ACGAACTGGC GGCGACGCTC GAGCCCGCCC TGCGCGAGCG CTTGCTGGCT
AACCGCTTCG TAGCGCACGC CCTGAGTAAG TCCAATAACG GTGGCATAAC GGGGATTTAA
 
Protein sequence
MKPYLSLLGP PTLTVRGKVT ELPQRKAVAL AAYLATRREP VERSVLAGLL WEGDEEAARR 
NLRQELFRLK GSALEPLLEQ TPQTLALGEV DTDLEAFLGH LARGEWAKAA GIWRGGFLPG
FEVRGGEAFL DWLLPEQERW QSLYREAMLG WARSREAAGA YQEALEIYQR MLEADPYQEQ
EQQAVMRLYA LLGETPAALR QYEGYREVLR REFGVEPSLQ TQALYRRLRQ GKPLAEAEGW
LLPRGLSEPP LVGRAEDWMW LEANLRSGVL LLVTGEPGVG KSRLTQEFSR RRGRMLTVRQ
RESGRGLGFS GFIEAIRVAL EQGWSPAGLD AAWRDQLTWL VPELLSKPEA SFRRESREGL
LRGAGQVRYL KTRPLTSRSA KSHLFEALAR FVQDWVGPGG ILLWEDVHWA DESSLEFLPY
LVQRSGGLGY MVLATARPEE QYPAQLRAVL QELKAEQAVK THPLRNLEMA EVLELIRQLS
GQESGGSLFS ERLYQSTGGN PFFLLETLRF LFEQGLLRVG EGSWHTPYDA FTADYRELPV
PPSVQGAVLS RFQRLTEEVA GTLSTREQAL SVAQALALAD QPLSLEALRD LLAEESPSDN
ALLAALDQLV QAGLVRAGSD SPGGVRFSLS HELSRQAILA AMPETTRMRF HARFAELLRI
SGAAPERLAP HLHLAGRPRE AGQAYLQAAR AARSGPLAAQ ALSYYAEARS LLGESLNPAD
SFALLAEIAE LKLTLGENPR REVRQMNPLA AELGKAAQYR LRLLETNAAV LTGVVEEGIV
SARQALELAE NPLERGQVLF RLAWLEYRGG DPDAQLEPLE QAIQAFHEAG ERSLEAQAIR
NLSGYWLRLG QIEAFEAAYG EALRLAEATQ DRYLLRRLMA DRANVDWVMG RYGQSLAEGE
RLLAEARQTG DLWAVWDALQ VQALNASVLG LDEGLEQALR DSLKEAEAAG AWRDRAVLRS
DLGAGLMAIN RLAEAREHLS IALRELQELG ERARLGHTLF ALGWTLLDSG EPELAEAYLS
EAADLWRERK EWRHRARSLA ALALARLRYG NRAKAREAAQ EAMHYLEDWA KGLFDLPLVL
YAYARALGDR EGRPYLVRSQ KVMHELAATL EPALRERLLA NRFVAHALSK SNNGGITGI