Gene Mesil_1046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMesil_1046 
Symbol 
ID9250539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMeiothermus silvanus DSM 9946 
KingdomBacteria 
Replicon accessionNC_014212 
Strand
Start bp1034837 
End bp1038268 
Gene Length3432 bp 
Protein Length1143 aa 
Translation table11 
GC content65% 
IMG OID 
Producttranscriptional activator domain protein 
Protein accessionYP_003684459 
Protein GI297565487 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.203225 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACCT TGCCGCACCC GCCCTTTCCC CGCAGTGGAG GAGTGTGGAG CCTCGAGCTG 
CTAGGGGTAG CCCGCCTGGT GCGCGGGGGG GTGGCGTTAG AGCCGCTCGA GCGCAAGACT
GCGGCCCTTT TCGCGATACT AGCCCTGGAA GGCCCCACCC CGCGCTCTAA GATCGCCGGG
CTCTTATGGC CTGAGGTAGA CGAAGGAAAA GCCCGGCGCA ATCTGCGCCA GCGGCTACAC
CGGCTCAAGG ACGTGCTGGG GGTGAGCCTG ATCATCCCCG AGGATACGCT GTACCTGGAT
CCCGCCCTCG AGGTAGACGC GGTGTGCCTG GAGTCGCTGG CTTTTACCGG GGATTACCCG
GGGGCCCTCA GGAAAGAGGG GGAACTCCTG GGCACCCATG ACTACGACGA TTGCCTCGAG
CTGGCCGAGT GGGTGGAACT CGCGCGGGAA CGGGTGCGCC GGGTGCGCCG CGAAGCGATG
CTAGCCGAGA TCCAACGCCT GGAGGCGGCA GGGGATTACC CGGCGGCGCT GCCTTGGGCT
GAGCGGCTTT TGAATGACGA CCTGGTTTCG GAGGAGGCCC ATCGCCGGGT GATGCGGCTG
TTGTACCTGA TGGGCGAGCG AGCCGGGGCG CTCAAAGCCT ACCACCGCTG CGGCGAGATC
TTGCGGCGCG AGCTGGGGCT GGAGCCCCTA CCCGAGACCG CTGAGTTAGC CCGTACCATT
GACCAGGGTG CAGCCTTGCC TCACCCTCCC GCCGCCCAGC GCCCGCGGAT TCCGCTCCAG
GTGCTGCGCC CGCCCACGCT GGTGGGCCGC GAGCGGGAAT GGTCCTTGCT GGAAGAGGCC
TACCAGCAGG GCAAGCTGGT GTTTTTGCGC GGTGAGCCAG GAGTGGGCAA GACCCGGCTG
GCGCTGGACT TCGCGGCATC CAAAGGCAGG GTTTTGCTGC TGGCGGCCCG CCCGGGGGAT
GCTGGGATTC CCTTCTCGAG CTATGCCCGC AGCTTGCGCG AGTCCTTGGC TGAACACCCT
GAGTGGGTCG CCCATATGCT GCCCTGGGTG CGCGCCGAGA TTTCGCGCAT ACTCCCTGAG
CTGGCCGAGG GGGTGCTTCC GCCCCCGCTG GCCTCTGAGG CCGAGAAGAT GCGGCTTTTT
GACGCCCTGG GGACCCTTAC CCTAAGTACC TACGCTGACG GTGAGCGGGT TTTGGTGAGC
GATGACGCCC AGTACCTGGA CCAGGCTTCC ATCGAGGTGA TGCTCTACCA GCTCAACAAG
TTCGGCAGGC TGGGCCATCC GGGGGGCTTT CCCTTCACCA TCGTCACCTA CCGGCGCGGT
GAGATGGATC CCGAGGTCGA ACAGAATGTT TTCTTTCCGC TCCTCGAGGC CGGAGCGGCC
ACCCTCATCG ATGTCGAGCC CCTCGAGCCG GATGCACTCA AAGCCTTGCT GCACAGCCTG
GAGGACCCGC TGTTGGATAG ATTGTCCCCG GCGCTTGAGC GCTACAGCGG CGGCAACCCC
ATGTTCGTGC TGGAGCTGCT CAAAAGCCTC TACGAAACTG GGCAGATCCA ACGGGGCCTG
CCGCAGCAGC TAGGGGTGCC GGGCAAGATC GGCTTTGTGG TCCAAAGACG CCTCGAGCGC
CTCTCCCCGG CGGGTTTGCG CCTGGTCCGT ACCGCCGCGG TAGCCGGGGT GGACTTCAGC
TCCGAGTTAG CTGGAGCGGT GCTGGGGGTG AGCCCCTTGG ATTTGGTAGA GCCGTGGGCT
GAACTCGAGG CCGCCCAGCT GATTTTTGGC TCTGGTTTTG TCCATGACCT CGTCCATGAG
GCAACTTTGG CGGGAATCCC GGTGCCCATC AAGGCCCTCC TGCACCGCCG CATCGCCGAG
TACCTAGAAG CCCACCGGGC CGAGCCGGCC CGCATTGCCC AGCACTGGCT CGAGGCCGAT
GAGAGCAACG CCATTCCCTA TCTGCTAGAA GCGGCCCGAG CGGCCCAGGT CACCTACCGG
CTTTCTGAAG CCTCCCGTTT TTACGAGCAG GCTGCCTGTA TCCTCGAGCG TTTGGGGCGT
TCGGAGGAGG CCTTCGCGAG CTGGCAGCAG GCTTGTGCAA CCCTGCTGCG CTTTGACACT
GGCCCACGCC ACGAAGCTGC TCTGGAGCGG CTTTTTGCCC TGGCCGAAAG CCCCCAGCAG
CGGGCCCAAG CCCACTTGGC CGAAGCCAGC TTGGCCGGCG AACGCCATGA CCGTGAGCGG
GCTGAGCAGA GCGCGAGGGA GGGCTATAGG TTGGCGCTCG AGGCCCAAGA CCCATCCCTC
CAGGCCAAGC TGCTCACCGC CTTAGGCCAA GCCCTGTGGT TGCAGGGGCG AACCGAGGAG
GTGGTGCCTC TACTCGAGCA GGCCGCCGAG ATCTATCGGC AGACCGGCGA CGAGCGCGGT
CTGGCTGGGG TGGAGTCCAA TCTGGGCACG GTGCTCGACC AGCTCTTTCG CCACCAGGAA
GCCATCGTGC ACCATGGCCG GGCCGCCGAG CGCTTCGCAG TTTTCGATGA CCGGCTGGGC
ACGGTCAAAG CCTTGCACAA CCTTGCGGTG AGCCACGCCT ATTTCGGCCT GGCCCACCTA
GCGCTGGAAT ACCAGCGTAG GGCCGAGGCC TTGCTGAGCG AGGTAGAGGA TGCTCAGATC
TTGGCGCTGC GCAGCGCGGG AAACCTAGCC GCCCGCTACG CCGACCTGGG GGAGTACACC
CGGGCTTGGG CGGAGTTGGA GCGCGCTTTG GGTCTGGTCC CGGAGGGCTG GGAGTGGGCG
CGGGCCCACT ACCGCACCGA AAAGGTTAAG CTGCTCCTGC TATTGGGGGC GCTGGAGGAA
GCCCAGGCCG AACTGGGGGC GGTGCTCGAG CTGCGCGCTC AGGGATTAGG TCTGCCAGAC
GACATCTTCA AAGTAGCCCT GACCCTCCAG GCACAGATCA AGGCATGGCG GGGGGAATCA
CCCCACCCTG AGCTAAGCCA GGCCCAACCG CTGTCCGATG CCCACCGTCC CTATAACAAA
ATCCGCTTCA TGCTGGTCAA GGCCACCCTC CTGAACCCCC AGGAGGCCTT GCCGTTGGCT
CAGGAAGTGC TCGAGTTTGC CCTCGAGCAC CAGCTTCCTG CGGTCGAGTT GGCGGCCCGG
ACCCGAGTGG CGCAGGTTTT GCTGCGGGTG AAGAAGCAGC GCCGGGCCCT CGGGCATACC
GAGGCGGCGA TGAGCCTGCT ACGAACCTAC GATCCCACCG ATTTCTACCG GGGGGAGATT
CTCCTGACCC ACTGCTTGGC CCTGGAGGCC GCCCGGCAAC CGGGAAGGCT GGGCCACCTC
GAGGCCACCC TGAAATGGCT GCTCGAGACC GCCGAGGGAA AGGTTCCGCC CTTGTACCAA
AAAAGCTTCC TGTGGCAAAA CCCCACTAAC CGAACCATCC TCGAGATGGC GAGCCAGCAG
GGGATCGCCT GA
 
Protein sequence
MATLPHPPFP RSGGVWSLEL LGVARLVRGG VALEPLERKT AALFAILALE GPTPRSKIAG 
LLWPEVDEGK ARRNLRQRLH RLKDVLGVSL IIPEDTLYLD PALEVDAVCL ESLAFTGDYP
GALRKEGELL GTHDYDDCLE LAEWVELARE RVRRVRREAM LAEIQRLEAA GDYPAALPWA
ERLLNDDLVS EEAHRRVMRL LYLMGERAGA LKAYHRCGEI LRRELGLEPL PETAELARTI
DQGAALPHPP AAQRPRIPLQ VLRPPTLVGR EREWSLLEEA YQQGKLVFLR GEPGVGKTRL
ALDFAASKGR VLLLAARPGD AGIPFSSYAR SLRESLAEHP EWVAHMLPWV RAEISRILPE
LAEGVLPPPL ASEAEKMRLF DALGTLTLST YADGERVLVS DDAQYLDQAS IEVMLYQLNK
FGRLGHPGGF PFTIVTYRRG EMDPEVEQNV FFPLLEAGAA TLIDVEPLEP DALKALLHSL
EDPLLDRLSP ALERYSGGNP MFVLELLKSL YETGQIQRGL PQQLGVPGKI GFVVQRRLER
LSPAGLRLVR TAAVAGVDFS SELAGAVLGV SPLDLVEPWA ELEAAQLIFG SGFVHDLVHE
ATLAGIPVPI KALLHRRIAE YLEAHRAEPA RIAQHWLEAD ESNAIPYLLE AARAAQVTYR
LSEASRFYEQ AACILERLGR SEEAFASWQQ ACATLLRFDT GPRHEAALER LFALAESPQQ
RAQAHLAEAS LAGERHDRER AEQSAREGYR LALEAQDPSL QAKLLTALGQ ALWLQGRTEE
VVPLLEQAAE IYRQTGDERG LAGVESNLGT VLDQLFRHQE AIVHHGRAAE RFAVFDDRLG
TVKALHNLAV SHAYFGLAHL ALEYQRRAEA LLSEVEDAQI LALRSAGNLA ARYADLGEYT
RAWAELERAL GLVPEGWEWA RAHYRTEKVK LLLLLGALEE AQAELGAVLE LRAQGLGLPD
DIFKVALTLQ AQIKAWRGES PHPELSQAQP LSDAHRPYNK IRFMLVKATL LNPQEALPLA
QEVLEFALEH QLPAVELAAR TRVAQVLLRV KKQRRALGHT EAAMSLLRTY DPTDFYRGEI
LLTHCLALEA ARQPGRLGHL EATLKWLLET AEGKVPPLYQ KSFLWQNPTN RTILEMASQQ
GIA