Gene Mesil_2051 details


Gene Information

Locus tag: Mesil_2051
Symbol:
ID: 9251564
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Meiothermus silvanus DSM 9946
Kingdom: Bacteria
Replicon accession: NC_014212
Strand:
Start bp: 2051793
End bp: 2055035
Gene Length: 3243 bp
Protein Length: 1080 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: SMC domain protein
Protein accession: YP_003685431
Protein GI: 297566459
COG category:
COG ID:
TIGRFAM ID:
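
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 2051793
END_BP = 2055035


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of codons minus the stop codon, assuming an unspliced CDS."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))                  # 3243
print(protein_length(gene_length(START_BP, END_BP)))  # 1080
```

Both results match the Gene Length (3243 bp) and Protein Length (1080 aa) fields.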


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.380302
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.930097
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATCG ACCGTCTCAT TCTGCAAGGG TTCAAATCGT TTGGCGAACG CACGGTGCTT 
GAGTTTGGCT CCGGGGTGAC GGGGATCGTA GGTCCTAATG GTTCGGGCAA AAGCAACCTC
GTCGAAGCGT TGCGCTGGGT AGTGGGAGCG AAGCCCCGGG AGCTACGCGG GGAAGAGGCT
CAGGCTTTGT TGTTCCACGG ATCGGATGCT CGAGCGCCAA TGCCCTTTGC GGAGGTAGTG
CTCGAGCTTT CCCGCGGCTC CGAACGGCTC ACCGTAAGCC GCCGCCTAGA CCGTGACGGG
GAGGCCGAGG TGCGGCTGGG GCATAAGGTT TCCACCTTGC GGGCGGTGGA ACGAGCGCTG
GCCGGAGCTG GGCTGGGGCG GGGCGGCTAC GCAGTGATTG GGCAGGGCGA GATCGGTAGC
ATCCTGCAAG CCGGCCCGGA GGTGCTCCTG GGCTACCTCG AGGAAGCCGC TGGACTCAAG
GCGGTAGCCT TGGCCGCCAA CAACACGCGC GAACGGCTTG CAGCAGCGGC GCAGGAAATG
CAGGCCCTCG AGGCCGAGCA CGCCCGCATG CAGGGGGCCC TGCGGGAAAA ATCGGCTCAG
GCCGAGGCCG CCCGCCAAGC CCGAGCCCTA AGCACGCAAA TCCTCCGGCT GCGGCACGCG
CTGATCCGGG TCAGGGCTGA GGAGGCCCTC GCTGAAGCCC GTAAGGCCGA GGAACGGATC
ACCGCGCTCG AGGCCGAGCG GCAGGAACTC TCGGAACGCC TGGCGCAGAT ACAGATCCAG
AAAACCCAAG CCCAAACCGC CCTCGAGACC CTCCAGACAG CCCACGCTGA GGCCTTGCGT
CAAGCCGAGG CGCTGGTCGG GCGGCGTCGG CTGCTGCAAC AGGAGCGGCA ACACCACGCC
GATCTGGCCC GAAGGTTAGA GCGCGAACAC AGCCTGCTAG AAAGCGAACA CTCCCGCCTG
GCCGCCCTAC AGCCCCCGAA GCCGCCCCAG ATTCCGGAAG AGCAGGTAGA AAAGCGCTTG
GCGGAAGCCA GCCGGCTGCA ACACATCGAA ATCGAACTGC ATGAGGCCCA GTCTGCCCTG
CGTGCGGCTC AGGCCCGCTA CGAGGCCTAT CTCAAAGCCC AAGCCACCTA CGAGGCTCAG
CGGACTGCCT TCCTCCAGGC CCAGCGCGAG CGCGAACGGC TCGAGGCGGA GCAGGCGCAG
CTTTCCCAAC GCCTAGCCGA AGCGACCCAG CACCGCCAGC AGGCCGAGAT CGCCGAGAAA
GCCCTGCGCG CCGAATTGAA CCAGCGGGTG GAGCAGGAGA GCAAACTATC CGCCGAAGCC
CGCGCGCTAC GGGCAGAGGT AGAGCGGCTG GAGGCCTTTT TGCAATCTGG GGCCGACCTC
ACCGAGGGGC CCCGCCGGGT CAAAGAGGCC AGGCTCGAGG GCATCATCGG CGTGGTCGCG
GACCTGCTCG ACGTTCCCGA GGGGCTGGAA CTCGCGGTCG AGGTGGCCCT GGCCGCTCGG
CTCCAGTGGG TGCTCACCGA GGACGACCGC TCGGCCCAAG CAGCCATCAA GCTGCTCAAG
CAAAAGGGTG GACGGGCCAC CTTCTTGCCC CTGACCCTCC TGCGGCCCGC CGCCAGGCCC
CGCCGCGACT GGAGCCAGGA AAAGGGGGTT CGCGGCCTGG CGCGGGAGCT GGTGGAGGTG
CGCGGCTACC CGCAGGTAGG GGCCACCCTG TTTGGCGAAA CGCTGGTGCT AGAGTCGCTA
GAGGCGGCCC TTTCCCTGGC CAAGCGTTAC CCCGACGCGC CGCGCATGGT CACCCGGGAG
GGCGAGCTCC TTGAGCCCAG CGGGGCCCTC ACCGGCGGAA AGCTGCCCAA AGGCGGGCAG
ATGCTGGCCC TGCGGCGCCG GGTGCGGGAG GCAGCCGCCC AAGCCGAAGG GCTGGAGGGC
GAGATACTCC GGCTTGCACA ACAGGCCCAG CGCCTGCGCG AGGAGCTGGC TAAACTCGAC
CTCCCAGCGC TCCGCCAACA GGAGCAGACC CTGCAGGCCG AGCTGCGCTC GCTTGGGGCC
AACCTCGGGC GGCTACCTAA GGTCTCGGCC CCCCAAGCCC CCGAGCCGGT GGAACCCCCC
GAACCCAGCG GGCTCGAGGC CCTCTTCCGC GAGCGCGAGA GGCTCCGTCA GGATTTGCAG
GAGGCCCGCG AACTCCAGAT GGCCTGGCGG CGCTACCGCG AGGACCTAGC CCGCTACCAG
GAGGCCCAAA CCCGCCTTGC CGAGCTTATC CAGCGGAAAC ATGCCCTGCA AAACGAACGG
CAGGGAATCG AGGCCCGCCT AAGAGAAATC ACCCTCCAGG AAACCGACCT GGCCGCCCAG
GAAGCCGAGC TAGGGCTTGC CGCCCTCGAG GCCGACCTCC GCGAAGCCCG CCAGGCTACC
CGAGCCTTAG CCGACGAGGA ATCCCGGCTG CTCTCGCGTA CTAACGCGGT GCTGGCCGAA
CTTGAGCAGT CCCGCATTAC CCGGGCTCGC CGCGAAGCCA CCCTGGAAGC TCTCCAGACC
GAGCAATCCG AGCTTCCGCC GGTAGAGGGA GAACTCCCCC AGGGCAGCCA CCGTACCCTA
ACCCGCCAGT TGGCCGAAGC CGAGGCCGCT CTCCAGGCCC TAGGTGCAGT CAACCACCTA
GCCGAAGCCG AGCATCACTC CCTTGCCGAG CAGGCTGAAA CGCTCCAGGC TGCTTTGCGC
GAGGCCGAGG AAGTGATGAG CAAGCTCGAG GCCGAGCTCG AGGCGGTCGA GCGCGAGTAC
CAGGGGAAGC TCACGGTAGC CTACGGGCGC TTCCGACACA AGTTCGCTGA GTACGCCGAG
GCTTTGTTGG GTGCGGAGGC TCGGTTGGAG ATGCTTCCTT CGCCGCCATC CAGCGGGCTG
CCCCATCACC GCGGATTGCA CCTGGTGCTG CGCCCTGCGG GGAAGCGGAC CGTGGACCTC
AATCTGCTCT CTATGGGTGA GCGTACCATG GGGGCCTTGG CTTTCCTCTT TGCCTTATCG
GAAGCCTCGG AAGAGGGGAG AGGGTTACCG GTAGCTGTAC TTGACGAGGT AGACGCCCCC
CTCGACGAAG CCAACATCTT AAGGTTTGCC GGTTTTCTGC GGCGCTTTTC CCAGGAGACC
CAGTTCATCC TCATCACCCA CCAGAAACGC ACCATGGAGG CCTGCGACGC ACTGTACGGA
GTGACCAGCG AGCAGGGCTT GAGCCGGGTG TATAGCATCC AGCGGGATGA GGCCCTAGCT
TGA
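
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned and the GC percentage computed; the helper names are hypothetical, and only the first line of the sequence is used as sample input, so the printed value describes that excerpt rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAAGATCG ACCGTCTCAT TCTGCAAGGG TTCAAATCGT TTGGCGAACG CACGGTGCTT"
seq = clean_sequence(first_line)

print(len(seq))                   # 60 bases in this excerpt
print(round(gc_content(seq), 1))  # GC percentage of the excerpt only
```

Passing the full 3243 bp sequence to `gc_content` would give the figure to compare against the "GC content" field.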
 
Protein sequence
MKIDRLILQG FKSFGERTVL EFGSGVTGIV GPNGSGKSNL VEALRWVVGA KPRELRGEEA 
QALLFHGSDA RAPMPFAEVV LELSRGSERL TVSRRLDRDG EAEVRLGHKV STLRAVERAL
AGAGLGRGGY AVIGQGEIGS ILQAGPEVLL GYLEEAAGLK AVALAANNTR ERLAAAAQEM
QALEAEHARM QGALREKSAQ AEAARQARAL STQILRLRHA LIRVRAEEAL AEARKAEERI
TALEAERQEL SERLAQIQIQ KTQAQTALET LQTAHAEALR QAEALVGRRR LLQQERQHHA
DLARRLEREH SLLESEHSRL AALQPPKPPQ IPEEQVEKRL AEASRLQHIE IELHEAQSAL
RAAQARYEAY LKAQATYEAQ RTAFLQAQRE RERLEAEQAQ LSQRLAEATQ HRQQAEIAEK
ALRAELNQRV EQESKLSAEA RALRAEVERL EAFLQSGADL TEGPRRVKEA RLEGIIGVVA
DLLDVPEGLE LAVEVALAAR LQWVLTEDDR SAQAAIKLLK QKGGRATFLP LTLLRPAARP
RRDWSQEKGV RGLARELVEV RGYPQVGATL FGETLVLESL EAALSLAKRY PDAPRMVTRE
GELLEPSGAL TGGKLPKGGQ MLALRRRVRE AAAQAEGLEG EILRLAQQAQ RLREELAKLD
LPALRQQEQT LQAELRSLGA NLGRLPKVSA PQAPEPVEPP EPSGLEALFR ERERLRQDLQ
EARELQMAWR RYREDLARYQ EAQTRLAELI QRKHALQNER QGIEARLREI TLQETDLAAQ
EAELGLAALE ADLREARQAT RALADEESRL LSRTNAVLAE LEQSRITRAR REATLEALQT
EQSELPPVEG ELPQGSHRTL TRQLAEAEAA LQALGAVNHL AEAEHHSLAE QAETLQAALR
EAEEVMSKLE AELEAVEREY QGKLTVAYGR FRHKFAEYAE ALLGAEARLE MLPSPPSSGL
PHHRGLHLVL RPAGKRTVDL NLLSMGERTM GALAFLFALS EASEEGRGLP VAVLDEVDAP
LDEANILRFA GFLRRFSQET QFILITHQKR TMEACDALYG VTSEQGLSRV YSIQRDEALA
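
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds a codon table from the compact NCBI-style amino acid string. It relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code (the two differ only in which codons may act as starts), which is sufficient here because the gene begins with ATG. The `translate` helper is illustrative, not the tool used to produce this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGAAGATCG ACCGTCTCAT TCTGCAAGGG"))  # MKIDRLILQG
print(translate("GAGGCCCTAGCTTGA"))                   # EALA
```

The two outputs match the first block and the final four residues of the protein sequence listed above.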