Gene Mesil_3173 details

Gene Information

Locus tag: Mesil_3173
Symbol:
ID: 9252703
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Meiothermus silvanus DSM 9946
Kingdom: Bacteria
Replicon accession: NC_014212
Strand:
Start bp: 3222567
End bp: 3224183
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: transposase IS4 family protein
Protein accession: YP_003686505
Protein GI: 297567533
COG category:
COG ID:
TIGRFAM ID:
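
The listed lengths follow from the coordinates. The short sketch below reproduces them, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted as a residue. The variable names are illustrative only and do not come from the source database.

```python
# Values copied from the Gene Information table.
start_bp = 3222567
end_bp = 3224183

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1617
print(protein_length)  # 538
```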


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.418801
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCATCC GCCAGAAGGC CTTCAAGAAC AAAGACGGCT CCACCCGCAC CTACCTCCAG 
CTCGTCGAGA GCGTGCGCCA GGGCGGCCGC GTCCGCCAGC GGGTGGTCGC TACCCTGGGC
CGGCTGGAGG ATCTCCAGGA CGGCCGGCTC GATGCCCTCA TCGAGAATCT GGCCCGCTTC
TCCCAGAGCA CCTGGCGTCG GCTGGAGGAA CAAGCCGAGC GCCTGAACGT CCGGTGGTCC
AAGCAGTGGG GACCGGCGCT GATCTTCGAA CGGCTGTGGC GCGAGGCCGA ACTGGACAAG
GCCTTCGAGG CCCTGCTGGA GGATCGCCAG CTGGCCTTCG ACGTGGCCGA GGCCGTCTTC
ACCATGGTAC TCAACCGCCT CACCGACCCC TGCTCCAAGC GGGGCCTCGT GCGCCAGTGG
CTGCAGGGCG TCTACCGGCC CCAGGCCGAG CAGCTGGAAC TGCACCACTA CTACCGCGCC
CTGGACGTCC TGGCCGAGCA CAAGGAGGCG ATCGAGGATC GCCTTTTCGC CCGGGCTCGC
GACCTGTTCT GGACCGAGGT GGACGTCGTC TTCTGGGACA CCACATCTAG CTACTTCGAA
GGCCGGGGGC CCGAGGGCTT GGCGGCCTAC GGGTATTCCC GGGACAAGCG CCCGGATCGG
CCCCAGCTGG TGGTGGGCGT GCTCATGACC CGGGACGGCT ACCCCATCGC CCACGAGGTC
TTCCCAGGCG ACACCGCCGA CAAGGCGACC GTGGAGACCG TGCTGGATGC GCTCAAGCGG
CGCTTCCACC TGCGCCGGGT GATCTTCGTC GCCGACCGGG GCATGGTCAG CCGCCAGATC
CTGCGGGCCA TTGAAGAGGC CGGGATGGAG TACATCGTCG GCATGCCCCT GCGCCGGCAC
CGGGCGGCCG AGGCGGTCTT GAGCCAGCCG GGGCGGTATC GCAAGGTGAA CGACCAGCTC
CAGATCAAGC AGGTGACCCA CCAAGGCCAG CGCTATGTGC TCTGCTACAA CCCCCTCCAG
GCCGAGCACG ACCGCCAGGC TCGGGAGGCG GCCCTCGCGC ACCTGAAGCA GCGGATCGAG
CGCGGCCAAG CCAAGGAGCT CCTGCGAAAC CGTCTCCTGG CCCGTTACCT CAAGGCCCTG
CCCCAGGGCG CGCTGGTGGT CGACACCGAT GCCGTGAAGC GGGCGGCCCG CTACGACGGC
AAGTACCTGC TGCGGACCAA CACCGACCTC GACCCGGAGG CCGTGGTGCG GGCGTACAAG
GATCTCTGGC GGGTCGAGCG CGCCTTCCGC ACCCTCAAGT CCGCCCTGGA CCTGCGGCCC
ATGTTCCACT GGACGGAGCG GCGGGTGCGG GGGCACGTCA TGGTCTGCTT CCTGGCGCTG
GTCCTGGAGA GCCTCTTGTT GCGCAAGCTC CGCCAGCAAA ACCCCGATGT GAGCTACGAG
GACGTGCTCC ACGACCTCTC GCAGCTGCAC GCCGTGGCCG TGGAGCTGGA CGGCGAGGCC
TGCCTCACCC GCACCGAGCT GGTCGGGCAG GCCTACGAGG CCTTCAAGGC CGTGGGCCTG
CGGCCGCCGG CGCGGGTCCA GCCGCTGCCA CGTCCCGAGA CGACCCCCGC CGGGTAG
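
The record does not say how its GC content figure is derived. A minimal sketch, assuming it is the fraction of G and C bases over the whole gene sequence, is given below. The function name `gc_fraction` is hypothetical, and the example runs on the first ten-base block only.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C among the A/C/G/T characters of seq."""
    # Drop the spaces and line breaks used to group the sequence in blocks of ten.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    return sum(b in "GC" for b in bases) / len(bases)


# First ten-base block of the gene sequence: 1 G and 3 C out of 10 bases.
first_block = "ATGTTCATCC"
print(gc_fraction(first_block))  # 0.4
```

Passing the full gene sequence, spaces included, gives a value to compare with the GC content listed in the Gene Information table.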
 
Protein sequence
MFIRQKAFKN KDGSTRTYLQ LVESVRQGGR VRQRVVATLG RLEDLQDGRL DALIENLARF 
SQSTWRRLEE QAERLNVRWS KQWGPALIFE RLWREAELDK AFEALLEDRQ LAFDVAEAVF
TMVLNRLTDP CSKRGLVRQW LQGVYRPQAE QLELHHYYRA LDVLAEHKEA IEDRLFARAR
DLFWTEVDVV FWDTTSSYFE GRGPEGLAAY GYSRDKRPDR PQLVVGVLMT RDGYPIAHEV
FPGDTADKAT VETVLDALKR RFHLRRVIFV ADRGMVSRQI LRAIEEAGME YIVGMPLRRH
RAAEAVLSQP GRYRKVNDQL QIKQVTHQGQ RYVLCYNPLQ AEHDRQAREA ALAHLKQRIE
RGQAKELLRN RLLARYLKAL PQGALVVDTD AVKRAARYDG KYLLRTNTDL DPEAVVRAYK
DLWRVERAFR TLKSALDLRP MFHWTERRVR GHVMVCFLAL VLESLLLRKL RQQNPDVSYE
DVLHDLSQLH AVAVELDGEA CLTRTELVGQ AYEAFKAVGL RPPARVQPLP RPETTPAG
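
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that, not the method the source database uses. It assumes the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard genetic code (the two differ in which start codons are allowed). The helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the usual TCAG ordering, with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks of the gene sequence.
print(translate("ATGTTCATCC GCCAGAAGGC CTTCAAGAAC"))  # MFIRQKAFKN
# Last four codons of the gene sequence, ending in the TAG stop codon.
print(translate("CCCGCCGGGTAG"))  # PAG
```

Both outputs match the start (MFIRQKAFKN) and end (PAG) of the protein sequence listed above.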