Gene Mesil_3041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMesil_3041 
Symbol 
ID9252564 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMeiothermus silvanus DSM 9946 
KingdomBacteria 
Replicon accessionNC_014212 
Strand
Start bp3086173 
End bp3087321 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content67% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003686387 
Protein GI297567415 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCGGG CTTGGGCCAA GCCAAGGGCC TGGAAAAAGC GCCTTCAGGG AGCGCTGCTG 
GGCTTGCTCT GTCTCTTGCC CCTGGCCCTC TTCGTGCATC CGGCCTGGGG GTTAGTGTCG
CTGCTGGCGC TGCTCTACCC CACCCGCCGC GAAGAAGAAG CCGCCTTGGC CGAGCTAGAC
CGGCGGTACG GCCTGGCCTA CCGCAGCGCC CTGGAGGCTC CCGCGGGCCA CCCCTGGCGG
GGTCAGCTCG AGGCCGAAGC CAGTGCCAGC CTGAGCCGGG CCCGTCCTCC GGCCTTCCCC
TGGCCGCTCG CGGTAGCTTA CCTAGCCTTG ATCGGGCTGA TCTGGGTACT CCCCCCCCTC
CAGAACCCGC TCAACCCTCC CGCGTCGGTG AGCCAGACCT CGAGCCCCTC GCCGCAGGCT
TCGCCAGGCT CGCCCGAGCA AGCCCCCAAC CGCCCCCCGG AGCGAAATCC GCTGCCTAAC
CCACCCCAGG GCCAACGCAC CGAGCAGACC GAGCCGGAAT CTCAAGCGCC TGTCTCGAGC
CCCCCAAACC CGCAGGCCCA ATCTCCAACA GAGCAAAAGC CAAGTGCACC GAATCCGCAG
AACCAGCCCA GCGCGGGCGA GCCGAAAACT GTGAGCGAGC CAGGCCAGCC CGACCAACCT
GGACAAGCGC AGCCCACCCC GAATCAGCCT GGGCAGCAAA AGGGGGATTC TCAAAACGCC
GAACGCCCCA CCCAGCCACA GAAAGGCTCC CAGGGTCAGC AAGGTCCTGG GCCCGGGCAA
AAGCAAATCC CACAGAACGG CCAACAGAGC CAGTCCGCTG ACCAGAGTCC GCGAACTCAG
GGTCAGTCTG GCCCACAACC GGGGCCTAGC TCGAGCCCCA AAGATGAGCG GGGCTCGGGG
CCAGCTTCGC AGCAACCTCT ACCGCAAGCC CCAAACCCCC AGCCCGGCAT CCGCCCCAAT
GGCGAAGCCC CGATCCAGCG GGGATCAAGC CAGGGCCGCC CGCAACCACT CCCCTCCCCC
TGGCCGTCTG GGCAACCTCC GCAAAACGTG CAGCGGCAAG CCGAGAACTA CCTCCAAAGC
GAACCTCTCC CGCCCGAGGT GCGGGAGGTA CTAAGGCAGT ATTTCGAGCT AAGCGCCGAT
AGCCCATAA
 
Protein sequence
MHRAWAKPRA WKKRLQGALL GLLCLLPLAL FVHPAWGLVS LLALLYPTRR EEEAALAELD 
RRYGLAYRSA LEAPAGHPWR GQLEAEASAS LSRARPPAFP WPLAVAYLAL IGLIWVLPPL
QNPLNPPASV SQTSSPSPQA SPGSPEQAPN RPPERNPLPN PPQGQRTEQT EPESQAPVSS
PPNPQAQSPT EQKPSAPNPQ NQPSAGEPKT VSEPGQPDQP GQAQPTPNQP GQQKGDSQNA
ERPTQPQKGS QGQQGPGPGQ KQIPQNGQQS QSADQSPRTQ GQSGPQPGPS SSPKDERGSG
PASQQPLPQA PNPQPGIRPN GEAPIQRGSS QGRPQPLPSP WPSGQPPQNV QRQAENYLQS
EPLPPEVREV LRQYFELSAD SP