Gene Mesil_1104 details


Gene Information

Locus tag: Mesil_1104
Symbol:
ID: 9250597
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Meiothermus silvanus DSM 9946
Kingdom: Bacteria
Replicon accession: NC_014212
Strand:
Start bp: 1093722
End bp: 1095527
Gene Length: 1806 bp
Protein Length: 601 aa
Translation table: 11
GC content: 63%
IMG OID:
Product: Endonuclease/exonuclease/phosphatase
Protein accession: YP_003684517
Protein GI: 297565545
COG category:
COG ID:
TIGRFAM ID:
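
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1093722
END_BP = 1095527
GENE_LENGTH_BP = 1806
PROTEIN_LENGTH_AA = 601


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coded_residues(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)       # True
print(coded_residues(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Under those assumptions the reported 1806 bp and 601 aa agree with the coordinates: 1095527 - 1093722 + 1 = 1806, and 1806 / 3 - 1 = 601.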


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.604194
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGAATA TTTGTGGAAA AGTTTCAAAA TGGTGGATCG CTTTGTTTTT GCTCATGGCG 
CCTGGCCAGC AAAGCCCCTC TGCTCAAGGG TGGGGCATGG CCGTAGCGGC CTCGCAGGGA
CAGGGCCAAG CGGCTTCCTG TCCGGCAGAT TCCGGCGTAA TACGTACCTT TCAGATAGAA
GGTAGGGGAG CCTCGAGCCC CTTGGCGGGG CAAGTGGTGA CCACCGAAGG GGTAGTGGTG
GGCGATTATC AAGAGCCAAA CCAGCTCCAG GGCTTCTTCC TCCAGGATCT TACCGGGGAC
GGTGACCCCG AGACCTCGGA CGGCCTGTTC GTCTATACCC CTTCAGGCTT CGCGGTGAAC
GCGGGGGATT ACGTGCGGGT CACCGGCAAG GTACGCGAGT ATGCCTCCCC AGGGGATACC
CGAGGCACCC TAACCGAACT CGAGGAGGTA CGTAGCGTCC TGGTGTGCGC TACCGGGGTC
GCCGTGGGCC CCACCCCGGT AACCCTACCC CTTGCGAGTA CGGGCGACCT CGAGCGCTAT
GAGGGGATGC TGGTGAGCTT TTCCCAAACC CTCACCGTCA GCGAGGTGTA TAACCTGGGC
CGCTTTGGGG AGATCAGCCT TTCGGCAGGC GGGAGGCTGT TCCATCCCAA CAACGGCAAC
GCCCTGGGCG AGGCGGCAAT GGGCAATCCC TTGCGGCGCA TCCTGCTGGA TGATGGCAGC
AACGTGCAAA ACCCTCGCCC AATCCCCTAC CTTTCGGCAG CGGATACCTC GGGCACCCGG
CGGGTCGGGG ACAGCGTGGT AGGGCTCACC GGGGTCCTCT CTTACGGCTT CTCCAGCTAC
CGAGTAGAGC CGGTGGGCAC AGTCAACTTT ATCCCTAGCA ACCCCCGGCC CGAGGCCCCC
GAAGACGTCG GTGGCTCCCT CAGGGTGGCC AGCTACAACG TGCTCAACTA CTTCACCACC
CTAGGCGCAC GGGGTGCTAG CAACCAGGCC GAACTCGAGC GGCAAAGGGC CAAGCTGGTA
GCCGCCCTGC GCGCTCTGGA CGCCGATATA GTGGGACTGA TCGAGATACA AAATAATGGC
GACGCTGCGC TCGAGGACCT CGTTCGGGCA CTCAACACAG CGCTGGGCAG CGATGCCTAC
GCCGCCCTAG CGACCGGCAG CCTCGGCACC GATCAGATCA AGGTAGCCCT GATCTACAAA
CCTGCTCGGG TCAGGCCGGA AGGAGCATTC CGCATCGACG ATGACCCTAT CTTCTCCCGC
CCTCCGCTCG CCCAAACCTT CCGCGACCGG GCCACCGGTG GACGCTTTAG CGTGGTGGTC
AACCACTTCA AGTCCAAGGG CTGCGAGGGG GCCAGCGGGG CCGAAACAGA CACCGGTCAA
GGGTGTTGGA ACGCCCTGCG CGTACGACAA GCCCAGCGAC TTTTGACCTT CATCAACGAC
CTCCGGGCCA CCGACCCCGA CGTGCTGGTG GTAGGTGACC TCAACGCTTA CGCCGAGGAG
GACCCGCTGA AGGTGCTGAC CGGGGCCGGA TTGGAAAACC TGATCCTGCA CATCTCCGCG
GCCAAACGTT ACAGCTACGT GTTCAACGGG GAGTCCGGCA ATCTCGACCA CGCCCTGGCG
ACCTCCAGCC TCTCTTCGCA AGTTACCGGG ATCACCGAGT GGCACATCAA CGCCGACGAG
CCCAGGGTGC TCGATTACAA CACCGAGTTC AAACCCGATG ACCGCTACGC CCCCACCCCC
TTCCGCTCCT CCGATCACGA CCCGCTGCTG GTCGGGCTGA ACCTGCGCGC CGACCCCGAG
CCCTGA
 
Protein sequence
MQNICGKVSK WWIALFLLMA PGQQSPSAQG WGMAVAASQG QGQAASCPAD SGVIRTFQIE 
GRGASSPLAG QVVTTEGVVV GDYQEPNQLQ GFFLQDLTGD GDPETSDGLF VYTPSGFAVN
AGDYVRVTGK VREYASPGDT RGTLTELEEV RSVLVCATGV AVGPTPVTLP LASTGDLERY
EGMLVSFSQT LTVSEVYNLG RFGEISLSAG GRLFHPNNGN ALGEAAMGNP LRRILLDDGS
NVQNPRPIPY LSAADTSGTR RVGDSVVGLT GVLSYGFSSY RVEPVGTVNF IPSNPRPEAP
EDVGGSLRVA SYNVLNYFTT LGARGASNQA ELERQRAKLV AALRALDADI VGLIEIQNNG
DAALEDLVRA LNTALGSDAY AALATGSLGT DQIKVALIYK PARVRPEGAF RIDDDPIFSR
PPLAQTFRDR ATGGRFSVVV NHFKSKGCEG ASGAETDTGQ GCWNALRVRQ AQRLLTFIND
LRATDPDVLV VGDLNAYAEE DPLKVLTGAG LENLILHISA AKRYSYVFNG ESGNLDHALA
TSSLSSQVTG ITEWHINADE PRVLDYNTEF KPDDRYAPTP FRSSDHDPLL VGLNLRADPE
P
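
Both sequences are printed in space-separated groups of ten characters across several lines, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up plus a GC-percentage calculation. The helper names are hypothetical, and only the first row of the gene sequence is used as a short demonstration, so the printed value describes that row and not the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Join a grouped, multi-line sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First row of the gene sequence above, used only as a demonstration.
first_row = "ATGCAGAATA TTTGTGGAAA AGTTTCAAAA TGGTGGATCG CTTTGTTTTT GCTCATGGCG"
dna = clean_sequence(first_row)
print(len(dna), round(gc_percent(dna), 1))
```

Passing the full gene sequence block through `clean_sequence` and then `gc_percent` would give a figure to compare with the GC content reported in the record.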