Gene Mesil_1948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMesil_1948 
Symbol 
ID9251460 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMeiothermus silvanus DSM 9946 
KingdomBacteria 
Replicon accessionNC_014212 
Strand
Start bp1935377 
End bp1937107 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content62% 
IMG OID 
ProductPHP domain protein 
Protein accessionYP_003685331 
Protein GI297566359 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.453444 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.722452 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAATG CCGAGATATC CCGCTTGTTC CAGGAAATGG CTGATATGCT CGAGTTCCTG 
GGAGATAACC CTTTTCGCAT CAGGGCCTAC CGGCAAGCCG CACGGGTGCT GGCCGACCTG
GAGACCCCCA TCGAAGACCT GGCCCGGCAG GGGCCAAATA CCTTAGAGGG AATTCCCGGT
ATCGGTCCCG ATCTGGCCGC CAAGATCCAG GAGTATCTAC AAAGCGGTAA GATCGCAGCC
CATGAAGAAC TCGCCAGGAA AGTACCTGCC GGGGTACTCG AGGTGATGCG GATTCCCGGT
GTGGGGCCCA AGACCGCCCA GTTGCTATGG AGTAGGCTCG GAGTGGATTC GTTGGCCAAG
CTGAGGGCGG CCCTGGAATC GAAGAGGGTG CTCGAGCTGC CCCGCTTTGG CGAAAAAAAA
CGGCTGCGCC TGCTGGAAAA CCTGGCTCTG GCGCAGTCCG CCACCCAGCG GCGGCCCCTG
GGTAGCGTGT TGTGGCGGGT GCGGGAGCTG CTGGCGGCGA TTCGCGGCCT CCCCCAGGTG
GAGCAGGCCG AGTGCTGCGG CTCGGTGCGC CGTTACAAGG AAACCGTGGG CGACCTGGAT
TTCCTGGTCG CCACGCGGCA AGGAGAACAA GTGCTGGCTA GGTTCACCCA GCTGCCGGGT
ATCGCCGATG TCGAAGCCGT AGGAGAAAAC CGGGCCACGG TGTTTTTGGA GGATGGCTTG
CAGGTGGACC TGAAAATCGT GCCGCCCGAG TCTTGGGGGA GCGGCCTACA ATACCTCACC
GGCTCTAAAG CCCACAGCAT CCGGCTGCGC AAGCTAGCCC TCGAGCAAGG CTTAAAGCTC
AACGAGTACG GCGTCTGGAA GGGGGAAAAA CGGATCGCGG GTCGGGATGA GGAAAGCGTA
TACGCGGCCC TCGGCCTACC CTTCATCCCG CCCCCTTTGC GCGAGGACTG GGGGGAGATC
GAGGCCGCGC AGGCTGGCAG GCTGCCCAAG CTGGTCGAGC TGAAGGACAT CCGGGGGGAT
TTACAGGTTC ATTCCACCTG GTCCGACGGG AAAAACACCC TGCTCGAGCT GGCCCAGGCC
GCTAAAGCGC TGGGCTACGC ATACCTCGCC GTCACCGATC ACTCCCAAAG CCTGCGCATC
GCCCACGGGG TTCGTGTCCA ACAGATGAAA CAGCGCATCC GGGAGATCCG CGAAATCAAC
GAAAGGACGG GCGCAAAGCC CTATCTGCTG GCTGGGGCCG AGGTGGAAAT CCTCGAGGAC
GGCTCGCTGG ACTACCCCGA CGAGGTGCTC AAAGAGCTCG AGATCGTTCT CGTGGCCATC
CACTCTCACT TCGGCCAGGA CGAAAAGACC GAGACCCGGC GCATACTCAA GGCCCTGGAG
AACCCTTACG TGCACATCCT CTCCCATCCC ACCTGTCGCC TTATTGGCCA GCGCAAGGGG
ATCGAAGCGG ACTGGCAGAA GGTCTTCTCC CGGGCCAAAT CGCTGGGCAA AGCCGTAGAG
ATCGATGGTC ACTACGACCG CCTGGATCTG CCCGACGTGC GAGCCCGACA AGCAGGAGAA
ATGGGGGTGA TGATCTCACT GGGTAGCGAT GCCCACCAGA TTGACCATCT GCGCTTCATG
GACCTAGCCG TGGGCACGGC TCAACGGGCC TGGTTGGGGC CGCTCCAGAT CCTCAACACG
AAATCACTTG AGGAGTTGCT GGGGTGGCTC GAGGGGGTTC GGGAAAGCTA G
 
Protein sequence
MKNAEISRLF QEMADMLEFL GDNPFRIRAY RQAARVLADL ETPIEDLARQ GPNTLEGIPG 
IGPDLAAKIQ EYLQSGKIAA HEELARKVPA GVLEVMRIPG VGPKTAQLLW SRLGVDSLAK
LRAALESKRV LELPRFGEKK RLRLLENLAL AQSATQRRPL GSVLWRVREL LAAIRGLPQV
EQAECCGSVR RYKETVGDLD FLVATRQGEQ VLARFTQLPG IADVEAVGEN RATVFLEDGL
QVDLKIVPPE SWGSGLQYLT GSKAHSIRLR KLALEQGLKL NEYGVWKGEK RIAGRDEESV
YAALGLPFIP PPLREDWGEI EAAQAGRLPK LVELKDIRGD LQVHSTWSDG KNTLLELAQA
AKALGYAYLA VTDHSQSLRI AHGVRVQQMK QRIREIREIN ERTGAKPYLL AGAEVEILED
GSLDYPDEVL KELEIVLVAI HSHFGQDEKT ETRRILKALE NPYVHILSHP TCRLIGQRKG
IEADWQKVFS RAKSLGKAVE IDGHYDRLDL PDVRARQAGE MGVMISLGSD AHQIDHLRFM
DLAVGTAQRA WLGPLQILNT KSLEELLGWL EGVRES