Gene Mesil_1049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMesil_1049 
Symbol 
ID9250542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMeiothermus silvanus DSM 9946 
KingdomBacteria 
Replicon accessionNC_014212 
Strand
Start bp1039139 
End bp1040692 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content64% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003684462 
Protein GI297565490 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.154586 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCTT ATTGGAAAAG CCTTCTGGGT CTGCTGATAG GGTTAGGGCT GGCGGCGTGT 
AGCCGATCGG TATCCGCCCC GCCGCACTGG GTTTTGGTCA GCCAAGCCGG CTCGATCCCG
CTGCTCCCAG TGACCTTCGC AGAGTTCAGC ATCCTCGATA GCAGCGGGAA AAAGGTCAAC
CTCAAGCCCA GTGCTTCGGC CATCGTAGAA CTCCCGATAC CGCCGGAGCT ACGCCAGGCC
TACCCGCTCG GCTCCAAGAT CCACTGCTAC GCCTACAACC CCCAGACCGG CCAGTGGGAG
GACTTCGTGG AGGGGACGGT GGAGACCTCG AGCGTGGACG GCTCGACCCC GGTGCTCCGG
GCCAGCATCC GCCACTTCTC CTGGTACGGC GGAGCCCCCG AGGGGACCAA GTGCCGCAAC
GGGGCGGTGC AGGTGATAGA TGCCAACGGG AAGCCCTTGC AGGGGGCTAC GGTGGTGGTG
CAGCCCGGGG TGAACGGCAC CACCAACGCC GAGGGGGTGG CGACCGTCTG GATTCCCGAG
GGCACCTCAA ACCCCAAGAT GTACGCCTAC AGGGTTTCAG ATAACAGCGA CGGTCGTATC
CCCGACCTAC CCAAGACAGC CAAGGTCATT GATATCGGCT ACTTCGTGCT TGACGACCTC
GGGGCTATCC TCCCCGGTCT CAATCAGGCC GACTGTGCCA GCGTGACGCC CTCGAGCCTG
GGCCAGCTCA ACCTACGCCC CCAAGCGGCT TACGGCACCC CCGCTAACCC CTTCGTCATC
AAGCTGGCCC CGATTGGCCA GGCGGTGTAC AAGGTAAACG CTCTGTTGCT AGCGAGTGGG
ACCGCAAACC GTGCGCTCGG CCAACTGAAA GCTGGTGTGC GTCCCTCCGC CGCTGGCAGC
TTCGTGAGCG TGACCCTCGA GCGGGGCATC CCCAACCCCG ACGGCGGATT CGATGTCACC
GAGCCAGTGG ACGGGGCCAA GATCACCCTC TCCGACGGCA AAGGAGGGTC GGCTCAGCTC
ACTGGAGTCG GCAGCGGTAG CTACTATCTG CAAAGCGGCC TCGACATCAC CCCCGGTACC
CGCTATACCC TGATCATTGA CGCTGACGGC AACGGCACCA TAGACGGTAG CGGTTCGATC
TACGCGGTGG GGAACGTGGC CTGGGACCAG GGCCTCGGTG GCTCGGTCCA GCCCGCCCAG
GGCTTCGTAG CGCGCTGGAC CGATTCTGCG GGCGGCACTC CCGGTTACGC CGCGGTCTAC
TACGCCGTCC TGAGCAGCAA GAGCAGCGAC CCCAACAGCT TCGACTTCGA CTACTATATC
GGCCCGGACC TGAGCTTCAC CCCGCACTCG AACGCCCAGG GGACAACCGG CGGCACCGCT
CCGCTTAAGC CGGGTGACTA CAGCGGCTCG CTGTGGGCTT TTAGCGGGGC GTATAGCCCG
GCGGGGAACA ATAACTTCAC CGTGAGCAAC AACATCACTG GGGTGGGCAT CAGCGGCGAG
TTCTCCAGCT TCAGCGCAGC CCAACCGGTG GGCTTCACCC TCACCGGCCC TTAG
 
Protein sequence
MKPYWKSLLG LLIGLGLAAC SRSVSAPPHW VLVSQAGSIP LLPVTFAEFS ILDSSGKKVN 
LKPSASAIVE LPIPPELRQA YPLGSKIHCY AYNPQTGQWE DFVEGTVETS SVDGSTPVLR
ASIRHFSWYG GAPEGTKCRN GAVQVIDANG KPLQGATVVV QPGVNGTTNA EGVATVWIPE
GTSNPKMYAY RVSDNSDGRI PDLPKTAKVI DIGYFVLDDL GAILPGLNQA DCASVTPSSL
GQLNLRPQAA YGTPANPFVI KLAPIGQAVY KVNALLLASG TANRALGQLK AGVRPSAAGS
FVSVTLERGI PNPDGGFDVT EPVDGAKITL SDGKGGSAQL TGVGSGSYYL QSGLDITPGT
RYTLIIDADG NGTIDGSGSI YAVGNVAWDQ GLGGSVQPAQ GFVARWTDSA GGTPGYAAVY
YAVLSSKSSD PNSFDFDYYI GPDLSFTPHS NAQGTTGGTA PLKPGDYSGS LWAFSGAYSP
AGNNNFTVSN NITGVGISGE FSSFSAAQPV GFTLTGP