Gene Mesil_0206 details


Gene Information

Locus tag: Mesil_0206
Symbol:
ID: 9249684
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Meiothermus silvanus DSM 9946
Kingdom: Bacteria
Replicon accession: NC_014212
Strand:
Start bp: 209448
End bp: 211841
Gene Length: 2394 bp
Protein Length: 797 aa
Translation table: 11
GC content: 63%
IMG OID:
Product: peptidase C1A papain
Protein accession: YP_003683658
Protein GI: 297564686
COG category:
COG ID:
TIGRFAM ID:
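
The start, end, gene length, and protein length listed above agree with each other if the coordinates are read as 1-based and inclusive and the CDS ends in a single stop codon. A minimal sketch of that check follows; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(209448, 211841)
print(length)                     # 2394
print(protein_length_aa(length))  # 797
```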


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCAT TCCAGCGGCT GGGAATGGTA GGTATCCTCA TTAGCGCGTT GGCGGCCTGT 
TCGCCCGGCG GGAATGGGGG CAAGCCAGAC CCCAACCTAT TCAACGAGGG CAACTCTTGG
AAAGGCGAGA TCCCCAGCGA TGCTCAGCGG GTCAGCCCCG AGGAGTTCCA GAAAGGCATT
GCCTCGGGCG AGCTGGTGCT TTCCAGTGCG GCCTCGGTCG CCGCAGCTAA ACAGGCTCGA
GAAGCCCAAT ACCAAAGCGA TAAGAACTTC CTGAACAGTA TCCCGGAGAA AGACCCCAAC
ACCGCGGCCC TGCTCACCGA GGCCGCGGGC AGCCCCAGCT TCGAGGGTGA CCGCCCGGTA
AGCGGGCCGG GGGGAGAGTC GGTGGTGCTG TTTGGCCTGG GGACCCAGCT GCGCAACGCT
GCCGAAACCT ACCAGCGCTC GCAGAGCGTG GAAAACGCCC TGGACGATTA TGCCTTGAGC
TACTCGCTCT TGCCCGAGGA CCTCAAACCC CAAGCCCCCA GCCCTGCTAG CCTCAAGGGA
AAGTCGCTGG CCGAGGTCAA GGCCGCCCTC GAGCAGCTCA ACAGCCTGCT GGGAAGCAAG
TCTGCGAGTT TGCGTACGGC CCGGCTCGAG CCCGGCGGCG GTATCCGGCC CCAAGCCATC
AACCCTGGCA ACGGCACCGA CAACAACGGC CCCTGCACCC CTACCAACTT GGTCAAGCGC
TTCTGGTTCC CCCTCAAGAA CTTCATCAGC CCGGTCAAAA ACCAGGCCAA ACGCGGCACC
TGCTGGGCCT TCGCCGCGAT CGGGGCGGTG GAAAGCCGCG AGCGGGTGCA AAACAACAAC
CCAGCCGACC TCTCCGAGCA GTTCCTGGTC AACAAAGTCA AGCAGGACTG GGACTCCAGC
GACTACTCCG ACGGCTACTG GTCGGAGAAG GCCCTCGAGA CCGCGGTCAA TAAAGGCCAG
AGCTTCCCCA GCGAGGGGGG CTGGACCTAC AACGGGGCCA CCTCACGCCC CAGCGTCAAA
GACGGCGACA GCGACTCGTA TGCCAACAGC TGCAACGGCT ACACCGGCAC CTGCTCGGAT
ACGGCCCACG AAAGCCGCCG GGTCTGCACC ACCTTTATCT TCACCTTTTG TAGCTACGCC
AAGGTAAGCT TTGGCGGGCC CGGAGTGGCC TCCTCGAAGA CCATCCAGGT CTGGAAAAAC
GGCGAGGCCT TCAAGCTCAA CCTCTTGCGC CAGAAACTCT CGCAAGGCTA TGTGCTCTTG
GCTTCGTTCC CGGTCTACAA AGGCTTCATG GACGATGTGA AAAACGACGG TGTGGTGAGC
AACTACGCCA AGACCAAGCT CGACGACAAG GGCAAAGAGG TGGCCGGCTC CTACGGCGGC
CACGCCGTGC AGATCGTGGG ATTCCTCTCC AATGAGGACA TGAGCCAGTT CGGCCAGACC
CCCAACATCG GCGGGGGTGG GTATTTCATC GTCAAGAACT CCTGGGGCTG CGGCGCGGGC
GACGGCGGCT TTTACTACGT CCCCGCCGAC TACGTAAGCA GCATCTTCGA TTCCCTGAGC
GTGCTGAACT TCGATGGACG CCGCAGCGAG GCCTGGAAGC AGGAGCAGGC CGCGCCGGGC
GGCTCGGAAG CGCCCAAGAT CACCATCAAA ACCAACCCCG CTACCGTACA GCTCCGGGTC
GAGACCAACC TGGCCCAGTT CTTCCAGATC ACCCACCCCG TGGCCAAGAG CGTCAACCTG
AGCGTGACCT CAAATCTGGA CGGCACCCTC TATAGCGGGG GCTGGAGCAC CGACCAAAGC
AGCCTGTTCG GCCCCGAACT CAAGCGCACC TTCACCTCGG TGGGCAGCAG AACGCTGACG
CTCCTCGCCA AGTACGGCAA CAGCCAAGCC AGCAGCAGCT TCGTGGTCAA GGTGATCAAC
ACCCCGCCCA CCCTGAACCT GCAATACGGG GGAGACCCCC ACCAGGGCGA GGCCTACCCC
ATCACCGCGC TCATCACCGA CCCTAACGAG CCCGACATCA ACAAGCTATG CGGTACCACC
ACCTGGTCAG TGGATGCGCC CGATACGCTT TCGGCAGGTA CGGGCTGCTC GGTCTCGGTC
ACCTTTGGGG CTACCGGATC CCGCCAAGTG CGCGTGACCA CCCAAGATAG CGACGGGGCC
ACCGCTACCC AGACCCTCAA CCTGAACGTG CTGCCCCCAC CCCCCCAACC CCTACCCCAA
GATCACCGAT TACGGGGTGT ACTCGAGGGA GTTTACCGGT GGGCAGTTCC GCTTCTGCGG
GAGCGTGAGG GTGGCGGGTG GCAGTACCAT CGTCTTGAGC GATACCGGCT GTACCCTTCG
CATTGGGCCC GCCCCTACCC GGTACTACGG CGGAATTACC GTGGAAAACC CTAG
 
Protein sequence
MKAFQRLGMV GILISALAAC SPGGNGGKPD PNLFNEGNSW KGEIPSDAQR VSPEEFQKGI 
ASGELVLSSA ASVAAAKQAR EAQYQSDKNF LNSIPEKDPN TAALLTEAAG SPSFEGDRPV
SGPGGESVVL FGLGTQLRNA AETYQRSQSV ENALDDYALS YSLLPEDLKP QAPSPASLKG
KSLAEVKAAL EQLNSLLGSK SASLRTARLE PGGGIRPQAI NPGNGTDNNG PCTPTNLVKR
FWFPLKNFIS PVKNQAKRGT CWAFAAIGAV ESRERVQNNN PADLSEQFLV NKVKQDWDSS
DYSDGYWSEK ALETAVNKGQ SFPSEGGWTY NGATSRPSVK DGDSDSYANS CNGYTGTCSD
TAHESRRVCT TFIFTFCSYA KVSFGGPGVA SSKTIQVWKN GEAFKLNLLR QKLSQGYVLL
ASFPVYKGFM DDVKNDGVVS NYAKTKLDDK GKEVAGSYGG HAVQIVGFLS NEDMSQFGQT
PNIGGGGYFI VKNSWGCGAG DGGFYYVPAD YVSSIFDSLS VLNFDGRRSE AWKQEQAAPG
GSEAPKITIK TNPATVQLRV ETNLAQFFQI THPVAKSVNL SVTSNLDGTL YSGGWSTDQS
SLFGPELKRT FTSVGSRTLT LLAKYGNSQA SSSFVVKVIN TPPTLNLQYG GDPHQGEAYP
ITALITDPNE PDINKLCGTT TWSVDAPDTL SAGTGCSVSV TFGATGSRQV RVTTQDSDGA
TATQTLNLNV LPPPPQPLPQ DHRLRGVLEG VYRWAVPLLR EREGGGWQYH RLERYRLYPS
HWARPYPVLR RNYRGKP
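
The sequences above are printed in space-separated blocks of 10 characters, 6 blocks per line. Before further use they usually need to be joined into one string. The sketch below shows one way to do that and to compute a GC fraction; the helper names are hypothetical, and the sample is only the first line of the gene sequence, not the full gene.

```python
def clean_sequence(block: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace and uppercasing."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from the record above (sample only).
sample = "ATGAAAGCAT TCCAGCGGCT GGGAATGGTA GGTATCCTCA TTAGCGCGTT GGCGGCCTGT"
joined = clean_sequence(sample)
print(len(joined))              # 60
print(round(gc_fraction(joined), 2))
```

Applied to the full gene sequence, the joined string should be 2394 characters long, matching the Gene Length field.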