Gene Mesil_1420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMesil_1420 
Symbol 
ID9250920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMeiothermus silvanus DSM 9946 
KingdomBacteria 
Replicon accessionNC_014212 
Strand
Start bp1412743 
End bp1414065 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content61% 
IMG OID 
Productcarboxyl-terminal protease 
Protein accessionYP_003684821 
Protein GI297565849 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAA GAGCTTGGAT CATTATCGGC GGGGCCGTAG TGGCGGCCCT GGTATACGCC 
CAACTCAACC GCGGAGGGGC CGAAAGCTTC TCCCGCAACC CCAACGGTCA GGCCCTGATC
GAAACCTACT CGCTCCTCCA AGACCAGTAC CTGAAACCCC TGGACCAAAC CCAGCTCAAC
AAGGTGCTCG AGGGGGGGAT CCGGGGGATG CTGAATGCCC TGGGCGACGA GTTCACCAGC
TACTCGCCCC CCGCCAGGGC TGCGCAGCGG CAAGAAGACC TACGCGGGGA ATTCTTCGGC
ATTGGGGCCA CTTTGGCTCC AGCTCAGCAA GGGGGCACCG GGGCCCAAAT CCAAGGCCTC
ATCCGGGGCT TGCCCGCCTT CAACGCTGGC CTGCGGGTAG GCGACCAGAT CGTGGAGGTC
AACGGCGAGG ACGTGACCAA GCTGGACCTC GAGGAAATAG TCTCCAAGAT CCGCGGTCCG
CGAGGGACCA AGGTCACCAT AGGGGTCAAG CGCGAGGGCA ACAACGCGGT TTTGCGTTTT
GAACTCATTC GCGAGCTGGT GAAGATCATC GAGGTGAACA AGGCGTTGCT TCCGGACAAC
ATCGGCTACA TCGAGCTGCG CTCGTTCGCC AATATCAATG TATCTTCCCA GCTCAACGCG
GCCATCAGCG ACCTGCGCAA ACAGGGAATG CAAAAGCTTA TCTTCGACCT GCGTGACAAC
GGCGGGGGTC TTTTGGATCA GGGCTGCTCC GTGGCCAAGG CCTTCATAAA GGAAGGACCT
ATCGTCTACA CCAAGACCCG CAGCGAGACC CGCTTGTACT GCGAAGCCAA CGGGCAGGTG
CAGTGGAGTG GACCGATGGT GGTACTGGTC AACGGGAACT CGGCCTCGGC CTCGGAGATC
GTGGCGGGGG CTCTGCAGGA CACCGGTCGG GCCAAGATCG TGGGGGAGAA GACCTTCGGT
AAAGGGGTAG GTCAGAACGT GATTGACCTG GCCAACGGCG GCGACCTGAC CCTGGTGACC
TTCCAGTGGC TCACTCCCAA GAAGCGGGCC ATCACCCGCG ATCAGGGCAT CCAGCCCGAT
GTGGTGGTGC GGGATAACCG CTTCCCGGTA CCGGTCTCGT TCGAGGGCAC CGGCGCCAAG
CCAGGGGCTA CGGTAACGCT CACCATAGAC GGGAAAACCT ATACCGCCAA GGCTGATGAA
ACCGGCAAGT ATGCCTTCAG CCAGCCACTA CCGGCCCGCC CAGCCAACGA TAACTCCGGC
AACGCAATGG TGGATCCGCA AAACGACGCC ATCCTGGCTC GAGCCCTGCA AGAGCTTAAG
TAA
 
Protein sequence
MKKRAWIIIG GAVVAALVYA QLNRGGAESF SRNPNGQALI ETYSLLQDQY LKPLDQTQLN 
KVLEGGIRGM LNALGDEFTS YSPPARAAQR QEDLRGEFFG IGATLAPAQQ GGTGAQIQGL
IRGLPAFNAG LRVGDQIVEV NGEDVTKLDL EEIVSKIRGP RGTKVTIGVK REGNNAVLRF
ELIRELVKII EVNKALLPDN IGYIELRSFA NINVSSQLNA AISDLRKQGM QKLIFDLRDN
GGGLLDQGCS VAKAFIKEGP IVYTKTRSET RLYCEANGQV QWSGPMVVLV NGNSASASEI
VAGALQDTGR AKIVGEKTFG KGVGQNVIDL ANGGDLTLVT FQWLTPKKRA ITRDQGIQPD
VVVRDNRFPV PVSFEGTGAK PGATVTLTID GKTYTAKADE TGKYAFSQPL PARPANDNSG
NAMVDPQNDA ILARALQELK