Gene Mesil_3385 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMesil_3385 
Symbol 
ID9252915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMeiothermus silvanus DSM 9946 
KingdomBacteria 
Replicon accessionNC_014213 
Strand
Start bp198814 
End bp202068 
Gene Length3255 bp 
Protein Length1084 aa 
Translation table11 
GC content65% 
IMG OID 
ProductCRISPR-associated protein Cas5 
Protein accessionYP_003686705 
Protein GI297567734 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.335684 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCCCC ACGACCCTTC GCTGAAGGCT ACCCACGAGA AAGCCCTGCG TCTAATTCGG 
CTTCTGGAGT TACTGAAGCT CAAGCCCTGG ACAGCCCAGC AGCTAGCACA GGCGCTCGGG
ATAAAAAAGC GGGCGGTGCT TTATTACCTG CAAGACCTCC AGGAACTAGG CTCTCATCTG
GGGTTTCGGC TCATCCACGA TGAGATCCGC CGCACTTACA GCCTGAATGC CGGGGTGGTG
CTCAGCGATA TCGAGAAGGT GGTGATCCAT ACCGCTTTGC GGATGCTCTA CTACCACTCG
CCGGGGCACA ACCAGCAGTA CCAGGAAGCC CTGCTCAAGC TAGCCCGGGG CCTGCCCGAG
CCCGCTCGTA GCGTAGCGCA AAAGAGCGTC GAGGCCATGG CTGCCCGTGG ACCCGGCCTC
GAGGGAAGCA ACCTGGAGAA GATCACCAAC GCCTGGTTCC GCCGCCAGCT CGTCCAGTTT
CACTACCAGC TCCCCGGGCG GCGCGAGTTG ACCAAATTCG AGCTGGAAAC CTACTTCATC
GAGGTCTCGC GGGCCAACAT GGCGGTGTAC GTCATCGGCC GCGAGCGCAG CTATAAAGGC
CAACTGCGCA CCTTCAAACT CGCCCGGATG CAGTTCGTCA CCTTGGTCGG TCCCACCGAG
GCCTACGAGA TCCCGGCCAG CTTCGATCCT CGGGAGTACC TCTCTGACGC CTGGGGCATC
GTGGGGGGTA GGGGAGACCC GGTCCGGGTG CGGCTGCGCT TTTCGCCGCA AGCCGTCCCG
CGCATTCGGG AGGGAGGCTA CCCCAACCTC CAGGAGTTGG AGGAGGGTGA CGACGGCAGC
CTGGTGGTGG AGGTCGCGGT CGGAGCCGAC GAGGACGGTT TCCCCATCGA GCTGCTCTCG
TGGGTGCAAA GCTGGGGACC GAGGGTGGAG GTGCTGGAGC CCGAGGGCCT CCGGCAGGTC
TGGTTGGCCG AGGCCCACCA GCTGCTCGAA AGGTACGACC CGCAGAGGGT GGCGGGCCCC
AAAACCTACT GGGCCCACAC CCACAAAGAC CCCACCCGCT GGCAAACCCT GCAGGAACAC
GCCGCCCAGG TAGCCCGCCT GGCCGCCGAA AAAGCCCAGC CCTTTGGTGA TTCGGAGCAG
GCCGACCTAG CGGGGAAGCT GCACGACCTG GGCAAATACG GCGACCTTTT TCAGCGCCGC
CTGGAGGGCA AGGAAAAAGG CCTCGACCAC TGGTCGGCGG GGGCGCATCT GGCCTTGTTC
GAGTACCGCG CCCCCGCGCT GGCCTTGGCC ATTCAGGGCC ACCACATCGG CCTCCAGAGC
GGGGCCAAAG AAAACCTCAT GGCGATGCGG CTCCGCGAGG ATGGCCAAGG CTTCCCGCCC
GAGCTGCGCC TGAGCGAGAC CCGCCTGGAG GTGCTCAAAT CGCGCATGCA ATCCGATGGA
CTCGAGTTAC CCCCACCCAG CCAGGCCCCG ATAAGGCTGC CCCAAAGCGC CGCGGCCATG
CTCGAGACCC GCATGCTTTT CTCGGCTCTA GTCGACGCCG ACTTTCTGGA CACCGAGCAG
CATATGCGAG GCCCCGAAGT CCGCCGCCCT GAGCCCCCGC CTTTGCAGGC TCGGGAAGCC
TTGGAGCGGC TCGAGGCCCG GCTAGCCCAG CTCGCCGCCG ACCCCGGCAT TCCCCCCCAA
ATCCGCGCCC TGCGGCAGAC GGTGGCCCAG GCTTGCGCCG AGGCCGCCTT AGAGGGCCGC
CTCTTTACCC TCACCGCCCC CACCGGCAGC GGCAAAACCC TGGCCATGCT GCGCTTTGCC
CTGAAGCGGG CTGCGCAAGA CCCGCGCATA CGGCGCATCG TGGTGGTGCT GCCCTACCTG
TCCATCCTCG ACCAGACCGT GCAGATCTAC CGCGAACTCT TCGCCGATTT TGGTTCCCAC
TACATCCTGG AAGACCACAG CCTGGCCTAC CGCCCGTTGC GCAAGGAGCT TTCCGACGAG
CAGGACATCC TGGAACGGGA GCGCCGCCTC CTGAGCGAGA ACTGGGAAGC CCCCATCATC
CTCACCACCC ACGTGCAACT GCTGGAGAGC CTGCACGCGA GCCGCCCCGG CGCCTGCCGC
AAGCTGCACA ACCTGGCCGG GAGCGTGCTG CTGTTTGACG AGGTACAAAC CCTGCCCACC
CCCCTGGCCG TGCCCACCCT CAAAACCCTC GCGCACCTGG CCAGCGACAG GTACGGCTCG
GTGGTGGTCT TCGCCACTGC CACCCAGCCC GCCTTCGATA CCCTGCACGC CAAAGTCCAG
GAAAACGAAC CGCAGGGCTG GCAGCCGCGG GAGATTGTGC CCAACCCGGC AGCGCTCTTT
GGCCAGAGCC AGCGGGTAGC GGTGGAATGG TGGCTCAAAA ACCCCACCCC CAACCCGGCT
CTGACGACCC TGCTGGCTGC CGATAGCCAG GCCCTGGTGG TGCTCAACCT CAAGCGGCAG
GCCCAGGCCC TCTTTCAGCA GGCCCAGGAG CGGGGCTTAG AGGGGCTTTA TCATCTATCC
ACCGCGCTGT GCCCGGCCCA CCGCAAGGAG GTGCTGAGAG AGATTAAAGC CCGCCTGGAC
CGGCGAGAAC CTTGTCGTCT GATCGCCACC CAGGTGGTGG AGGCCGGGGT AGAACTGGAC
TTTCCCATAG GCTACCGGGC TTTGGGGCCT TTGGAGGCCA TCGCCCAGAC TGCCGGACGG
GTCAACCGGC ATGGCCTGCG TTGGGAAGGA CGTCTGGTGG TCTTTTTGCC GGAGGACGAG
GGCTACCCCG ACCGGGCCTA CGCCCAGGCC GCAAAGCTCA CCCTAGCCTT GCAGGCCGAG
GGGGGCCTCG AGCTCACCCC CGCCACCTTC CGCCGCTACT ACCAAAGCCT TTACGGGATC
CAACAGGTGA GCGACCCCGA GATCGAAGGG CATATCAAAA CCCAAAACTA CGCCGAACTG
GCCCGCCGCT ACCGCATAAT TGAAACCCCG GCAGTGAACG TGGTGGTGCC TTACAACGAA
GAGGCCCTGG CCCTCATGCA AGAAGCCCGC GAGCACGGCA TCGCCGCCGA CTGGCTGCAC
CGGGCGCGGC CCTACACGGT GCCCTTCTTC TTGCCCCACG GGGGGCCGCC GGCCTTCCTC
GAGCCGGTCT TTCTGCGCTA CGGTCGCCAA AATAGCGAGG TGCCCGACTG GTTTTTGTCC
CCCGACCCGG CCTGCTACCA CGCTTTGCTG GGCTTCACCC CTGCCGAGGG CGGGCCGGGG
AGCTTGGTTA TTTGA
 
Protein sequence
MEPHDPSLKA THEKALRLIR LLELLKLKPW TAQQLAQALG IKKRAVLYYL QDLQELGSHL 
GFRLIHDEIR RTYSLNAGVV LSDIEKVVIH TALRMLYYHS PGHNQQYQEA LLKLARGLPE
PARSVAQKSV EAMAARGPGL EGSNLEKITN AWFRRQLVQF HYQLPGRREL TKFELETYFI
EVSRANMAVY VIGRERSYKG QLRTFKLARM QFVTLVGPTE AYEIPASFDP REYLSDAWGI
VGGRGDPVRV RLRFSPQAVP RIREGGYPNL QELEEGDDGS LVVEVAVGAD EDGFPIELLS
WVQSWGPRVE VLEPEGLRQV WLAEAHQLLE RYDPQRVAGP KTYWAHTHKD PTRWQTLQEH
AAQVARLAAE KAQPFGDSEQ ADLAGKLHDL GKYGDLFQRR LEGKEKGLDH WSAGAHLALF
EYRAPALALA IQGHHIGLQS GAKENLMAMR LREDGQGFPP ELRLSETRLE VLKSRMQSDG
LELPPPSQAP IRLPQSAAAM LETRMLFSAL VDADFLDTEQ HMRGPEVRRP EPPPLQAREA
LERLEARLAQ LAADPGIPPQ IRALRQTVAQ ACAEAALEGR LFTLTAPTGS GKTLAMLRFA
LKRAAQDPRI RRIVVVLPYL SILDQTVQIY RELFADFGSH YILEDHSLAY RPLRKELSDE
QDILERERRL LSENWEAPII LTTHVQLLES LHASRPGACR KLHNLAGSVL LFDEVQTLPT
PLAVPTLKTL AHLASDRYGS VVVFATATQP AFDTLHAKVQ ENEPQGWQPR EIVPNPAALF
GQSQRVAVEW WLKNPTPNPA LTTLLAADSQ ALVVLNLKRQ AQALFQQAQE RGLEGLYHLS
TALCPAHRKE VLREIKARLD RREPCRLIAT QVVEAGVELD FPIGYRALGP LEAIAQTAGR
VNRHGLRWEG RLVVFLPEDE GYPDRAYAQA AKLTLALQAE GGLELTPATF RRYYQSLYGI
QQVSDPEIEG HIKTQNYAEL ARRYRIIETP AVNVVVPYNE EALALMQEAR EHGIAADWLH
RARPYTVPFF LPHGGPPAFL EPVFLRYGRQ NSEVPDWFLS PDPACYHALL GFTPAEGGPG
SLVI