Gene Mesil_1233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMesil_1233 
Symbol 
ID9250727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMeiothermus silvanus DSM 9946 
KingdomBacteria 
Replicon accessionNC_014212 
Strand
Start bp1217218 
End bp1218384 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content60% 
IMG OID 
ProductCRISPR-associated protein, Cse4 family 
Protein accessionYP_003684638 
Protein GI297565666 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00111756 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.718248 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACACC TGCTCGAGAT CCACATCTTG CAAAACTTTG CCCCCAGCAA CCTCAACCGC 
GACGATACCG GTTCTCCCAA GGACGCCATT TTTGGCGGGG TGCGCCGGGG GCGCATCAGC
AGCCAGTGCC TCAAGCGGGC CGCGCGGGAG TATGTGCGCG ATCACCCGGG CGGCCTGCCT
CAGGAGGCGC TGGCTTTGCG CACCAAGCGG CTGGTGCAAG CATTGGTAGA ACAGCTTAAG
GCCAAGGGCC GGGACGAGGA GGAAGCCCGG CAGAAAGTAG AGCAAGCCTT AGGTGGGATG
GGCCTGAAGG TAGATGCAGA GGGCAAAACC CAGTACCTGC TGTTCTTGGG CAAGCAAGAG
GTAGCGAGAA TTGCCGACCT TATCGAACAG CACTGGGATG GCCTGGTGGC CCCCCAAGCC
GAGGAGGAGG GGGGTAAAAA GAAAGCCAGG GAAGCTAAGA AAGCTGCCAA GGAAGCCGTC
CCTGACGAGA TCAAAAAAGC TTTGGGCAGT GTGCTGGATG GGGGCAAGGC CCTGGATGTA
GCGCTCTTTG GCCGCATGCT GGCCGATTTG CCTGAGAAGA ACCAGGACGC CGCCTGCCAG
GTAGCCCACG CCATCTCCAC CCACGCCGTC GAGCGCGAGT TCGACTTCTA CACCGCCGTG
GACGACCTCA AACCTGACGA CAACGCCGGG GCGGACATGC TGGGCACGGT AGAGTTCAAC
TCGGCCTGCT TCTACCGTTA TGCGGCCATA GACCTCGAGA AGCTACGTGC GAACCTCCAG
GGCGATGCCG AGCTGATGCT TAAGAGCCTC GAGGCTTTCC TCAGGGCCAT GGTCAAGGCC
AAGCCCAGCG GAAAGCAAAA CTCCTTCGCC GCCCACAATG ACCCGGAGTA CGTCGTTTTC
ACCGTGCGCC AGGAGGCCGA CCCGCGCAAC CTGGCCAACG CCTTTGAGAA GCCGATTCGT
CCTAACAAGG AGAAGAGCCT CACCGAGGCT TCGCTGGAGC AGCTCGAGGC CAAGTGGCAG
AAACTCTCCG AGGCCTACGA CCAAAATGGA GAGGCCTGGG TACTCAACCT GACCGAGGTA
AAAAGCCAAA TCGGCACACC TGTCAAAAAC CTGGGCGAAC TCGTCGCAAA GGCGCTGGAA
AAGGTCAGGG CTAACATGGG GGTCTGA
 
Protein sequence
MKHLLEIHIL QNFAPSNLNR DDTGSPKDAI FGGVRRGRIS SQCLKRAARE YVRDHPGGLP 
QEALALRTKR LVQALVEQLK AKGRDEEEAR QKVEQALGGM GLKVDAEGKT QYLLFLGKQE
VARIADLIEQ HWDGLVAPQA EEEGGKKKAR EAKKAAKEAV PDEIKKALGS VLDGGKALDV
ALFGRMLADL PEKNQDAACQ VAHAISTHAV EREFDFYTAV DDLKPDDNAG ADMLGTVEFN
SACFYRYAAI DLEKLRANLQ GDAELMLKSL EAFLRAMVKA KPSGKQNSFA AHNDPEYVVF
TVRQEADPRN LANAFEKPIR PNKEKSLTEA SLEQLEAKWQ KLSEAYDQNG EAWVLNLTEV
KSQIGTPVKN LGELVAKALE KVRANMGV