Gene Mesil_1230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMesil_1230 
Symbol 
ID9250724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMeiothermus silvanus DSM 9946 
KingdomBacteria 
Replicon accessionNC_014212 
Strand
Start bp1214412 
End bp1216097 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content64% 
IMG OID 
ProductCRISPR-associated protein, Cse1 family 
Protein accessionYP_003684635 
Protein GI297565663 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000214368 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.502097 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCACACGT TCAACCTAAT AACCCAGCCC TGGATTCCCG TAAGGGAGGG CAACCAACTA 
AAGGAAGTGA GCCTCGAGCA GGCCCTGCTC GAGGGCCGTC GATTCGAGCG TATCGAAGAC
CCCAGTCCGC TCGTGACCGT AGCGCTATAT CGCTTGCTCC TGGCCATTTT GCACCGGGCA
CTGCAAGGCC CAGAGAACTC CGACGAGGCG GCAAAGTGGT TCAGCAACGG TTTTGACGCC
GAGAAGATTC GGGATTATCT GGCCAAACAC CAAGACCGCT TTGATTTGTT CCACCCCGAA
CGGCCCTTCT ACCAGGTGCC TGACTTCACC CTCGAGCGCT CCTGCCGCTC CTGGACGGTG
CTGGCCCCCG AACTCAACTC CGACAACAAC AAGGTTTTGT TCGACCACAC CGTCACCTCG
AGGCCCCGCC CCCTCCACCC CGCCGAGGCC GCCCGATTGC TGGTGGCCAA CCAGACCTTC
GCCCTTTCGG CGGGCAAGAG CGTGCTTTGC CACACCGCTA CCGCGCCCGT GGCGACGGCA
GCATTGGCCC TCATGCTGGG CGAGAACCTC CACGAGACGC TGTGTTTGAA CCTCGTCAGC
TATCCCAAAA GCGAGTACGA GCGCGATTTT GCCACCTGGG AGCGGGAGCC GCTGCGGGTA
TCCGACCTGA AAAACTGCGA GGCCGCCAGG GCCACCCCCA AGGGCATCGT TCATCGCTAC
ACCTGGCTCT CGCGCGCGGT GCGCCTCGAT CCCGAAGAGG GAAACGGCCA GGCGGACGAT
CCCCTGTCGG GACGCTTTGC GAGCGTACAC CGGACGGACG ATCCCCTGTC GGGACAGGGT
ACTCACGCGC CTGTTCACCC GGGACGATCC CCTGCGGGAC AGACAGCTCA CGCCGCTGTT
TACCGGACCG TGGTGCGCTG GATCGCCTAC GCTTCGGGGA TTCGCTACGA GGAAGCCGCC
ATACGCCCCG ACCCCATGGT GGCCTTCCGC CCCGACCCCA AGGACCTCTC CAAGCAGTAT
CCGCTCGGCT TTCGCGAGGG TCGGGCGCTG TGGCGAGACT TCGCCTCGCT CTTGCCCCGG
CCAGGTTCCG CGCACAGCCC GAGGGTGGTG GAGCACGCCC GTAACGTCTA CAGGGCGCTG
GGGACTCGGT TTAAGGGACG GGGCATTCCG GTAATGGTCG CGGGCCAGGC CAACGACCAG
GCCAAGGTCG AACTCTGGCG GGGCGAGGTT TACCGGCTGC CCGAGGCCAT CCTGAGTGAT
AAGGACATCT GGCGCTTTGT GGAGGAGAAC CTGGAGAGAG CCGAAGAAAT GGGAAGGGCC
TTGAACGGGG CGGCCCGAGC CCTGGCCACA CAACTCCTAA CTCTGGGAGA CCGGCAGCCG
CACAAGGACG ACGTGATCAA GCTGATGCAG AGCTTTCCCC ACCAAGCCGC CTACTGGTCG
GCGCTCGAGG GCCAGTTCGC CAACTGGATC GTGCGGCTGG GCCCCGATTT CGAAGAGCAG
CAAGCCCGGC TCGAGCAGGA CTGGCTAAAG ACCCTCCAGC GCGAAGCCTT GCAGGCCTGG
CAGCTCACCA AACTCGCCGC CGGGGACGAT GCCAGGGCTT TGCGGGCCAT TCACAAGAGC
GAAGGCATCC TGCTGGCCTA TATCTACGGC AAAGGGAAGG AGGAAGCGGG TGCAAAAGGA
AATTAG
 
Protein sequence
MHTFNLITQP WIPVREGNQL KEVSLEQALL EGRRFERIED PSPLVTVALY RLLLAILHRA 
LQGPENSDEA AKWFSNGFDA EKIRDYLAKH QDRFDLFHPE RPFYQVPDFT LERSCRSWTV
LAPELNSDNN KVLFDHTVTS RPRPLHPAEA ARLLVANQTF ALSAGKSVLC HTATAPVATA
ALALMLGENL HETLCLNLVS YPKSEYERDF ATWEREPLRV SDLKNCEAAR ATPKGIVHRY
TWLSRAVRLD PEEGNGQADD PLSGRFASVH RTDDPLSGQG THAPVHPGRS PAGQTAHAAV
YRTVVRWIAY ASGIRYEEAA IRPDPMVAFR PDPKDLSKQY PLGFREGRAL WRDFASLLPR
PGSAHSPRVV EHARNVYRAL GTRFKGRGIP VMVAGQANDQ AKVELWRGEV YRLPEAILSD
KDIWRFVEEN LERAEEMGRA LNGAARALAT QLLTLGDRQP HKDDVIKLMQ SFPHQAAYWS
ALEGQFANWI VRLGPDFEEQ QARLEQDWLK TLQREALQAW QLTKLAAGDD ARALRAIHKS
EGILLAYIYG KGKEEAGAKG N