Gene Rmar_2841 details


Gene Information

Locus tag: Rmar_2841
Symbol:
ID: 8569512
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodothermus marinus DSM 4252
Kingdom: Bacteria
Replicon accession: NC_013502
Strand:
Start bp: 44730
End bp: 46307
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: CRISPR-associated protein Cas1
Protein accession: YP_003292096
Protein GI: 268318378
COG category:
COG ID:
TIGRFAM ID:
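
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(44730, 46307)
print(length)                  # 1578
print(protein_length(length))  # 525
```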


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00141566
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGGTTT CGCACGAGGG GGTTTGCTGG CCGGTGGCAT GGGTGTACGA GTTTGCCTAC 
TGCCTGCGGC GGGGCTACTA CGCCTGGGTT GAAGGGCCCG GAGAGCCTGT TGCGCTGGAT
ACCGGAGAAG GCGTGTGGCT GGCCTCGGAC GCCTGGTGTG GGCCGGTGCG GTGGGTACAG
GACGGTGCCG GTTGTCCCGT TCCCCTGCTA CGGGTGCCCC GGGCATGCGC CGACGATTCC
GACAGACGCC CCGAACATCT GATGCTGGCG GCCCTTGCCG ATCTGGTGGA GGCGCATACA
GGGCAACCAG TGCGTCGGGG TCTTGTTCTC GACGCAACAA CAGGGCACCG GGAAGAGGTA
ACGCTCGACG AAACGTTACG GGCGGCCTGG CAGTCGCTCG ACAGGCGTAT TCGGGCCGTC
ATTCATGAAG GAAGCGTGCC GCCGCCAAGG GCCGACGGGC GCTGCCATCA CTGCCCACGT
CTGGAGGTGT GCCGACCTTT TGACGGCCAT TCGAAAGAGG ATGCGGTGGC GATCATGCCA
CCGGTTCGGC AGGCACGTAC GCTCTACGTG GACGAGATCG GAGCGGTCGT ACGCCGCAAG
GGACGGCAGC TTGTCGTCAC CGTCAGCCGC GACGGCAGGC GGCAGGAACT GCTGCGTGTG
CCCGCCCTGC TGGTGGATCA GGTGGTGCTG GTCGGACCCG TCCAGATCAC CTCGCAGGCG
CTGCGGATGC TGCTACGTCG GAACGTGGAT ATCGTGTATC TATCAGGCGA GGGACGCTTC
GAGGGAAGGC TGGCGGCTGA GTTCCATCCG CACGTGGCAT TGCGGCTGGC ACAGTACGAG
GCCTTTCGCG ATCCTGAGCG GACCCTTACG CTGGCCCGCC TGTTTGTGCG GGGCAAGTTG
CAGAACATGG CCGGCCTGTT GCGCCGCTAT GCCGACGAGT ATGGCAGCGC TTCGTTGCGC
GCCGCGGCTT CCGAAATCAA CCGCGATCTG GAGCGCCTCG AGCAGGTTAC CACGCTGGAG
GCGTTGCGCG GTGTGGAGGG CACGGCAAGC CGACGCTACT TTTCGGTTTT CGGCGAAATG
CTGCGTGCCG AGGCCTATGC CCCGACCGGC TGGCCCGCGT TCCCGGGGCG CCACCGGCGA
CCGCCTACCG ATCCCGTCAA CGCCACGCTG GGCTATCTCT ACGCGTTGCT ACTGGGTAAC
GTGGTGGCGG CCTGCGCGCT GGCCGGGCTG GATCCCTACG TGGGCTATCT ACATGCGCCG
GCTTACGGGC GCCCGTCGCT GGCACTCGAC CTGATGGAAG AGTTTCGCGC GCCCGCGGCC
GATCGGCTTG CGCTGCGGCT GTTCAACCGG GGACGACTGC GGCCCCAACA CTTCGAGGAG
CGCAACGGCG GGGTGTACCT GAACGAAGCG GGCCGAGCCG TCGTGCTGGA AGCCTGGCAG
GCGCACCGCC AGCAAACCTC GGCGCATCCG GTGCTCGGCA TGGAGTTGTC GCTGGCCCGG
CATTTCGAGG CGCAGGCCCG GCTGCTTGCC CGGGCGCTTC AGGAGCAGGG TATCGCCTAC
ACGCCATTTG TCGCATAG
 
Protein sequence
MQVSHEGVCW PVAWVYEFAY CLRRGYYAWV EGPGEPVALD TGEGVWLASD AWCGPVRWVQ 
DGAGCPVPLL RVPRACADDS DRRPEHLMLA ALADLVEAHT GQPVRRGLVL DATTGHREEV
TLDETLRAAW QSLDRRIRAV IHEGSVPPPR ADGRCHHCPR LEVCRPFDGH SKEDAVAIMP
PVRQARTLYV DEIGAVVRRK GRQLVVTVSR DGRRQELLRV PALLVDQVVL VGPVQITSQA
LRMLLRRNVD IVYLSGEGRF EGRLAAEFHP HVALRLAQYE AFRDPERTLT LARLFVRGKL
QNMAGLLRRY ADEYGSASLR AAASEINRDL ERLEQVTTLE ALRGVEGTAS RRYFSVFGEM
LRAEAYAPTG WPAFPGRHRR PPTDPVNATL GYLYALLLGN VVAACALAGL DPYVGYLHAP
AYGRPSLALD LMEEFRAPAA DRLALRLFNR GRLRPQHFEE RNGGVYLNEA GRAVVLEAWQ
AHRQQTSAHP VLGMELSLAR HFEAQARLLA RALQEQGIAY TPFVA
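
The gene and protein sequences above can be cross-checked against each other and against the listed GC content. The following is a minimal sketch, not the database's own method. It assumes the codon assignments of translation table 11, which match the standard genetic code (the tables differ in their permitted start codons). The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments of the standard code,
# assumed here to apply to translation table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    A trailing partial codon is ignored.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence listed above.
print(translate("ATGCAGGTTT CGCACGAGGG"))  # MQVSHE
print(translate("ACGCCATTTG TCGCATAG"))    # TPFVA
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared with the Protein sequence and GC content fields of this record.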