Gene RoseRS_1865 details


Gene Information

Locus tag: RoseRS_1865
Symbol:
ID: 5208825
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 2307164
End bp: 2308657
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 56%
IMG OID: 640595473
Product: CRISPR-associated Cst1 family protein
Protein accession: YP_001276204
Protein GI: 148655999
COG category:
COG ID:
TIGRFAM ID: [TIGR01908] CRISPR-associated CXXC_CXXC protein Cst1
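
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2307164, 2308657)
print(length)                     # 1494
print(protein_length_aa(length))  # 497
```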


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.858378
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCACCT ATACCGGACA TCCATTCGTC GATACGGGGT TTGCCGTGAT CACCGCTTTT 
GTGCGTAAAC GGCGCTTTGC CGATCTCGCC GACGACGATT TTCAGCAGAT CGCCGACTAT
ATCGAAGCGA ACTATGTGCG ACAGCCCCTG CGCAGCTTTT TGACCGTGGC GTTTACCAGT
AATGCATGGT TCGCGCAATC GGCGTTCAAT CCCGACCGGC CCGACCTGTC GCCGGAAAAA
CAGACTGAAG CGCGTGAGAA GCGCCAGTAC TGGGCGGATC GGCATTTGCG CCAGTGGCAG
CAGAGCGCTG CTGCGCTCGA AACCTGCCTT TTCACCGGAT TACCGGCGGC AGGTCTTGAA
TTGTCGCAGA AGTTGCAACC GGGACGGGTA GGGCGGGCGC AAATGCCATT GCTTCAGGGT
GATGATGCGA TCAACTTCTT TATCAATGGC GACCCTGGTT TGCCGATGGC GGCGGAAGCG
ATTCTGGCAC TCCAGGCGAT GCCTCTGGGA TGCGCTAAAG TCGGTGGGGG CTTGCTCGCC
GTGCACTGCG ATGATGAGGC GTTGACGATC GCCTTCGCAA CACGCTTCTT GCAGCGCAAT
CTCAACGATG TGGCGAAAGC GCAGGCTGCC GGCGAAAAGA AACTGCCCGG TTCGCCGCGC
AGTCTGAAGA CGCTGCTGGT TGAGACATTG ACCGAGATTC TGATTCGGCA GATTCAGGAA
GAGGAGCGAC GCGCACGGCG TCCGGCGATC ACGGCCTACT ATTTCAACAA TGGTCAGTCG
CCGTTTCTTG AAATCTACCA TCTGCCGCTC CAGATTACCG GTTTTCTCCT GGCAGTGCAT
ACCCCTGCCT ACCGCGCGAT CTGGAATGAA CTGGTGCAAC GTGGCTGGCA GCGCGCAGGA
ATATCAGGCA AGCAGGGGAA GGCAGTCGAT CCGGTTGAAC CACATTTCAA CTATCTGTAC
GAAGACCTTT TTACCCTGCC GGCGCAGGCG GCGCGGTTCG TGCGCACCTA TTTTCTGCGC
ATTCCCGATC TTCGTCGCTC AGCGGACGAT CCGCGGCGCG AGTATTCGCC GCGCCGCGAA
GCCGATCTGG TTTCATGGCC GCTCGTTGAA CTCTTTGCAC AGGAGGTATT GCTCATGACC
GATGACCGGG TAACGAAATT GAAGGAGTTG GGCGATAAAC TGGCTGATTA CACCCGTTAT
CAGGGAGGTA AGCGTTTTTT TCGCCAGTTC TTTGTCGAGC AGCGAAGTGA TAACTTTCTC
AGCCTGCTGA ACAAAGCCAA TATCGACTAC ACGCGCTACA AGCGTGGTCA GGAGACATTG
TTCGATCTCG ATAGTTTTCT GACCATTTTT ATGGAGGGCG ATGAGGTCTT GCGCAAGGAC
TGGCGCCTGA TGCGAGATCT GGTGCTCATT CGTATGGTCG AACAGTTGCG CGACTGGATC
GCCGGCAACC CTGATGCCAT TCCAGCCGAA GAAGAAGTTG CAGCAAACGA ATAG
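
The gene sequence is laid out in groups of ten bases, so it has to be stripped of whitespace before it can be measured. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block; uppercase it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, as sample input.
first_line = "ATGATCACCT ATACCGGACA TCCATTCGTC GATACGGGGT TTGCCGTGAT CACCGCTTTT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC fraction of this line: {gc_fraction(seq):.2f}")
```

Running the same two functions on the complete sequence block should give a length that matches the Gene Length field and a GC fraction comparable to the GC content field.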
 
Protein sequence
MITYTGHPFV DTGFAVITAF VRKRRFADLA DDDFQQIADY IEANYVRQPL RSFLTVAFTS 
NAWFAQSAFN PDRPDLSPEK QTEAREKRQY WADRHLRQWQ QSAAALETCL FTGLPAAGLE
LSQKLQPGRV GRAQMPLLQG DDAINFFING DPGLPMAAEA ILALQAMPLG CAKVGGGLLA
VHCDDEALTI AFATRFLQRN LNDVAKAQAA GEKKLPGSPR SLKTLLVETL TEILIRQIQE
EERRARRPAI TAYYFNNGQS PFLEIYHLPL QITGFLLAVH TPAYRAIWNE LVQRGWQRAG
ISGKQGKAVD PVEPHFNYLY EDLFTLPAQA ARFVRTYFLR IPDLRRSADD PRREYSPRRE
ADLVSWPLVE LFAQEVLLMT DDRVTKLKEL GDKLADYTRY QGGKRFFRQF FVEQRSDNFL
SLLNKANIDY TRYKRGQETL FDLDSFLTIF MEGDEVLRKD WRLMRDLVLI RMVEQLRDWI
AGNPDAIPAE EEVAANE
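
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below builds the codon table from the conventional TCAG-ordered amino acid string; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons, which this sketch does not model (the gene here starts with ATG, so that does not matter). The `translate` function is a hypothetical helper, not a library call.

```python
from itertools import product

# Codons in TCAG order paired with their amino acids ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First seven codons of the gene sequence above.
print(translate("ATGATCACCT ATACCGGACA T"))  # MITYTGH
```

The output matches the first seven residues of the protein sequence listed above.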