Gene Rcas_3298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3298 
Symbol 
ID5540796 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4287700 
End bp4289205 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content58% 
IMG OID640895416 
ProductCRISPR-associated Cst1 family protein 
Protein accessionYP_001433367 
Protein GI156743238 
COG category 
COG ID 
TIGRFAM ID[TIGR01908] CRISPR-associated CXXC_CXXC protein Cst1 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.696898 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAACC CGATAGCGTA CACCGGCCAT CCCTTCATCG ATGTCGGCTT TGCCACGATG 
TGCGCTTTGA CCTGCAAGCG TCGCTTTGCC GATCTGACAG CAGATGATTT TCAAAAGGTC
GTCGATTATA TCGAGACCAA CTACGTGCGC CAGCCGTTGC GCAGTTTTCT AACGGTGGCG
TTCACCAGCA ATGCATGGTT CGCCCAATCG GCGTTCAACC CTGATCGGTT TGATGACCCT
AACAAGAAGA ACGAAGCGCA GCAGAAACGC ACGTATTGGG CGGATCGACA CCTGCGCCAG
TGGGCGCAGG CTGCTGAGTC GCTCGAAACC TGCCTCTTCA CCGGACTTCC GGCAGCGGCG
CTCGAGTTGT CGGGCAAACT GCAACCAGGT CGGGTTGGGC GGGCGCAAAT GCCCCTGTTG
CAGGGTGATG ACTCGATCAA CTTCTTCACC AACGGAGATC CAGGATTGCC GATGGCGCCG
GAGGCGATTC TGGCGCTCCA GGCGATGCCG TTGGGCTGCG CCAAGGTTGG CGGCGGGCTG
CTGGCGGTGC ACTGTGATGA TGAAGCATTG ACAATTGAGT TTGCCGGGCA GTTTTTGCAG
CGTAATCTCG CCGATGTCAC CAAAGCGCAG GCAGCCGGTG AAGAGAAGCT GCCCGGATCA
CCGCGTAGCT TGAAGACGTT GTTGATCGAA ACGCTCAATG CCATTCAAAC GCGGCAGGCG
CAGGAGGAGT GGCGGCGCCA ACACCGGCCG GCCATCACGG CCTACTATTT CAACAATAGT
CAATCGCCCT CGCTCGAAAT CTACTACTTG CCATTACAGA TCACCGGTTT TCTGAGCGCT
GTTCACACTC CCACGTATCG CGCGCTCTGG AATGAACTGG TCGCGCGCAG CTGGCAGCGC
CCGGCAGCGG CGGGCAAGCG AGGAAAGGCG ACGGAACCAA CAGAGCCGCG CTTCAATTAT
CTGTTCGAAG ACCTCTTTAC CCTGCCAGCG CAGGCGGCGC GCTTTGTGCG CACCTATTTT
CTGCGCATTC CCGATCTGCG TCGTCCGGCG GATGACCCGC GGCGCGCCTA TTCGCCACGC
CGCGAAGTCG ATCTTGTTTC ATGGACCCTC GTTGAACTCT TTGTGCAGGA GGTAATGCTG
ATGACCGATG ACCGGGTAGC CAAATTGAAG GAACTGGGCG ATAAACTGGC CGACTATACG
CGCGCTCAGG GCGGCAAACG CTTCTTTCGC CAGTTCTTTA CCGTGCAGCG CACCGATCAC
TTCCTGTCGC TGCTCAACAA GACCAATATC GATTATACGC GCTACAAGGG CGGCGCGGAG
ACGCTGTTCG ATCTCGATAG CTTTCTCACC CTCTTTATGG AAGGTGAAGA GGTCCTGCGA
TCCGACTGGC GATTGATCCG CGATCTGGTG CTCATCCGCA TGGTCGAGCA ATTGCGCGAC
TGGATCGCTA ACAACGCGGA TGCTGTACCT TCCGAGGAGG AAGTTACAAT TGCCGAACCA
GCCTGA
 
Protein sequence
MPNPIAYTGH PFIDVGFATM CALTCKRRFA DLTADDFQKV VDYIETNYVR QPLRSFLTVA 
FTSNAWFAQS AFNPDRFDDP NKKNEAQQKR TYWADRHLRQ WAQAAESLET CLFTGLPAAA
LELSGKLQPG RVGRAQMPLL QGDDSINFFT NGDPGLPMAP EAILALQAMP LGCAKVGGGL
LAVHCDDEAL TIEFAGQFLQ RNLADVTKAQ AAGEEKLPGS PRSLKTLLIE TLNAIQTRQA
QEEWRRQHRP AITAYYFNNS QSPSLEIYYL PLQITGFLSA VHTPTYRALW NELVARSWQR
PAAAGKRGKA TEPTEPRFNY LFEDLFTLPA QAARFVRTYF LRIPDLRRPA DDPRRAYSPR
REVDLVSWTL VELFVQEVML MTDDRVAKLK ELGDKLADYT RAQGGKRFFR QFFTVQRTDH
FLSLLNKTNI DYTRYKGGAE TLFDLDSFLT LFMEGEEVLR SDWRLIRDLV LIRMVEQLRD
WIANNADAVP SEEEVTIAEP A