Gene Rcas_3288 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3288 
Symbol 
ID5540786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4270864 
End bp4272114 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content49% 
IMG OID640895406 
ProductCRISPR-associated TM1812 family protein 
Protein accessionYP_001433357 
Protein GI156743228 
COG category 
COG ID 
TIGRFAM ID[TIGR02221] CRISPR-associated protein, TM1812 family
[TIGR02549] CRISPR-associated DxTHG motif protein 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.392091 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCCA TTACCTTCCT CGGCGCTAGA CCGCAAGAAA CATGCTATGT CTTTCCAGAT 
GGTTGCGAGC ATGTTGCGCC CTTTTTCGGG CTAGCATTGG CTCAGTACCT TCCAGATCTG
GATTTCGTTG TTTTTACTAC TGAACTGACG GCTCAATTCT ACCATCAATA TTTTGCCAGT
GCGCAAGCTG CGAGCATTCA AGCAGTTCGC ATCCCTGATG GGCGCGATGA TGCTGAATTG
TGGCAGATCT TTCAGGCTGT GGTTGAAATC ATTGAACCGA ACGAAGCGGT TGTGTTCGAT
ATTACTCACG GCTTTCGCTC GCTGCCATTC TTATCCTTCC TCGCCGCTGC CTATTTGCGC
AAAGTCAAGG CCATTCAACT AAAACATGTA TTCTTCGGTA ATTTTGAAAT GCGTGATCAG
AGTGTCACTC CTCATCGCAC GCCTGTGCTT GATCTGACCA ATTTTGTCGA ATTACTCGAC
TGGATGGTCG GGGCCGACCT GTTTGTGCGT TTCGGTGATG CCCGTGATTT GGCTACGTTG
CTGCATACGC AGCATAACCG GGTCAAGCCC GATCCAAAAA CTGCTAGTAA GGATGAAATG
GCTGCTTGGA ACAATTCACC AATTAAAGCA ACAGCGAAGA ATCTGACGAA AGTCTCAAAG
GCATTGCGCG TCGTGCGACC GGCTGAGGTT ATGGAGGTGA GCGAGCAAAT CTACCAACAG
TTACCGCAGG CTATTTCATC GATTGGTTCT CTAGCGAGGC CCTTCAATCC TCTCGCTGAG
CAAGTTATCA ATAGCTTTCA AAAGATTGCG CTTGGCGACA ATGAGCTTAA GTCCGAGCGG
GAATTGATTG GCTGGTATCT GGATCGCAAT CAGGTTTTCC AGGCAGTGGC ACTGGCGCGT
GAGTGGTTAA TTTCATGGAC AATGGCGCAG CTTGGGCTGT ATGACCAATT GCTGGAAAGA
GACGTTCGCA AGCGTGTAGA AGACGCATTG GGCGCAGAAG TGCAGAGACA AAGCAAGAAT
ATGACTGCTG AAGAACATTC GGGCGAAGGG CTTGACCTCA GCAAGTTGCC ATCATGCAAC
GATGTGGTCA AGCTCTTTGG TCAGCTCGGT GAATTACGTA ACGATTTGAT GCACGCCGGA
AAGCGAAAAA ACCCGCTTAC CGCCAATAAG GTTGAGGAAC GGGCAAGTCT TCTTTATGAA
CAAGTGAAAA AACTTGAGGT TGACAATGTC GTGAACAGCG ACTGCGCGTG A
 
Protein sequence
MKAITFLGAR PQETCYVFPD GCEHVAPFFG LALAQYLPDL DFVVFTTELT AQFYHQYFAS 
AQAASIQAVR IPDGRDDAEL WQIFQAVVEI IEPNEAVVFD ITHGFRSLPF LSFLAAAYLR
KVKAIQLKHV FFGNFEMRDQ SVTPHRTPVL DLTNFVELLD WMVGADLFVR FGDARDLATL
LHTQHNRVKP DPKTASKDEM AAWNNSPIKA TAKNLTKVSK ALRVVRPAEV MEVSEQIYQQ
LPQAISSIGS LARPFNPLAE QVINSFQKIA LGDNELKSER ELIGWYLDRN QVFQAVALAR
EWLISWTMAQ LGLYDQLLER DVRKRVEDAL GAEVQRQSKN MTAEEHSGEG LDLSKLPSCN
DVVKLFGQLG ELRNDLMHAG KRKNPLTANK VEERASLLYE QVKKLEVDNV VNSDCA