Gene Rcas_1981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1981 
Symbol 
ID5539459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2537003 
End bp2538433 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content65% 
IMG OID640894116 
ProductCRISPR-associated RAMP Csm5 family protein 
Protein accessionYP_001432087 
Protein GI156741958 
COG category[L] Replication, recombination and repair 
COG ID[COG1332] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) 
TIGRFAM ID[TIGR01899] CRISPR-associated RAMP protein, Csm5 family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.244308 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATTGG TGCGCAACCA GGTCGTTTCC CTGACCGTCA CTACCATCAC GCCGCTCCAC 
ATCGGCACAG GCGACCGGCT GGCGGCAAAC CTGGATTACT ATGTGGACGG CGACGCCACG
CTGGTGATCA ATGCTGACGC TGCGCTCGAA TTGGTGGTCG AAACCTGGCA GCAGCGCCGC
GTGCCGTATG AGGAGCAACT GCGGCGCTTC AACGCAGAAC TGGCGCAGGA AGAAGAACGC
ATCCGCCGGG CGGACGAGCG TCTGCGTCGT CAGATTGAGC AGTTCGAGGA GAGTCCGCCG
CGGCGCAAGG ACGAGTATGA ACGTCAGGCG AACCGTTTTC GTGAGGAGGC GCAGCGTCTG
AAGGAACGCA AGCAGCGCCT GGCGGAGCAC AGCGCGAATC CGCCGCAGCC GGACGATACC
GGCGATCTGT TGCCGCCGGA ATTGATCGCC GGCAGCACGT TCGACCAACT GGTTGATGGC
GGGTTGTTGC CACGCGACCG GTTGCGTGAA CGCGCAACGG TCAACGGGCG TCCGCTGGTG
CGCTATGCGC TGAACGGACG CCCGGCGTCC GGCGAGGTCT ACGAGCAGAT TAAGGATGTC
GCCGACCGAC TCTATCTGCC GGGGTCGTCG CTCAAAGGCG CAATCCGCAG CGCGCTGGCG
TGGGACATGG CGCACAGTCC GGCGGTCGCG GCGCTTCAAC ATGCGGTGAA GGGGGGGCCG
AAAAACGCTG ATGACGCCAT CGAACAGGAG GTGTTTCTCG GCACGCTGCG AACGCAACGC
CGTATCAACA ACACGGTGCG TGACGTGCTG CGCGCGCTCC GCATCGGTGA CAGCGCGCCG
GTCGCAGTTG CGCCCGATCT GCTGGCGGTG CGCATCTACC GCAGCCGGTC GGCGCAGGGA
TTGATTGCGC TCGAAGCCAT CCCCGTCGAT GTCGAGTTTC GCGCAGCGCT TCAGATCGAA
CAGTATCCGT TCGAGAGCGG GGTCGCGCGC GCCGTGATCG ACTTCGGCGA TTGGCAGCGC
CGGTTGCAGC CGGATGAACT TGCGGCAGCG TGCCGACGGC GCGCCGGGCG CCTGATCGAC
GGCGAACTCG CATATTTCAA CCGCCAGACC GACGCCGCCG AACTGGTCCG CTTCTATGCC
GATCTGCGCG CGCGCCTGGA AAGGATGGAT GCGCGCGCGT TTCTGTTGCC CATCGGCTGG
GGCGCCGGTT GGCGCTCCAA GACCCTCGAC GACCGGTTGC GCCAGGGGAC GGATCGTGAC
AATGCGTTTG CGCAAATCGT TCAACGTCAC ACCCTCAAAA AGCACAAATC CGCCGGTTTT
CGCCCCGGCG ACGCTTTCCC GGAGACGCGC AAAGTCATCA TGCGCGGCGC ATTACCCTGG
CGACCGCTTG GGTGGGTCGA GGCGCGCTTC GATCTGAACG GTGAACGTTG A
 
Protein sequence
MALVRNQVVS LTVTTITPLH IGTGDRLAAN LDYYVDGDAT LVINADAALE LVVETWQQRR 
VPYEEQLRRF NAELAQEEER IRRADERLRR QIEQFEESPP RRKDEYERQA NRFREEAQRL
KERKQRLAEH SANPPQPDDT GDLLPPELIA GSTFDQLVDG GLLPRDRLRE RATVNGRPLV
RYALNGRPAS GEVYEQIKDV ADRLYLPGSS LKGAIRSALA WDMAHSPAVA ALQHAVKGGP
KNADDAIEQE VFLGTLRTQR RINNTVRDVL RALRIGDSAP VAVAPDLLAV RIYRSRSAQG
LIALEAIPVD VEFRAALQIE QYPFESGVAR AVIDFGDWQR RLQPDELAAA CRRRAGRLID
GELAYFNRQT DAAELVRFYA DLRARLERMD ARAFLLPIGW GAGWRSKTLD DRLRQGTDRD
NAFAQIVQRH TLKKHKSAGF RPGDAFPETR KVIMRGALPW RPLGWVEARF DLNGER