Gene Rcas_0192 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0192 
Symbol 
ID5537653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp227583 
End bp229766 
Gene Length2184 bp 
Protein Length727 aa 
Translation table11 
GC content62% 
IMG OID640892355 
Producthydrolase 
Protein accessionYP_001430343 
Protein GI156740214 
COG category[R] General function prediction only 
COG ID[COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) 
TIGRFAM ID[TIGR02577] CRISPR-associated protein, Crm2 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.928607 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAACG CAGATCGTTT CACCTTTACT CGCGCGATTG CAGCGTGCCT GGTGTATGGA 
ACGGGTATCG ACGCCCGTTC AGTGGAAGAC GTAGTGCGTT CGGCTTTGGA CAACGCGCCG
GTCGCGCCGT CCAAAGCCAT GCTCGAGGAA TTGCACACGA CGGTGACGAC GCGACTGCAC
GACGCCGGTT TGCCGGAGGA AGCGCAGCGA GTGGCGCTGG TCTACGGCGG CGCAACGAAG
ATCAAGGGAT ACGTCTTCGA GTCGCCGGCG TTGCCCGAAA TTCGCGGCGC CAGTGCGCTG
CTCGAGTGGG TGAATGATCA TCATCTTTCT GCGATCTGGC GTGAGACGCT CGGTGAGAGT
CTCGGAGAAT CGTGCATTAT CTATGCTGGC GGGGGTAGTC TTCTCGCCTT CGCGCCAGTT
GCGAAAGGTG CGGAACTCGC CGGTCGCATT GAACAGGCGT TTACCCGCCA GACCCTCACC
GCCAATAGCG TCGCCGTCGC CGCAACCTTT TCGCTCCTCG AACTGCGCTA CGGACGTCGA
CCGCTCGACT TTTGGGCAGA CGCCTTTCTG GATCGATGGC AGAACCTGCA CCTGCGACCG
GAACTCGAAG CGTACTACTA CGCGCCGTTG CCGGGCAGCG CCGCAGCGAA TCTGAGTGAT
GACGCGATCC GTACTGCCGT TCCGGGGCGC GCGCTCACAG TTGATGACAT AAGCGCCGTG
CGTCGCTTTC TGAACCGCAA GCAATTCGGC GAACTGGTGA CGATCCTGGC GACTATGTTC
AACCGGCGCC GCGACGAGCG CGCTTACGCT GGCGCGCCGC GCGTTCTACC GCTCTACCCG
ATGACGCCAT GGGCGGAACG GTGCGCCAGC AGCGATGTGC GCCCCGCCGA ATGGCGCGGG
ACAGTCGCCG ATGAAACACG CGCCTACAGC GAACCTTCGG CGCGCAAACG GTATGTCGGG
CAACTGATCA AGCGCGACGA CGACGATCAG ACGCGCTGGT ACACGTCAAC GTTCGCCTGG
CGCGCACCTG ACGACCTGCG CAATCGCTCG TGGGAAACAC GGTGGGAAGC GTTTCTGGCT
AAGGAAGGCG CCGCTACGCC TTACCGTTGC GCCATGGATC AGCAGCCGAA TGTGACGCCG
CCGCGCGACC TGAATGAGAT CGGCGCCGCT TCCCGACCAG ATCGCTACAT CGGCATCATC
TACGCTGACG GGAACAACGT CGGGCGGTTA ATCGCCACCC TGTCGACCCC CGACGACTTG
CACCAGACTT CTGCGCATCT GAGCACAGCC GCAACCGATG CAGTCTTCAA AGCGCTAGCG
CAGTGTCTGC GACCGGCTGA GGTGCGACGC AAGCGGCAGC GTGCGGTTGT GCATCCCTTC
GAGATTCTGA CTATTGGCGG TGACGATCTG CTCCTCATTG TCCCCGGCAG TCGCGCCTTC
GACGTGGCGC TGGCGATTGC GAGTGAGTTT GAACGCTCCC TGGCGCAGAA CCTCCCTGCG
CCATCCGACG CCTGTGCGTC GAATGCCATT CACACGCGCT ACATCCGCGA AACGCTGGTC
ACGCGCGAGC CGTACACGCC GTCGGTGGGG CTTTCGGCAG GCGTGGTCGT TGCACAGGAG
TCGGCGCCGA TCTTCTTCCT GCGTGATCTG GTGGAGGAAT TGCTCAAACG CGCCAAGAAA
CTGGCGCGCT CCTTGACCGG ACAACGCTAC TACGGCGGCG CTGTCGATTT TATGGTGCTG
AAATCGGTCA CCATGGTCGC CGACACGGTC GAAACGTTCC GCAAAGCGGC GCTGCATGAT
GAGAGCGATC GGCGTCTCAC CGCTCGTCCC TACACCTGGC ACGAGTTTGC CGGTTTGCTG
GAAACCGCCC GCGCACTCAA GCGCAGCCGC TTCCCGCGCT CGCAACTCTA CCGTCTGCGG
CGCGTCATGG AAACAACGCC AGGGGTCATG ACCAGTTCTC TGGAATACCT CTATACCCGT
GTGCGGCAGA AGGACGCCAA CACCATGCTG ATCGAACACA TCGAACAGGC ATGGCGCCAG
GCGGACGCTG CGTTGCGTCG TCCGGCGACG CATCCGTGGC TGTTGCGTGC CGCAGGAGGA
CACGAAACCA TCTGGTCCGA TCTCGCCGAA ATCTACGATA TGGTCTCTTT GCCGGAGGGT
GAGGATGGTC AACGTACAGT TTGA
 
Protein sequence
MMNADRFTFT RAIAACLVYG TGIDARSVED VVRSALDNAP VAPSKAMLEE LHTTVTTRLH 
DAGLPEEAQR VALVYGGATK IKGYVFESPA LPEIRGASAL LEWVNDHHLS AIWRETLGES
LGESCIIYAG GGSLLAFAPV AKGAELAGRI EQAFTRQTLT ANSVAVAATF SLLELRYGRR
PLDFWADAFL DRWQNLHLRP ELEAYYYAPL PGSAAANLSD DAIRTAVPGR ALTVDDISAV
RRFLNRKQFG ELVTILATMF NRRRDERAYA GAPRVLPLYP MTPWAERCAS SDVRPAEWRG
TVADETRAYS EPSARKRYVG QLIKRDDDDQ TRWYTSTFAW RAPDDLRNRS WETRWEAFLA
KEGAATPYRC AMDQQPNVTP PRDLNEIGAA SRPDRYIGII YADGNNVGRL IATLSTPDDL
HQTSAHLSTA ATDAVFKALA QCLRPAEVRR KRQRAVVHPF EILTIGGDDL LLIVPGSRAF
DVALAIASEF ERSLAQNLPA PSDACASNAI HTRYIRETLV TREPYTPSVG LSAGVVVAQE
SAPIFFLRDL VEELLKRAKK LARSLTGQRY YGGAVDFMVL KSVTMVADTV ETFRKAALHD
ESDRRLTARP YTWHEFAGLL ETARALKRSR FPRSQLYRLR RVMETTPGVM TSSLEYLYTR
VRQKDANTML IEHIEQAWRQ ADAALRRPAT HPWLLRAAGG HETIWSDLAE IYDMVSLPEG
EDGQRTV