Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_0192 |
Symbol | |
ID | 5537653 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 227583 |
End bp | 229766 |
Gene Length | 2184 bp |
Protein Length | 727 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640892355 |
Product | hydrolase |
Protein accession | YP_001430343 |
Protein GI | 156740214 |
COG category | [R] General function prediction only |
COG ID | [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) |
TIGRFAM ID | [TIGR02577] CRISPR-associated protein, Crm2 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.928607 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAACG CAGATCGTTT CACCTTTACT CGCGCGATTG CAGCGTGCCT GGTGTATGGA ACGGGTATCG ACGCCCGTTC AGTGGAAGAC GTAGTGCGTT CGGCTTTGGA CAACGCGCCG GTCGCGCCGT CCAAAGCCAT GCTCGAGGAA TTGCACACGA CGGTGACGAC GCGACTGCAC GACGCCGGTT TGCCGGAGGA AGCGCAGCGA GTGGCGCTGG TCTACGGCGG CGCAACGAAG ATCAAGGGAT ACGTCTTCGA GTCGCCGGCG TTGCCCGAAA TTCGCGGCGC CAGTGCGCTG CTCGAGTGGG TGAATGATCA TCATCTTTCT GCGATCTGGC GTGAGACGCT CGGTGAGAGT CTCGGAGAAT CGTGCATTAT CTATGCTGGC GGGGGTAGTC TTCTCGCCTT CGCGCCAGTT GCGAAAGGTG CGGAACTCGC CGGTCGCATT GAACAGGCGT TTACCCGCCA GACCCTCACC GCCAATAGCG TCGCCGTCGC CGCAACCTTT TCGCTCCTCG AACTGCGCTA CGGACGTCGA CCGCTCGACT TTTGGGCAGA CGCCTTTCTG GATCGATGGC AGAACCTGCA CCTGCGACCG GAACTCGAAG CGTACTACTA CGCGCCGTTG CCGGGCAGCG CCGCAGCGAA TCTGAGTGAT GACGCGATCC GTACTGCCGT TCCGGGGCGC GCGCTCACAG TTGATGACAT AAGCGCCGTG CGTCGCTTTC TGAACCGCAA GCAATTCGGC GAACTGGTGA CGATCCTGGC GACTATGTTC AACCGGCGCC GCGACGAGCG CGCTTACGCT GGCGCGCCGC GCGTTCTACC GCTCTACCCG ATGACGCCAT GGGCGGAACG GTGCGCCAGC AGCGATGTGC GCCCCGCCGA ATGGCGCGGG ACAGTCGCCG ATGAAACACG CGCCTACAGC GAACCTTCGG CGCGCAAACG GTATGTCGGG CAACTGATCA AGCGCGACGA CGACGATCAG ACGCGCTGGT ACACGTCAAC GTTCGCCTGG CGCGCACCTG ACGACCTGCG CAATCGCTCG TGGGAAACAC GGTGGGAAGC GTTTCTGGCT AAGGAAGGCG CCGCTACGCC TTACCGTTGC GCCATGGATC AGCAGCCGAA TGTGACGCCG CCGCGCGACC TGAATGAGAT CGGCGCCGCT TCCCGACCAG ATCGCTACAT CGGCATCATC TACGCTGACG GGAACAACGT CGGGCGGTTA ATCGCCACCC TGTCGACCCC CGACGACTTG CACCAGACTT CTGCGCATCT GAGCACAGCC GCAACCGATG CAGTCTTCAA AGCGCTAGCG CAGTGTCTGC GACCGGCTGA GGTGCGACGC AAGCGGCAGC GTGCGGTTGT GCATCCCTTC GAGATTCTGA CTATTGGCGG TGACGATCTG CTCCTCATTG TCCCCGGCAG TCGCGCCTTC GACGTGGCGC TGGCGATTGC GAGTGAGTTT GAACGCTCCC TGGCGCAGAA CCTCCCTGCG CCATCCGACG CCTGTGCGTC GAATGCCATT CACACGCGCT ACATCCGCGA AACGCTGGTC ACGCGCGAGC CGTACACGCC GTCGGTGGGG CTTTCGGCAG GCGTGGTCGT TGCACAGGAG TCGGCGCCGA TCTTCTTCCT GCGTGATCTG GTGGAGGAAT TGCTCAAACG CGCCAAGAAA CTGGCGCGCT CCTTGACCGG ACAACGCTAC TACGGCGGCG CTGTCGATTT TATGGTGCTG AAATCGGTCA CCATGGTCGC CGACACGGTC GAAACGTTCC GCAAAGCGGC GCTGCATGAT GAGAGCGATC GGCGTCTCAC CGCTCGTCCC TACACCTGGC ACGAGTTTGC CGGTTTGCTG GAAACCGCCC GCGCACTCAA GCGCAGCCGC TTCCCGCGCT CGCAACTCTA CCGTCTGCGG CGCGTCATGG AAACAACGCC AGGGGTCATG ACCAGTTCTC TGGAATACCT CTATACCCGT GTGCGGCAGA AGGACGCCAA CACCATGCTG ATCGAACACA TCGAACAGGC ATGGCGCCAG GCGGACGCTG CGTTGCGTCG TCCGGCGACG CATCCGTGGC TGTTGCGTGC CGCAGGAGGA CACGAAACCA TCTGGTCCGA TCTCGCCGAA ATCTACGATA TGGTCTCTTT GCCGGAGGGT GAGGATGGTC AACGTACAGT TTGA
|
Protein sequence | MMNADRFTFT RAIAACLVYG TGIDARSVED VVRSALDNAP VAPSKAMLEE LHTTVTTRLH DAGLPEEAQR VALVYGGATK IKGYVFESPA LPEIRGASAL LEWVNDHHLS AIWRETLGES LGESCIIYAG GGSLLAFAPV AKGAELAGRI EQAFTRQTLT ANSVAVAATF SLLELRYGRR PLDFWADAFL DRWQNLHLRP ELEAYYYAPL PGSAAANLSD DAIRTAVPGR ALTVDDISAV RRFLNRKQFG ELVTILATMF NRRRDERAYA GAPRVLPLYP MTPWAERCAS SDVRPAEWRG TVADETRAYS EPSARKRYVG QLIKRDDDDQ TRWYTSTFAW RAPDDLRNRS WETRWEAFLA KEGAATPYRC AMDQQPNVTP PRDLNEIGAA SRPDRYIGII YADGNNVGRL IATLSTPDDL HQTSAHLSTA ATDAVFKALA QCLRPAEVRR KRQRAVVHPF EILTIGGDDL LLIVPGSRAF DVALAIASEF ERSLAQNLPA PSDACASNAI HTRYIRETLV TREPYTPSVG LSAGVVVAQE SAPIFFLRDL VEELLKRAKK LARSLTGQRY YGGAVDFMVL KSVTMVADTV ETFRKAALHD ESDRRLTARP YTWHEFAGLL ETARALKRSR FPRSQLYRLR RVMETTPGVM TSSLEYLYTR VRQKDANTML IEHIEQAWRQ ADAALRRPAT HPWLLRAAGG HETIWSDLAE IYDMVSLPEG EDGQRTV
|
| |