Gene Information |
Locus tag | RoseRS_4317 |
Symbol | |
ID | 5211301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 5419021 |
End bp | 5421012 |
Gene Length | 1992 bp |
Protein Length | 663 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640597902 |
Product | amidohydrolase |
Protein accession | YP_001278606 |
Protein GI | 148658401 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
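
The length fields in this table can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Neither convention is stated in the record, but both reproduce the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 5419021
end_bp = 5421012

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1992
print(protein_length)  # 663
```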
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.866715 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGACTG TCGATATTCT GCTTGTTCAT GGCGCGGTGG TGACCATGGA CTCCGCGTGG CGCATCTTTC TCGACGGGGC AGTCGCCGTG CGCGGCAACG AGATTGTCGC CGTCGGTCCT TCCGCCGACC TCACGGCGCG GTTCAGCGCA CGCGAAACCG TCGATTGCCG GGGATGCGCG ATCATCCCCG GTCTGATCAA TGCCCACGCA CACGTGCCGA TGAGCCTGTT GCGCGGTCTG GTCGCCGATC AACAACTCGA TGTCTGGCTC TTCGGGTATA TGTTCCCGGT CGAGAGCCGC TTTGTCGATC CGGAGTTTGT TTTCACCGGT ACGCAACTCT CGTGCGCCGA GATGATCCGC GGCGGGACGA CGACCTTCGT CGATATGTAC TATTTCGAAG AAGAGGTCGC CCGCGCCGCC GACCTTGCCG GTATGCGCGC GATCTGCGGG CAGACGGTGA TGCGCCTGCC CACCCCCGAT GCGGCGTCCT TCGATGAGGG GTTGGAGCGC GCGCGCATGT TCATCGAACA GTGGCACGGG CATGAGCGGA TCATTCCAAC CATTGCGCCC CATGCCCCCT ATACCTGCAC CGATACGATC TACCGTGAAG CGGCTGCACT CTGCCGTCGC TACGGCGTGC CGCTGGTCAC CCACCTCTCG GAAACCGAAC GCGAGGTCGA GGAGAGTCGT CAGGAGCGCG AGGTGACCCC TATTCGCTAC GCCAGACGGG TTGGCGCGTT TGATGGCAAG TGCATCGCGG CGCACTGCGT CCACGCAACC GAAGATGATA TCCGGTTGCT GCGCGAGGGA CACGTCGGGG TCGTCCCCTG CCCATCATCG AACCTGAAAC TTGCCAGCGG CATCGCTCCC ATCCGACGCT TCATCGAAGC CGGTCTGCGC GTGGGTCTGG GCACCGATGG ACCGGCATCG AACGATGATC AGGATATGTT TACCGAGGTT CATCTGGCTG CCCTGCTGCC AAAAGGGGTG AGCGGCGATC CGACGGCAGT GCCGGCACGC GATGCGCTGG CGCTGGCAAC ATCATCTGGC GCGCGGGCGA TCCATCTCGA CCACCTGATC GGGTCGCTCG AAGCAGGGAA ACGCGCCGAC ATCGCAGTTG TCGCGCTGGG GCGGCTCCAT TCCGCGCCGC GCTATCACTA CGCGCCCGAC GCGCTCTATT CACATCTGGT CTACGGCGCG CGCTCGGCGG ATGTGCGCGA CGTTTTGGTG GATGGGCGCT TCCTTCTGCG CAATCAGACG CTGCTGACGA TCGATGAGGA AGATGTGTTG CGCCGCGCGC AGATCATCGC CGACCGGATC GATGTGTTCC TGGCTGCGCG GGAAGACAAC CTGCTCGATA AGATCCTGGC AATCGGCGGC GTTCAGCAAT CCGAGATTTT CGAGGTGCAG GCGAAGGCGC CAATCGACCC GCAAACCGCC GAACGTGTCA TTCAGTCGCT CTACGAACCC GGCATTACGA TTACCAAGGC GAGTGAACGC ACCCAGTACG ATACCTACTT CCTGTGGGAC GACGAAGAGC GTGGGCGTAT CCGCATTCGT GAAGATCACC GTACCGATCC AGGCGCGCGC GCCGAGCCAA AGTACACCAT CACCCTGATG GCGCCCGCCC TGCGTGGCGA ATACCAGTCG GCGATCCTGT TCGGTCGCGC CCGCTACACC GCCCGCGCCG ACCGTACCCT GCGCTTCTAC CGCGAGTACT TCCAGCCAGA TCGGATCGTC GAGATCGAAA AGCGCCGCCG CCGCTGGCGT ATTCAGTACC GCGACGCCGA TTTTGCAGTC 
AATCTCGACA CCCTGATCGG GCACGCACGC CCCGGACCGT ACCTGGAAAT CAAGAGTCGC ACCTGGAGTC GGAAGGACGC CGAACACAAG GTGGAACTCA TCGGTGAACT GCTGCGACGC TTCGGCGTTC CCGAAGATGC GCTGATCAGG CAGGAGTACG TTGAACTCGA ACTGGCGAGT GTTGAACGGT GA
|
Protein sequence | METVDILLVH GAVVTMDSAW RIFLDGAVAV RGNEIVAVGP SADLTARFSA RETVDCRGCA IIPGLINAHA HVPMSLLRGL VADQQLDVWL FGYMFPVESR FVDPEFVFTG TQLSCAEMIR GGTTTFVDMY YFEEEVARAA DLAGMRAICG QTVMRLPTPD AASFDEGLER ARMFIEQWHG HERIIPTIAP HAPYTCTDTI YREAAALCRR YGVPLVTHLS ETEREVEESR QEREVTPIRY ARRVGAFDGK CIAAHCVHAT EDDIRLLREG HVGVVPCPSS NLKLASGIAP IRRFIEAGLR VGLGTDGPAS NDDQDMFTEV HLAALLPKGV SGDPTAVPAR DALALATSSG ARAIHLDHLI GSLEAGKRAD IAVVALGRLH SAPRYHYAPD ALYSHLVYGA RSADVRDVLV DGRFLLRNQT LLTIDEEDVL RRAQIIADRI DVFLAAREDN LLDKILAIGG VQQSEIFEVQ AKAPIDPQTA ERVIQSLYEP GITITKASER TQYDTYFLWD DEERGRIRIR EDHRTDPGAR AEPKYTITLM APALRGEYQS AILFGRARYT ARADRTLRFY REYFQPDRIV EIEKRRRRWR IQYRDADFAV NLDTLIGHAR PGPYLEIKSR TWSRKDAEHK VELIGELLRR FGVPEDALIR QEYVELELAS VER
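
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned and checked against the protein sequence follows. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon table is written out by hand; translation table 11 is assumed to assign the same amino acids to codons as the standard code. Only the first three blocks and the final twelve bases of the gene sequence are used here.

```python
from itertools import product

# Amino acids in NCBI order for codons built from T, C, A, G (last base varies fastest).
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks and last twelve bases of the gene sequence in this record.
head = "ATGGAGACTG TCGATATTCT GCTTGTTCAT"
tail = "GTTGAACGGT GA"

print(translate(head))  # METVDILLVH
print(translate(tail))  # VER*
```

The two fragments translate to the first ten and last three residues of the listed protein sequence, followed by the stop codon. This suggests the gene sequence is given in the coding orientation even though the gene lies on the minus strand.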