Gene RoseRS_4317 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4317 
Symbol 
ID5211301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5419021 
End bp5421012 
Gene Length1992 bp 
Protein Length663 aa 
Translation table11 
GC content63% 
IMG OID640597902 
Productamidohydrolase 
Protein accessionYP_001278606 
Protein GI148658401 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.866715 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGACTG TCGATATTCT GCTTGTTCAT GGCGCGGTGG TGACCATGGA CTCCGCGTGG 
CGCATCTTTC TCGACGGGGC AGTCGCCGTG CGCGGCAACG AGATTGTCGC CGTCGGTCCT
TCCGCCGACC TCACGGCGCG GTTCAGCGCA CGCGAAACCG TCGATTGCCG GGGATGCGCG
ATCATCCCCG GTCTGATCAA TGCCCACGCA CACGTGCCGA TGAGCCTGTT GCGCGGTCTG
GTCGCCGATC AACAACTCGA TGTCTGGCTC TTCGGGTATA TGTTCCCGGT CGAGAGCCGC
TTTGTCGATC CGGAGTTTGT TTTCACCGGT ACGCAACTCT CGTGCGCCGA GATGATCCGC
GGCGGGACGA CGACCTTCGT CGATATGTAC TATTTCGAAG AAGAGGTCGC CCGCGCCGCC
GACCTTGCCG GTATGCGCGC GATCTGCGGG CAGACGGTGA TGCGCCTGCC CACCCCCGAT
GCGGCGTCCT TCGATGAGGG GTTGGAGCGC GCGCGCATGT TCATCGAACA GTGGCACGGG
CATGAGCGGA TCATTCCAAC CATTGCGCCC CATGCCCCCT ATACCTGCAC CGATACGATC
TACCGTGAAG CGGCTGCACT CTGCCGTCGC TACGGCGTGC CGCTGGTCAC CCACCTCTCG
GAAACCGAAC GCGAGGTCGA GGAGAGTCGT CAGGAGCGCG AGGTGACCCC TATTCGCTAC
GCCAGACGGG TTGGCGCGTT TGATGGCAAG TGCATCGCGG CGCACTGCGT CCACGCAACC
GAAGATGATA TCCGGTTGCT GCGCGAGGGA CACGTCGGGG TCGTCCCCTG CCCATCATCG
AACCTGAAAC TTGCCAGCGG CATCGCTCCC ATCCGACGCT TCATCGAAGC CGGTCTGCGC
GTGGGTCTGG GCACCGATGG ACCGGCATCG AACGATGATC AGGATATGTT TACCGAGGTT
CATCTGGCTG CCCTGCTGCC AAAAGGGGTG AGCGGCGATC CGACGGCAGT GCCGGCACGC
GATGCGCTGG CGCTGGCAAC ATCATCTGGC GCGCGGGCGA TCCATCTCGA CCACCTGATC
GGGTCGCTCG AAGCAGGGAA ACGCGCCGAC ATCGCAGTTG TCGCGCTGGG GCGGCTCCAT
TCCGCGCCGC GCTATCACTA CGCGCCCGAC GCGCTCTATT CACATCTGGT CTACGGCGCG
CGCTCGGCGG ATGTGCGCGA CGTTTTGGTG GATGGGCGCT TCCTTCTGCG CAATCAGACG
CTGCTGACGA TCGATGAGGA AGATGTGTTG CGCCGCGCGC AGATCATCGC CGACCGGATC
GATGTGTTCC TGGCTGCGCG GGAAGACAAC CTGCTCGATA AGATCCTGGC AATCGGCGGC
GTTCAGCAAT CCGAGATTTT CGAGGTGCAG GCGAAGGCGC CAATCGACCC GCAAACCGCC
GAACGTGTCA TTCAGTCGCT CTACGAACCC GGCATTACGA TTACCAAGGC GAGTGAACGC
ACCCAGTACG ATACCTACTT CCTGTGGGAC GACGAAGAGC GTGGGCGTAT CCGCATTCGT
GAAGATCACC GTACCGATCC AGGCGCGCGC GCCGAGCCAA AGTACACCAT CACCCTGATG
GCGCCCGCCC TGCGTGGCGA ATACCAGTCG GCGATCCTGT TCGGTCGCGC CCGCTACACC
GCCCGCGCCG ACCGTACCCT GCGCTTCTAC CGCGAGTACT TCCAGCCAGA TCGGATCGTC
GAGATCGAAA AGCGCCGCCG CCGCTGGCGT ATTCAGTACC GCGACGCCGA TTTTGCAGTC
AATCTCGACA CCCTGATCGG GCACGCACGC CCCGGACCGT ACCTGGAAAT CAAGAGTCGC
ACCTGGAGTC GGAAGGACGC CGAACACAAG GTGGAACTCA TCGGTGAACT GCTGCGACGC
TTCGGCGTTC CCGAAGATGC GCTGATCAGG CAGGAGTACG TTGAACTCGA ACTGGCGAGT
GTTGAACGGT GA
 
Protein sequence
METVDILLVH GAVVTMDSAW RIFLDGAVAV RGNEIVAVGP SADLTARFSA RETVDCRGCA 
IIPGLINAHA HVPMSLLRGL VADQQLDVWL FGYMFPVESR FVDPEFVFTG TQLSCAEMIR
GGTTTFVDMY YFEEEVARAA DLAGMRAICG QTVMRLPTPD AASFDEGLER ARMFIEQWHG
HERIIPTIAP HAPYTCTDTI YREAAALCRR YGVPLVTHLS ETEREVEESR QEREVTPIRY
ARRVGAFDGK CIAAHCVHAT EDDIRLLREG HVGVVPCPSS NLKLASGIAP IRRFIEAGLR
VGLGTDGPAS NDDQDMFTEV HLAALLPKGV SGDPTAVPAR DALALATSSG ARAIHLDHLI
GSLEAGKRAD IAVVALGRLH SAPRYHYAPD ALYSHLVYGA RSADVRDVLV DGRFLLRNQT
LLTIDEEDVL RRAQIIADRI DVFLAAREDN LLDKILAIGG VQQSEIFEVQ AKAPIDPQTA
ERVIQSLYEP GITITKASER TQYDTYFLWD DEERGRIRIR EDHRTDPGAR AEPKYTITLM
APALRGEYQS AILFGRARYT ARADRTLRFY REYFQPDRIV EIEKRRRRWR IQYRDADFAV
NLDTLIGHAR PGPYLEIKSR TWSRKDAEHK VELIGELLRR FGVPEDALIR QEYVELELAS
VER