Gene Rcas_4181 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4181 
Symbol 
ID5541692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5410647 
End bp5412638 
Gene Length1992 bp 
Protein Length663 aa 
Translation table11 
GC content61% 
IMG OID640896292 
Productamidohydrolase 
Protein accessionYP_001434230 
Protein GI156744101 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.748776 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000471707 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGAAACCG TTGACCTTCT ACTCATCCAC GGCGCTGTGG TGACCATGGA CGCCGCAGGG 
CGCATTTTTC TCGATGGTGC GGTGGCGGTA CGCGGCAATG AGATCGTTGC GGTCGGTTCG
TCCGAAGACC TGACTGCGCG CTTTACCGCC GGCGAAACGC TCGATTGCCA GGGGTGCGCC
ATTATTCCCG GTCTGATCAA TGCCCACGCG CATGTGCCGA TGAGCCTGTT GCGCGGTCTG
GTTGCCGATC AGCAGCTCGA TGTCTGGCTG TTCGGCTACA TGTTCCCAGT CGAAAGCCGC
TTTGTCGATC CCGAGTTCGT TTTCACCGGC ACACAACTCT CGTGCGCCGA GATGATCCGG
GGCGGGACGA CGACGTTTGT GGATATGTAC TATTTCGAGG AGGAAGTGGC GCGCGCTGCC
GATCTGGCTG GGATGCGCGC GATCTGCGGG CAAACAGTAA TGCGCCTTCC CACTCCCGAT
GCCGCTTCGT TCGATGAGGG GTTGGAGCGG GCGCGCGCGT TCATTGAGCA GTGGCATGGG
CATGAGCGTA TTATTCCGAC TATCGCGCCT CACGCGCCCT ATACCTGCAC CGATACGATC
TATCGCGAGG CCGCAGCGCT CTGCCGCCGC TATGGTGTGC CGCTCGTCAC TCACCTCTCC
GAGACCGAAC GCGAAGTCGA GGAGAGTCGC CGCGAGCGTG AGGTGACGCC GATCCGCTAT
GCCAAACGGG TTGGCGCGTT CGATGGCAAG TGCATTGCCG CGCACTGCGT CCATGCGACG
GAAGACGATA TCCGCTTGCT CAAGGAAGGA CACGTTGGAG TTGTGCCCTG CCCTTCGTCG
AACCTGAAAC TTGCGAGCGG CATTGCGCCT CTCCGCCGCT TTATTGAAGC CAGCCTGCGC
GTCGGATTGG GCACTGATGG TCCTGCCTCC AACGATGATC AGGATATGTT CACCGAAATT
CACCTGGCTG CGCTGTTGCC GAAAGGAGTG AGCGGCGACC CGACGGCGGT TCCGGCGCAC
GATGCGCTGG CGCTGGCTAC ATCGTCCGGG GCGCGCGCCA TTCATCTCGA TCACCTGATC
GGCTCGCTCG AACCGGGAAA ACGCGCCGAT ATTGCGGTCG TCGCGCTGGG ACGGTTGCAT
TCCGCGCCTC GCTATCACTA CGCGCCCGAT GCGCTCTACT CACACCTGGT CTATGGCGCT
CGCTCGGCGG ATGTGCGCGA TGTGCTGGTG GACGGTCGCT TCCTCCTGCG CAACCAGCAA
CTGTTGACCA TCGATGAAGA CGATGTGTTA CGGCGCGCGC AGGCAATCGC CAACCGGATC
GATGTCTTTC TGGCGGCTCG CGAGGATAAC CTGCTCGACA AAATCCTGGC AATCGGCGGC
GTGCAGCAAT CCGAGATTTT CGAGGTTCAG GCGAAGGCGC TCATCGATCC GCAAACCGCC
GAGCGCGTTA TTCAGGCGTT GCACGAATCG GACATCACGA TCACCAAGGC GAGCGAACGC
ACGCAGTACG ACACCTACTT TCTGTGGGAC GATGAGGAAC GTGGGCGCAT CCGTATCCGG
GAAGACCACC GGACTGATCC TGGCGCACGC GTCGAACCAA AGTACACGAT CACCCTTATG
GCTCCCGCGC AGCGGGGAGA GTATCGGATG GCAGTGCTCT TCGGACGGGC GCGCTATACG
GCGCTTGCTG ATCGCACGTT GCGCTTCTAC CGTGAATACT TCCAGCCGGA TCGGATCGTC
GAGATCGAAA AGCGCCGCCG CCGCTGGCGC ATTCAGTACC GCGACGCCGA TTTCGCTGTC
AATCTCGATA CGCTGATCGG GCACGCGCGT CCCGGACCGT ACCTGGAGAT CAAGAGCCGC
ACCTGGAGCC GTAAGGACGC CGAGCATAAG GTCGAACTCA TTGGCGAACT GTTGCGTCGC
TTTGGCGTCT CCGAGGATGC ATTGATCAAA CAGGAATATG TCGAATTCGA AACAGCGGTG
GTCGAGCGCT GA
 
Protein sequence
METVDLLLIH GAVVTMDAAG RIFLDGAVAV RGNEIVAVGS SEDLTARFTA GETLDCQGCA 
IIPGLINAHA HVPMSLLRGL VADQQLDVWL FGYMFPVESR FVDPEFVFTG TQLSCAEMIR
GGTTTFVDMY YFEEEVARAA DLAGMRAICG QTVMRLPTPD AASFDEGLER ARAFIEQWHG
HERIIPTIAP HAPYTCTDTI YREAAALCRR YGVPLVTHLS ETEREVEESR REREVTPIRY
AKRVGAFDGK CIAAHCVHAT EDDIRLLKEG HVGVVPCPSS NLKLASGIAP LRRFIEASLR
VGLGTDGPAS NDDQDMFTEI HLAALLPKGV SGDPTAVPAH DALALATSSG ARAIHLDHLI
GSLEPGKRAD IAVVALGRLH SAPRYHYAPD ALYSHLVYGA RSADVRDVLV DGRFLLRNQQ
LLTIDEDDVL RRAQAIANRI DVFLAAREDN LLDKILAIGG VQQSEIFEVQ AKALIDPQTA
ERVIQALHES DITITKASER TQYDTYFLWD DEERGRIRIR EDHRTDPGAR VEPKYTITLM
APAQRGEYRM AVLFGRARYT ALADRTLRFY REYFQPDRIV EIEKRRRRWR IQYRDADFAV
NLDTLIGHAR PGPYLEIKSR TWSRKDAEHK VELIGELLRR FGVSEDALIK QEYVEFETAV
VER