Gene Rcas_3042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3042 
Symbol 
ID5540538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3942789 
End bp3943814 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content61% 
IMG OID640895161 
Productinosine/uridine-preferring nucleoside hydrolase 
Protein accessionYP_001433114 
Protein GI156742985 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1957] Inosine-uridine nucleoside N-ribohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACGC GCGTCATTCT CGACACAGAC CCCGGCATCG ATGATTCGCT GGCCATTCTG 
CTGGCAGTCG CCTCCCCCGA AGTCGAGCTT GCAGGCGTTA CCGTGACGAG CGGCAACTGT
CCGCTCGCCG ACGGCGTGCG CAATGCGCGC AATGTGCTGG CGCTCGCCGG TCGTTCCGAT
ATTCCGGTGT GCGGCGGCGT GTCATTGCCG CTCATCCGAC CGCTCTACAC CGCGCCCGAA
ACCCACGGCG AAAGCGGCGT CGGTTTTGCC CGCCCGCCGG AGTCGCCCGC CCCGCTCCAC
AGAGAGAACG GTGTGGACCT GATCATTCGT GAAATCCTGG AGCATCCTGG CGAGGTGACG
CTCGTGGCCG TCGCGCCGCT CACGAATGTT GCTATTGCGG TGCGCAAAGA GCCACGCATC
ATTAATGCCG TGCGTGAGGT CATTATTATG GGAGGCGCGC TGCGCGCCGA TGGCAATACC
ACCTCCCTGG CGGAGTTCAA TTTTTATGTT GATCCGCACG CCGCACATAT CGTTCTTGAA
AGCGGTATGC CGATTACGCT GCTGCCGTGG GACATTACGC AGCACATTAT TCTCACGCAG
GCGGATGTTG ATCGCCTGAA CCGCATCTCG TCGCCAATCA CCCGCTTCAT TGCCGATGCC
ACTCGTTTCT ACATCGAGTT TCATCTGGCA GCCTTCGGAT TTGCGGGGTG CTCAATCAAC
GATCCGGCGG CGCTGGCACT GGCGTTTCTG CCCGATCTGG CGCGCACCGA ACCAATGCAT
GTGGCAGTCG AGTATACGAG CGAACTGACC GCCGGCAAGA GCGTCATCAG TTACATCGGA
CCGGCGACCC GTGAGCCGGA TGCGCACGAC CTGACCGGCT ACGACACCGC CCGATGGCCA
CCGCAGTGGC GACACGCATT CCGCCCGGCG CCAAACGTGC GCGCAGTCGT TGACTTCGAT
ACGCAACGTT TTCTCAATCT CTTTGTCGAA CGCATGGAAC ACCTGGCACA GACCGTATCT
GTATGA
 
Protein sequence
MTTRVILDTD PGIDDSLAIL LAVASPEVEL AGVTVTSGNC PLADGVRNAR NVLALAGRSD 
IPVCGGVSLP LIRPLYTAPE THGESGVGFA RPPESPAPLH RENGVDLIIR EILEHPGEVT
LVAVAPLTNV AIAVRKEPRI INAVREVIIM GGALRADGNT TSLAEFNFYV DPHAAHIVLE
SGMPITLLPW DITQHIILTQ ADVDRLNRIS SPITRFIADA TRFYIEFHLA AFGFAGCSIN
DPAALALAFL PDLARTEPMH VAVEYTSELT AGKSVISYIG PATREPDAHD LTGYDTARWP
PQWRHAFRPA PNVRAVVDFD TQRFLNLFVE RMEHLAQTVS V