Gene RoseRS_3337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3337 
Symbol 
ID5210314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4186392 
End bp4187798 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content62% 
IMG OID640596935 
Producttype I phosphodiesterase/nucleotide pyrophosphatase 
Protein accessionYP_001277648 
Protein GI148657443 
COG category[S] Function unknown 
COG ID[COG3379] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.912336 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACATC CCCGACTACT CATCATCGGT CTCGATTGCG CCGAGCCGTC GCTGGTGTTT 
GATCGCTGGC GCGCCGATCT GCCTGCCCTC AACCGCCTGA TGACAGAGGG AGTCTACGGT
GAACTGGAGA GTTGTATTCC GGCGATCACC GTCCCTGCCT GGAGTTGCAT GATGAGCGGG
CGCGACCCGG GTGAACTTGG CGTCTATGGG TTTCGCAACC GCGTTGATCG CTCCTATGGT
CGCATGGTTG TTGCCGATAG CCGTGCGATC CGGTTTCCGC GTTTGTGGGA TATTCTCGGC
GAGGCGGGAT GGCGCGTGGC AGTGATCGGC GTGCCCGGCG CCTATCCGCC GTCCGCTGTG
AATGGCGCGC TGGTTTCCTG CTTTCTTGCG CCTTCGACCG ATGTGACCTA TACCTTTCCG
CCAGCGCTCG CAGAACGTCT TGCCGTCTGG GCAGCGAAAG CGACGCCAGG GCGCGCCTAT
CTGCTCGATG TGCCCGATTT CCGCTCCGAT GACAAAGAGC GCATTGTGCG CGACATCTAC
GCCATGTGCG ATCAACGTTT CGCGGTTGCC GCAGCATTGA TCGAGGAAGA TCATCCCGAC
TTTCTCATGC TGGTGGACAT GGGGGTCGAT CGCATCCACC ACGCGCTCTG GAAGCATATG
GACCCGCGGC ATCCGTTGTT TGTGCCCGAT TCGCCCTTTG CAGATGCGAT TCATGCGTAC
TATCGCCACG TAGATGCACA GATCGCCGCT TTGCTGACGC ACTGCGGACC TGAGACGGCA
GTGCTGGTGG TGTCTGACCA TGGTGCGCGT CCGCTGATGG GTGGGGTGCG GATCAATCAA
TGGTTGATCG CGCAGGGTGA CCTGACGTTG CACACAATGC CGGACGTGCC AACCAACCTC
GATCAGGTGG ATGTTGACTG GTCGCGCACC CGCGCCTGGG GTGCGGGCGG CTACTACGGG
CGTATCTTTC TCAATGTGCG CGGGCGTGAG CCGCAGGGCG TCATTCCGCC AGCAGAGTAC
GAACGTGTGC GCGCCGACCT TGCGGCGCGT CTGGAAGCGA TGCCCGGTCC AGATGGTCAT
CCGCTCGGAA ACAGGGTCTT TGTGCCACAG CGCCTCTATC GCGTCGTGCG AGGCGTTGCC
CCTGACCTGA TCGTCTACTT CGGCGATCTT GCCTGGCGGG CAGTGGGAAC GGTTGGTGGC
GATGGGATAT TCACCCAGGA AAACGACACC GGTCCCGATG ACGCCAATCA CGCGCAGCAT
GGACTGTTCA TCTGGCGCGA CCCGCAGCGC CCCGGCGGCG GGCGGCGACT CGACAATGCG
CAGATTTACG ATATACTGCC TACCCTGTTG AGACGGTTCA ACATGCCGGT CCCTGCGGGA
CTGCGCGGTA CGATGCTGGA ACTATGA
 
Protein sequence
MTHPRLLIIG LDCAEPSLVF DRWRADLPAL NRLMTEGVYG ELESCIPAIT VPAWSCMMSG 
RDPGELGVYG FRNRVDRSYG RMVVADSRAI RFPRLWDILG EAGWRVAVIG VPGAYPPSAV
NGALVSCFLA PSTDVTYTFP PALAERLAVW AAKATPGRAY LLDVPDFRSD DKERIVRDIY
AMCDQRFAVA AALIEEDHPD FLMLVDMGVD RIHHALWKHM DPRHPLFVPD SPFADAIHAY
YRHVDAQIAA LLTHCGPETA VLVVSDHGAR PLMGGVRINQ WLIAQGDLTL HTMPDVPTNL
DQVDVDWSRT RAWGAGGYYG RIFLNVRGRE PQGVIPPAEY ERVRADLAAR LEAMPGPDGH
PLGNRVFVPQ RLYRVVRGVA PDLIVYFGDL AWRAVGTVGG DGIFTQENDT GPDDANHAQH
GLFIWRDPQR PGGGRRLDNA QIYDILPTLL RRFNMPVPAG LRGTMLEL