Gene RoseRS_0465 details


Gene Information

Locus tag: RoseRS_0465
Symbol:
ID: 5207401
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 591861
End bp: 593261
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 48%
IMG OID: 640594085
Product: type I phosphodiesterase/nucleotide pyrophosphatase
Protein accession: YP_001274840
Protein GI: 148654635
COG category: [S] Function unknown
COG ID: [COG3379] Uncharacterized conserved protein
TIGRFAM ID:
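
The start, end, gene length, and protein length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(591861, 593261)
print(length, protein_length_aa(length))  # prints: 1401 466
```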


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.267369
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000014976
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGCCCGTAC ATCGACGCAT ACTGATCATC GGCCTCGATT GTGCTGAACC CGAACTGGTG 
TTCGACAAAT GGCGCAACGA ACTGTCAACG ATAGATAGTC TCATGCGGCA GGGTGTCTAC
ACACGTCTGG AGAGCTGCAT TCCGGCTATC ACGGTACCCG CCTGGGCCTG TATGATGAGT
AGCCATGATC CGGGGCAACT AGGTATCTAC GGTTTTCGCA ATCGTGCCGA TCACAGTTAT
GCTCGAATGA GCGTAGCAAA CTCTCGTTCG ATCACTGTTC CGCGTCTGTG GGATATACTC
GGCAGGGTGG GGCGCCGTGT TGGTGTGGTT GGCGTTCCTC AGACCTATCC CGTCACATCT
GTTAATGGCG ACCTGGTCAG TTGTTTTCTG ACGCCCAATG CACAAGTGCA GTTTACATAT
CCGCTATCCC TCAAGCACGA TATTGCTCGC TGGATTGAGG GCGAATTCTT GATGGATGTG
CCGAATTTTC GGTCTGAAGA CAAGGAATCC ATCCTCAAGA ATATCTATCG AATGACGGAT
CAGCATTTTA CTGTTTGCAG GAGGTTGTTG GAGCGAAAGC GGTACGATTT GTTTATGACG
GTTGATATGG GAGTAGATAG AATCCACCAT GCCTTTTGGA AACATATGGA CCCTCGACAT
CCCAAGTACA TCCCAGAAAG TTCGTTTGCG TATGCTATTC GCGATTATTA TCGTTTTGTC
GACGAAAAGA TTGCAGAACT ATTAAGACTG GTTGATGACG ATACTATTGT GCTTATTGTC
TCTGATCATG GCGCGAAAGC GATGAAAGGT GGTTTTTGTT TGAACGAATG GCTTATTAAT
GAAGGTTATC TGGTACTGAC TGAATATCCC GATACCCCGA TGTCACTTGA ACAATGTCGC
GTAGACTGGT CGCATACACA TGCATGGGGT GCAGGTGGAT ATTATGGTCG TCTTTTCCTC
AACATTGCGG GCCGTGATCC AGATGGTATT GTTGCACCGG GCGATGCAGA TCGATTGCGT
AGTGAAATTA CGACAAAACT GGAGTCATTG CTCGATCATA ACGGAACGAT AATGGGAACA
CGTGTATTTT GGCCCGAAGC GATCTATCAT GAAGTTCGAG GTTTTGCTCC CGATCTTATT
ATCTATTTTG GTGATCTGAA CTGGCGATCA GTAGGAAGCA TCGGAGGGAA AACATTATAC
ACTTTTGAGA ACGACACTGG TCCTGACGAT GCTAATCATG CACAATATGG CATCTTTATC
CTCTACGACC CGCGGCAAGC GGGTGGTGGA CGTTATATCG ACACCATGAG CATTTACGAT
GTTGCGCCAA CCTTGCTTCA CCTTCTTGAT ATGCAGGCGC CATCCAGTAT GATAGGAAAG
GTGCGGGATC TGTGGGCTTG A
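
The GC content field in the gene information can be recomputed from the gene sequence above. The sketch below is one way to do it, assuming GC content is defined as the percentage of G and C bases over the whole gene sequence (the record does not state how its figure was derived). The helper names are hypothetical; `clean_sequence` removes the spaces and line breaks of the 10-base block layout.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence; paste the full block to compare with the listed value.
first_line = "ATGCCCGTAC ATCGACGCAT ACTGATCATC GGCCTCGATT GTGCTGAACC CGAACTGGTG"
print(round(gc_percent(first_line), 1))
```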
 
Protein sequence
MPVHRRILII GLDCAEPELV FDKWRNELST IDSLMRQGVY TRLESCIPAI TVPAWACMMS 
SHDPGQLGIY GFRNRADHSY ARMSVANSRS ITVPRLWDIL GRVGRRVGVV GVPQTYPVTS
VNGDLVSCFL TPNAQVQFTY PLSLKHDIAR WIEGEFLMDV PNFRSEDKES ILKNIYRMTD
QHFTVCRRLL ERKRYDLFMT VDMGVDRIHH AFWKHMDPRH PKYIPESSFA YAIRDYYRFV
DEKIAELLRL VDDDTIVLIV SDHGAKAMKG GFCLNEWLIN EGYLVLTEYP DTPMSLEQCR
VDWSHTHAWG AGGYYGRLFL NIAGRDPDGI VAPGDADRLR SEITTKLESL LDHNGTIMGT
RVFWPEAIYH EVRGFAPDLI IYFGDLNWRS VGSIGGKTLY TFENDTGPDD ANHAQYGIFI
LYDPRQAGGG RYIDTMSIYD VAPTLLHLLD MQAPSSMIGK VRDLWA
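
The protein sequence above follows from the gene sequence by codon translation. The sketch below builds a codon table by hand, assuming the codon-to-amino-acid assignments of translation table 11 match the standard code (the two differ in which codons may act as start codons, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative only.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence from this record.
print(translate("ATGCCCGTAC ATCGACGCAT ACTGATCATC GGCCTCGATT GTGCTGAACC CGAACTGGTG"))
print(translate("GTGCGGGATC TGTGGGCTTG A"))
```

The two calls print `MPVHRRILIIGLDCAEPELV` and `VRDLWA`, which match the first 20 and last 6 residues of the protein sequence listed above.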