Gene RoseRS_0819 details


Gene Information

Locus tag: RoseRS_0819
Symbol:
ID: 5207762
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 1011750
End bp: 1013252
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 61%
IMG OID: 640594435
Product: Ppx/GppA phosphatase
Protein accession: YP_001275183
Protein GI: 148654978
COG category: [F] Nucleotide transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG0248] Exopolyphosphatase
TIGRFAM ID:
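
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are made up for this example, and it assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 1011750
end_bp = 1013252
gene_length_bp = 1503
protein_length_aa = 500

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codons = span_bp // 3
coded_residues = codons - 1

print(span_bp, span_bp % 3, coded_residues)  # prints: 1503 0 500
```

Under those assumptions the span matches the listed gene length, the length is a whole number of codons, and the residue count matches the listed protein length.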


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAAAC GCATCGGCAT TATCGATCTG GGATCGAACA CCACGCGCAT GATCGTGATG 
GGGTACACGC CACATCACTC CTTCCGCCTG CTCGACGAAG TGCGCGAAAG CGTGCGCCTG
GCGGAGGGCA TCGGTCCCGA CGGTCGACTG AAACCGGCAG CGATGGATCG GGCGGTTGCG
ACGATGCGTT TGTTCCATAC GCTGAGTCAC TCGGAGCATG TGCAGACGAT CGTTCCGGTT
GCGACGAGCG CCGTCCGCGA GGCGACCAAT CAGGCGGAAT TCATTGCCCG TCTTGCCGCC
GAAGCGGGTC TGTCGATGCG GGTGCTCAGC GCGCGCGAGG AGGCATACTA CGGTTACCTC
GGCGTCGTCA ATTCGATCGA CCTGCGCGAT GGGTTCGTCA TCGATATTGG CGGCGGTAGC
ACGCAGGTGT CGCAGGTGCG CGGGCGCGGC TTCCTGCGCT GGTTCAGTCA GCCTATCGGT
GCGTTGCGCG CAATGGAACG GTTCGTTCAC TCCGACCCGA TCAGCCCCAA GGATTTTCGC
GCGCTCGAAG CGGGCATCGC CGACTATTTT GCCCCGCTCG ATTGGTTGAG CGTCGATCAG
GGTCCGATGC TGGTCGGCAT TGGCGGCACC ATCCGCACGC TGGCGGAGAT CGATCAAAAA
GTACACACCT ATCCCCTTGA TCGCATTCAC GGATACGTAC TGACACGCGA TCGTCTCGAG
GCGCTGATCG AACGCCTGCG CGGCATGAAC CAGCGCCAGC GTGAGGAAGT TCCCGGTCTG
CGGCGTGATC GCGCCGATCT CATCCTTCCC GGCGCTGTCA TCCTGGCGCA CCTGATGCGA
CGTGGCGGGT TCGAGGCGCT GACCGTCGGT GGTCAGGGGT TGCGCGAAGG GATTTTTTAC
GAGCACTTCC TGGTTGGCGA GCAACCGCCG TTATTCGCCG ATATGCGCAG TTTCAGCGTG
CAGAACCTGG CGCGCATCTA CAACTACGAG GTGTTGCATG CCGCCAAGGT GCGCGATCTG
GCATTGTCGA TGTTCGACCA GCTTCGCCCG CTGCACAATT ATGGCGAGTG GGAACGCACG
CTCCTCGCTT ACGCGGCAAC CCTGCACGAT ATTGGATTGG CGGTCAACTA TTACGACCAT
CATAAACACG GGGCGTATCT GGTGCTGAAC AGCGCGTTGC AGGGGTTCAC CCACCGCGAG
ATTGCGCTGA TCGCGCTCCT GGTTCAGTTC CACCGCAAGG GAGATGTGTC GCTCGGGGCG
TTGCGCGAAC TGCTGCATCC CGACGATGAG CCGCGCGTTT CCCGACTGGC GGCGCTGCTC
CGGATCGCCG AATATCTGGA GCGTCGGAAG TGTCAGGTGG TGCAGGATAT TACCGTCGAG
ATTGGCGATA CGATCCGGAT GACGGCGCGC ACGGTCGGTG ATGCAACCGT GGAGATCTGG
GACGCGAACC GTGGAGCGCG GTTGTTCCGC AAGGCGTATG GGCGTGATGT AGAGATCGTC
TGA
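
The GC content reported for this gene is the share of G and C bases in the gene sequence. A minimal sketch of that calculation follows; `gc_fraction` is a hypothetical helper written for this example, not part of any database tool, and it ignores the spaces and line breaks used to group the sequence into blocks of ten. To reproduce the listed figure, pass it the full gene sequence rather than the single row used here.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C among the A/C/G/T characters in seq."""
    # Keep only nucleotide letters; this drops the block spacing and newlines.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    return sum(b in "GC" for b in bases) / len(bases)


# First row of the gene sequence above, used only as sample input.
first_row = "ATGCAAAAAC GCATCGGCAT TATCGATCTG GGATCGAACA CCACGCGCAT GATCGTGATG"
print(f"{gc_fraction(first_row):.1%}")
```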
 
Protein sequence
MQKRIGIIDL GSNTTRMIVM GYTPHHSFRL LDEVRESVRL AEGIGPDGRL KPAAMDRAVA 
TMRLFHTLSH SEHVQTIVPV ATSAVREATN QAEFIARLAA EAGLSMRVLS AREEAYYGYL
GVVNSIDLRD GFVIDIGGGS TQVSQVRGRG FLRWFSQPIG ALRAMERFVH SDPISPKDFR
ALEAGIADYF APLDWLSVDQ GPMLVGIGGT IRTLAEIDQK VHTYPLDRIH GYVLTRDRLE
ALIERLRGMN QRQREEVPGL RRDRADLILP GAVILAHLMR RGGFEALTVG GQGLREGIFY
EHFLVGEQPP LFADMRSFSV QNLARIYNYE VLHAAKVRDL ALSMFDQLRP LHNYGEWERT
LLAYAATLHD IGLAVNYYDH HKHGAYLVLN SALQGFTHRE IALIALLVQF HRKGDVSLGA
LRELLHPDDE PRVSRLAALL RIAEYLERRK CQVVQDITVE IGDTIRMTAR TVGDATVEIW
DANRGARLFR KAYGRDVEIV
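
The protein sequence is the translation of the gene sequence. The sketch below shows one way to check this on the first row of each sequence. The names `CODON_TABLE` and `translate` are hypothetical, introduced only for this example. It relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ in which codons may act as start codons), and it stops at the first stop codon.

```python
from itertools import product

# Codons in the conventional T, C, A, G order (TTT, TTC, TTA, TTG, TCT, ...)
# paired with the amino acids they encode; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above.
first_row = "ATGCAAAAAC GCATCGGCAT TATCGATCTG GGATCGAACA CCACGCGCAT GATCGTGATG"
print(translate(first_row))  # prints: MQKRIGIIDLGSNTTRMIVM
```

The output matches the first twenty residues of the protein sequence (MQKRIGIIDL GSNTTRMIVM). Running `translate` on the full gene sequence should give the full protein under the same assumptions.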