Gene RoseRS_4157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4157 
Symbol 
ID5211141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5204061 
End bp5206061 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content63% 
IMG OID640597746 
Productphosphate binding protein 
Protein accessionYP_001278451 
Protein GI148658246 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0226] ABC-type phosphate transport system, periplasmic component 
TIGRFAM ID[TIGR02136] phosphate binding protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.16838 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACATCA TCAGGAAACG GTTTTTCGCG TCGCTGCTGA TGCTGGCGCT GATCGCGCCA 
ATCATCGCAG CCTGTGGCGG TCAGACCGCG CAACCGACCG CCGCGCCCGC GCAACCGACC
GCCGCGCCCG CGGAAGCAAC CCCGGCGCCA GGACAACCGA CCGCCGCGCC CGCGCAACCG
ACCGCCGCGC CCACGCAACC GACTGCCGAA CCGGCGGCGG AGATCGATTA CGATGGTCTT
CCCGACGTCG ATCCGGGAGC AGTAACGGGG AATATCGTCA CCGCCGGGTC ATCGACGGTC
TTCCCGCTGA CCCAGCGGAT GGCGGAGCGC TTCAAGGACG AGGGGTACAC TGGCAACATC
ACGATCGACT CGATCGGCAC CGGCGCCGGG TTCGAGCGCT TCTGCAAGGC GGGCGAAACC
GACATCTCCA ACGCCAGCCG CCCCATCAAA GCCGCTGAAG TCGAGAACTG CCGCGCCATC
GGTCGCGAGC CGGTCGAGTT CCGCGTCGGC ACCGACGCCC TGGCAGTGGT CGTCAACCCG
GCCAACACCT TCGTCGACAG CCTGACCAAG GCGCAACTCG CCGATATCTT CTCCGGCAAA
GCCAAGACCT GGAAGGACGT CAACCCCGAC TGGCCCGCCA ATCCGATCAA ACTCTTCAGC
CCCGGCTCCG ACTCCGGCAC CTTCGACTTC TTCGTCGAAG TGGTGATGGA CCCGGCGTTT
GAGAAGAAGG GGAAGGAAGC CATCCTGAAC GCTCCCGGCA TTCAGTTGAG CGAGAACGAC
AACGTGCTGG TGCAGGGCGT GGAAGGCGAC CCCAACGCCA TCGGCTACTT CGGCTACGCC
TACTTCGTTC CCGAGAAGGA TCGCCTCAAA GCGGTGAAGG TCGAAGGCGT CGAGCCTACT
GAGAAGACCG CCGAAACCGG CGAGTATCCG CTGGCGCGAC CGCTCTTCAT CTACTCCGAC
GCGAAGATTA TGAAGGAGAA GCCGCAGGTC GCCGCGTTCA TCAACTTCTA TCTGACCTTC
GTCAACGATG AGGTACTCGA CGTCGGCTAC TTCCCCGCCT CCCAGCAGGC GATCAACGTC
GCCAAAGCAA ACTGGCTGGC AGCGATGGGA ATGGAAGTCA AGATGCCTGA GGTCGATCCG
GGAGCAGTAA CGGGGAATAT CGTCACCGCC GGGTCATCGA CGGTCTTCCC GCTGACCCAG
CGGATGGCGG AGCGCTTCAA GGACGAGGGG TACACTGGCA ACATCACGAT CGACTCGATC
GGCACCGGCG CCGGGTTCGA GCGCTTCTGC AAGGCGGGCG AAACCGACAT CTCCAACGCC
AGCCGCCCCA TCAAAGCCGC TGAAGTCGAG AACTGCCGCG CCATCGGTCG CGAGCCGGTC
GAGTTCCGCG TCGGCACCGA CGCCCTGGCA GTGGTCGTCA ACCCGGCCAA CACCTTCGTC
GACAGCCTGA CCAAGGCGCA ACTCGCCGAT ATCTTCTCCG GCAAAGCCAA GACCTGGAAG
GACGTCAACC CCGACTGGCC CGCCAATCCG ATCAAACTCT TCAGCCCCGG CACCGACTCC
GGCACCTTCG ACTTCTTCGT CGAAGTGGTG ATGGACCCGG CGTTTGAGAA GAAGGGGAAG
GAAGCCATCC TGAACGCTCC CGGCATTCAG TTGAGCGAGA ACGACAACGT GCTGGTGCAG
GGCGTGGAAG GCGACCCCAA CGCCATCGGC TACTTCGGCT ACGCCTACTT CATCGCCGAG
AAGGATCGCC TCAAAGCGGT GAAGGTCGAA GGCGTCGAGC CGAATGACCA GACGGCGGAG
AGCGGGCAGT ATCCGCTGGC GCGCCCGCTC TTCATCTACT CCGACGCCAA GATTATGAAG
GAGAAGCCGC AGGTCGCCGC CTTCATCAAC TTCTATCTGA CCTTCGTCAA CGATGAGGTG
CTCGACGTCG GCTACTTCCC TGCCTCTCAG CAGGCGATCA ACGTCGCCAA ACTGAATTGG
CTGGCAGCGA TGAAGCCATA A
 
Protein sequence
MHIIRKRFFA SLLMLALIAP IIAACGGQTA QPTAAPAQPT AAPAEATPAP GQPTAAPAQP 
TAAPTQPTAE PAAEIDYDGL PDVDPGAVTG NIVTAGSSTV FPLTQRMAER FKDEGYTGNI
TIDSIGTGAG FERFCKAGET DISNASRPIK AAEVENCRAI GREPVEFRVG TDALAVVVNP
ANTFVDSLTK AQLADIFSGK AKTWKDVNPD WPANPIKLFS PGSDSGTFDF FVEVVMDPAF
EKKGKEAILN APGIQLSEND NVLVQGVEGD PNAIGYFGYA YFVPEKDRLK AVKVEGVEPT
EKTAETGEYP LARPLFIYSD AKIMKEKPQV AAFINFYLTF VNDEVLDVGY FPASQQAINV
AKANWLAAMG MEVKMPEVDP GAVTGNIVTA GSSTVFPLTQ RMAERFKDEG YTGNITIDSI
GTGAGFERFC KAGETDISNA SRPIKAAEVE NCRAIGREPV EFRVGTDALA VVVNPANTFV
DSLTKAQLAD IFSGKAKTWK DVNPDWPANP IKLFSPGTDS GTFDFFVEVV MDPAFEKKGK
EAILNAPGIQ LSENDNVLVQ GVEGDPNAIG YFGYAYFIAE KDRLKAVKVE GVEPNDQTAE
SGQYPLARPL FIYSDAKIMK EKPQVAAFIN FYLTFVNDEV LDVGYFPASQ QAINVAKLNW
LAAMKP