Gene RoseRS_1770 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1770 
Symbol 
ID5208727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp2186121 
End bp2188643 
Gene Length2523 bp 
Protein Length840 aa 
Translation table11 
GC content59% 
IMG OID640595376 
Productphosphoenolpyruvate-protein phosphotransferase 
Protein accessionYP_001276110 
Protein GI148655905 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria)
[COG1925] Phosphotransferase system, HPr-related proteins
[COG3412] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01003] Phosphotransferase System HPr (HPr) Family
[TIGR01417] phosphoenolpyruvate-protein phosphotransferase
[TIGR02364] dihydroxyacetone kinase, phosphotransfer subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.334796 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0969879 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGAATA TCGTCCTTGT GTCGCATAGT TCCCTGCTGG CTGCCGGGAT CGTGGACATG 
ATGCGCATGG TGATGCAGCA GTCGCAGGTC TCAATTGCTG TGGCGGCAGG CGCCGACGAC
TCATCACAGA CGCTTGGAAC AGATGCGGCA AAGATCCGTG ATGCCATCGA AGACGTGTAC
AGCGATGATG GTGTGCTGGT GCTGATGGAT CTGGGCAGCG CAGTGCTGAG TGCGGAGATG
GCGCTCGATT TTCTTGCCGA AGAAAAGCGG AATCGTGTAC GATTGTGCGC TGCTCCGCTC
GTCGAAGGCG CCATCGCTGC CGCTATTCAA GCCAGTCTGG GTGCGTCGCT TGATCGCGTG
GCGGCAGAGG CGGAAGGGGC GCTGTCCGGC AAGATCGAGA GTCTGAGTCA GCGTGCGGGC
ACAGATGCTG CCGTACCCCC ACCATCGTCT GCGTCTCCGG TTGCGACCGA TGTGCAGCAG
GTGCGACTGG TTGTTGAAAA TCGCCTGGGA CTGCACGCCC GACCGGCTGC GCTGTTTGTC
CAGACTGCCG GTCGTTTTCA ATCCGATATT CGTGTTGCCC GTGCTCATGA CTCCCGGCAG
GTCAATGCAA AAAGTTTCAA CGCAGTGGCA GCGCTTGGCA TCCGGCAGTA CGACGAGATC
GTCGTCTCCG CCACCGGTGC GGATGCTGCT GAAGCGCTGG CGGCATTGCA GCGACTCGCG
GCAGAGAAGT TTGGAGAAGC CGATGATGTG GCGGACGAGC CGCAACCGCT GCCATCGCCT
CGATCGACAG ATACGCCTCC TGGCGCGCTG CGTGGCATTG CCGCATCACC CGGGTATGCT
CTTGGTCAGG CGGTGGTGCT GCGCAACGTC GAACCACAGA TTGAGCGCCT TGCTATCGAC
GATCCTGCAG CCGAGATGTC CCGGTTTTCT GTCGCTCTGG AAGCCGTTCG CAGCAAGACA
CGCCAGGTGC GTGATCAGAT TGCCCAACAC CATCCCTACG AGGCGGCGAT CTTCGACGCA
TACCTGATGT TTCTCAGCGA TCCTGATGTG TTGTCGCGCG TCCAGCAGAT CGTCGAGCGT
GAGCGCGTCA ATGTCGAGTG GGCATGGCAA CAGGCAGTGC GCGAGTCGGT ACAGGCGTTC
GAATCGCTCG ACAATGATTA CATGCGCGCA CGCGCCGTCG ATATTCGGGA TGTCGGATTG
CAGGTGCTGA CACAGTTGCT GGGACATACT GCCGTGACGC ATGTCGATCA GTCCGGGATC
GTCGTTGTTG ACGATCTCTC GCCGTCCGAC ACCGCGCGGC TCGATCCGGC AAACGTGTTG
GGTATCTGTA CCGAACGCGG AAGTTCGACC TCGCACAGCG CCATTCTGGC GCGCACGCTC
GGCATCCCTG CCGTCGTCGG CGTCGGTCCG GCAGTCGCAC AGGTGCGACC AGAGACGCCG
CTTATTATTG ATGGCTTCGC CGGTCTGGTC TGGATCGATC CTGATGAGTC GATTACTGCC
GATTATGCAG CGAAACTTGC GCAATGGCGT ACCACGTATG AGCGTGCACA GAGATCGAGT
GCCGCGCCAT CCGTGACGAA AGACGGGATC GGTATCGAGG TTGCAGCGAA TATCGGCAAT
CTCGAAGATG CACGCGCAGC ACTGGCGAAC GGCGCCGACG GGGTTGGATT GCTGCGCACT
GAATTTCTCT TTCTTGATCG AGCGACGGCG CCCGATGAGG ATGAGCAGTT CGAGGTGTAT
CATGCCATAG CTCGTCTGAT GGATCAGCGG CCGGTTGTCA TCCGCACGCT TGATGTGGGG
GGCGATAAGC CGCTGCTGTA TCTTCATATG GCGCGCGAAG AGAATCCATT TCTGGGGCAA
CGCGCCATCC GGTTGTGTCT GGAACGTCCC GATTTGTTCA AACCGCAACT GCGCGCTATC
CTGCGTGCTG CCGCTGGTCA TCGGATACGA ATCATGGTTC CCATGATCGC CGATATTGGC
GAGTGGCGGC GCGCGCGCAG CATCCTGGAC GAAACCATCG CCGAATTGCG GAACCGGGGT
GTGCCGATTC CCGATCACGT GGATGTCGGT ATGATGGTTG AAGTGCCGTC TGCGGCTTTG
CTGGCGCATA TCTTTGCGCC TGAGGTTGAT TTTTTCAGTA TCGGATCCAA CGATCTGACG
CAGTATACCC TTGCCGCCGA GCGGGGCAAT GCATCTGTCG CCTATCTCCA GGATGGATTG
CACCCGGCAG TCCTGATCCA GATTCGCCAG GTGGTGCAGA GCGCCGAAGC CGCCGGAAAA
TGGGTGAGCG TGTGCGGTGA ACTGGCTGCG GATCGCCAGG CTTTGCCAAT ACTGGTCGGG
TTGGGAGTGA AGAAACTCAG TATGTCGCCG GGTTCGATCC CGCAGGCGAA AGAACTCGTG
CGACAACTGA CGCTCAGGGA TGTGCAGCAA TGGGCAAACC AGGCGCTTAC CCTGGAGTCG
GCAAAAGCGG TTCGCCACTT TATCCGGCGA CAACTGGCGA CGATTGGCGA ATATGAGGGG
TGA
 
Protein sequence
MVNIVLVSHS SLLAAGIVDM MRMVMQQSQV SIAVAAGADD SSQTLGTDAA KIRDAIEDVY 
SDDGVLVLMD LGSAVLSAEM ALDFLAEEKR NRVRLCAAPL VEGAIAAAIQ ASLGASLDRV
AAEAEGALSG KIESLSQRAG TDAAVPPPSS ASPVATDVQQ VRLVVENRLG LHARPAALFV
QTAGRFQSDI RVARAHDSRQ VNAKSFNAVA ALGIRQYDEI VVSATGADAA EALAALQRLA
AEKFGEADDV ADEPQPLPSP RSTDTPPGAL RGIAASPGYA LGQAVVLRNV EPQIERLAID
DPAAEMSRFS VALEAVRSKT RQVRDQIAQH HPYEAAIFDA YLMFLSDPDV LSRVQQIVER
ERVNVEWAWQ QAVRESVQAF ESLDNDYMRA RAVDIRDVGL QVLTQLLGHT AVTHVDQSGI
VVVDDLSPSD TARLDPANVL GICTERGSST SHSAILARTL GIPAVVGVGP AVAQVRPETP
LIIDGFAGLV WIDPDESITA DYAAKLAQWR TTYERAQRSS AAPSVTKDGI GIEVAANIGN
LEDARAALAN GADGVGLLRT EFLFLDRATA PDEDEQFEVY HAIARLMDQR PVVIRTLDVG
GDKPLLYLHM AREENPFLGQ RAIRLCLERP DLFKPQLRAI LRAAAGHRIR IMVPMIADIG
EWRRARSILD ETIAELRNRG VPIPDHVDVG MMVEVPSAAL LAHIFAPEVD FFSIGSNDLT
QYTLAAERGN ASVAYLQDGL HPAVLIQIRQ VVQSAEAAGK WVSVCGELAA DRQALPILVG
LGVKKLSMSP GSIPQAKELV RQLTLRDVQQ WANQALTLES AKAVRHFIRR QLATIGEYEG