Gene Information |
Locus tag | RoseRS_1770 |
Symbol | |
ID | 5208727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 2186121 |
End bp | 2188643 |
Gene Length | 2523 bp |
Protein Length | 840 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640595376 |
Product | phosphoenolpyruvate-protein phosphotransferase |
Protein accession | YP_001276110 |
Protein GI | 148655905 |
COG category | [G] Carbohydrate transport and metabolism; [S] Function unknown |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria); [COG1925] Phosphotransferase system, HPr-related proteins; [COG3412] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01003] Phosphotransferase System HPr (HPr) Family; [TIGR01417] phosphoenolpyruvate-protein phosphotransferase; [TIGR02364] dihydroxyacetone kinase, phosphotransfer subunit |
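
The Start bp, End bp, Gene Length, and Protein Length fields above agree with each other if the coordinates are read as 1-based and inclusive and the final codon is a stop codon. That reading is an assumption, not something the record states. The function names below are hypothetical and serve only to illustrate the arithmetic.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length_bp = gene_length(2186121, 2188643)
print(length_bp)                  # 2523
print(protein_length(length_bp))  # 840
```

Both results match the Gene Length (2523 bp) and Protein Length (840 aa) fields listed in the record.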
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.334796 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0969879 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGTGAATA TCGTCCTTGT GTCGCATAGT TCCCTGCTGG CTGCCGGGAT CGTGGACATG ATGCGCATGG TGATGCAGCA GTCGCAGGTC TCAATTGCTG TGGCGGCAGG CGCCGACGAC TCATCACAGA CGCTTGGAAC AGATGCGGCA AAGATCCGTG ATGCCATCGA AGACGTGTAC AGCGATGATG GTGTGCTGGT GCTGATGGAT CTGGGCAGCG CAGTGCTGAG TGCGGAGATG GCGCTCGATT TTCTTGCCGA AGAAAAGCGG AATCGTGTAC GATTGTGCGC TGCTCCGCTC GTCGAAGGCG CCATCGCTGC CGCTATTCAA GCCAGTCTGG GTGCGTCGCT TGATCGCGTG GCGGCAGAGG CGGAAGGGGC GCTGTCCGGC AAGATCGAGA GTCTGAGTCA GCGTGCGGGC ACAGATGCTG CCGTACCCCC ACCATCGTCT GCGTCTCCGG TTGCGACCGA TGTGCAGCAG GTGCGACTGG TTGTTGAAAA TCGCCTGGGA CTGCACGCCC GACCGGCTGC GCTGTTTGTC CAGACTGCCG GTCGTTTTCA ATCCGATATT CGTGTTGCCC GTGCTCATGA CTCCCGGCAG GTCAATGCAA AAAGTTTCAA CGCAGTGGCA GCGCTTGGCA TCCGGCAGTA CGACGAGATC GTCGTCTCCG CCACCGGTGC GGATGCTGCT GAAGCGCTGG CGGCATTGCA GCGACTCGCG GCAGAGAAGT TTGGAGAAGC CGATGATGTG GCGGACGAGC CGCAACCGCT GCCATCGCCT CGATCGACAG ATACGCCTCC TGGCGCGCTG CGTGGCATTG CCGCATCACC CGGGTATGCT CTTGGTCAGG CGGTGGTGCT GCGCAACGTC GAACCACAGA TTGAGCGCCT TGCTATCGAC GATCCTGCAG CCGAGATGTC CCGGTTTTCT GTCGCTCTGG AAGCCGTTCG CAGCAAGACA CGCCAGGTGC GTGATCAGAT TGCCCAACAC CATCCCTACG AGGCGGCGAT CTTCGACGCA TACCTGATGT TTCTCAGCGA TCCTGATGTG TTGTCGCGCG TCCAGCAGAT CGTCGAGCGT GAGCGCGTCA ATGTCGAGTG GGCATGGCAA CAGGCAGTGC GCGAGTCGGT ACAGGCGTTC GAATCGCTCG ACAATGATTA CATGCGCGCA CGCGCCGTCG ATATTCGGGA TGTCGGATTG CAGGTGCTGA CACAGTTGCT GGGACATACT GCCGTGACGC ATGTCGATCA GTCCGGGATC GTCGTTGTTG ACGATCTCTC GCCGTCCGAC ACCGCGCGGC TCGATCCGGC AAACGTGTTG GGTATCTGTA CCGAACGCGG AAGTTCGACC TCGCACAGCG CCATTCTGGC GCGCACGCTC GGCATCCCTG CCGTCGTCGG CGTCGGTCCG GCAGTCGCAC AGGTGCGACC AGAGACGCCG CTTATTATTG ATGGCTTCGC CGGTCTGGTC TGGATCGATC CTGATGAGTC GATTACTGCC GATTATGCAG CGAAACTTGC GCAATGGCGT ACCACGTATG AGCGTGCACA GAGATCGAGT GCCGCGCCAT CCGTGACGAA AGACGGGATC GGTATCGAGG TTGCAGCGAA TATCGGCAAT CTCGAAGATG CACGCGCAGC ACTGGCGAAC GGCGCCGACG GGGTTGGATT GCTGCGCACT GAATTTCTCT TTCTTGATCG AGCGACGGCG CCCGATGAGG ATGAGCAGTT CGAGGTGTAT CATGCCATAG CTCGTCTGAT GGATCAGCGG CCGGTTGTCA TCCGCACGCT TGATGTGGGG 
GGCGATAAGC CGCTGCTGTA TCTTCATATG GCGCGCGAAG AGAATCCATT TCTGGGGCAA CGCGCCATCC GGTTGTGTCT GGAACGTCCC GATTTGTTCA AACCGCAACT GCGCGCTATC CTGCGTGCTG CCGCTGGTCA TCGGATACGA ATCATGGTTC CCATGATCGC CGATATTGGC GAGTGGCGGC GCGCGCGCAG CATCCTGGAC GAAACCATCG CCGAATTGCG GAACCGGGGT GTGCCGATTC CCGATCACGT GGATGTCGGT ATGATGGTTG AAGTGCCGTC TGCGGCTTTG CTGGCGCATA TCTTTGCGCC TGAGGTTGAT TTTTTCAGTA TCGGATCCAA CGATCTGACG CAGTATACCC TTGCCGCCGA GCGGGGCAAT GCATCTGTCG CCTATCTCCA GGATGGATTG CACCCGGCAG TCCTGATCCA GATTCGCCAG GTGGTGCAGA GCGCCGAAGC CGCCGGAAAA TGGGTGAGCG TGTGCGGTGA ACTGGCTGCG GATCGCCAGG CTTTGCCAAT ACTGGTCGGG TTGGGAGTGA AGAAACTCAG TATGTCGCCG GGTTCGATCC CGCAGGCGAA AGAACTCGTG CGACAACTGA CGCTCAGGGA TGTGCAGCAA TGGGCAAACC AGGCGCTTAC CCTGGAGTCG GCAAAAGCGG TTCGCCACTT TATCCGGCGA CAACTGGCGA CGATTGGCGA ATATGAGGGG TGA
|
Protein sequence | MVNIVLVSHS SLLAAGIVDM MRMVMQQSQV SIAVAAGADD SSQTLGTDAA KIRDAIEDVY SDDGVLVLMD LGSAVLSAEM ALDFLAEEKR NRVRLCAAPL VEGAIAAAIQ ASLGASLDRV AAEAEGALSG KIESLSQRAG TDAAVPPPSS ASPVATDVQQ VRLVVENRLG LHARPAALFV QTAGRFQSDI RVARAHDSRQ VNAKSFNAVA ALGIRQYDEI VVSATGADAA EALAALQRLA AEKFGEADDV ADEPQPLPSP RSTDTPPGAL RGIAASPGYA LGQAVVLRNV EPQIERLAID DPAAEMSRFS VALEAVRSKT RQVRDQIAQH HPYEAAIFDA YLMFLSDPDV LSRVQQIVER ERVNVEWAWQ QAVRESVQAF ESLDNDYMRA RAVDIRDVGL QVLTQLLGHT AVTHVDQSGI VVVDDLSPSD TARLDPANVL GICTERGSST SHSAILARTL GIPAVVGVGP AVAQVRPETP LIIDGFAGLV WIDPDESITA DYAAKLAQWR TTYERAQRSS AAPSVTKDGI GIEVAANIGN LEDARAALAN GADGVGLLRT EFLFLDRATA PDEDEQFEVY HAIARLMDQR PVVIRTLDVG GDKPLLYLHM AREENPFLGQ RAIRLCLERP DLFKPQLRAI LRAAAGHRIR IMVPMIADIG EWRRARSILD ETIAELRNRG VPIPDHVDVG MMVEVPSAAL LAHIFAPEVD FFSIGSNDLT QYTLAAERGN ASVAYLQDGL HPAVLIQIRQ VVQSAEAAGK WVSVCGELAA DRQALPILVG LGVKKLSMSP GSIPQAKELV RQLTLRDVQQ WANQALTLES AKAVRHFIRR QLATIGEYEG
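
The sequence fields are written in blocks of ten characters separated by spaces, so the spacing has to be stripped before any computation. A minimal sketch of how a GC percentage like the one in the GC content field could be computed from such a string follows. The helper name `gc_content` is hypothetical, and the record does not say how its own figure was rounded.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a sequence, ignoring any whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two ten-base blocks of the gene sequence, used only as a short demo.
demo = "ATGGTGAATA TCGTCCTTGT"
print(gc_content(demo))
```

Passing the full Gene sequence value to the same function gives the percentage for the whole gene.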