Gene RoseRS_3304 details


Gene Information

Locus tag: RoseRS_3304
Symbol:
ID: 5210279
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4150357
End bp: 4152027
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 66%
IMG OID: 640596900
Product: phosphoenolpyruvate-protein phosphotransferase
Protein accession: YP_001277615
Protein GI: 148657410
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria)
TIGRFAM ID: [TIGR01417] phosphoenolpyruvate-protein phosphotransferase
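
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the CDS includes its terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 4150357
END_BP = 4152027


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Protein length for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1671, matching "Gene Length"
print(protein_length_aa(length))  # 556, matching "Protein Length"
```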


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGATTT ACCTGCGTGG CGCTGGCGGT TCGCCGGGAG TGGCGCTCGG ACGGGCGGTG 
CGTTACCTGC CTGACCATCA CGCCTGGCAC GTGATCGATG CCGACGTCGA TGCGGCGATC
AGACGACTCG TTGCTGCTCA GGCAACGGCT GCCGGTCAAC TTCGGGCGCT GGCAGCGATC
CTGCGCGAAG AGGGACGCCT GGAGGAAGCA CGCATCTTCG ATGCCCATGC GCTGCTCGTC
GAAGATGAGA CCCTGACACA GGATGTGGCG CGGCGTATGC GCGAAGGGCA CGTCAGCCTG
GAACAGGCGC TGACGGTTGC CATCAGCGCG CTGCGCGAAA CCATCGAAGC GATTGATGAC
CCCTACCTGC GCGAACGCTC CAGCGACATC GACAGCGTGC GCCGCGCGAT CCTGACCGCA
CTGCGTGGCG AGGCGCGCCA CATGCACGAT CTCCCGATTG GCGCGATCCT GGTCGCCAGC
GACCTGACCC CCGCCGAGGC GGTCAGCCTG CGCGATGGGC GTGTCGCCGG GTTTGTCACT
GCCGAAGGCG GACCGAACAG TCACACGACA ATCCTGGCGC GCGCCTTCGG CATCCCGGCA
GTCGTCGGTT TGGGCGCCGC AACCCTGGCG ATCCCCGATC ATGCGCCGCT GGTGCTCGAC
GGACACGCTG CCCTGGTGAT CGTCGATCCT GATACGTTTG AGTGGTCGGC GTATGAACGC
CGGGCTTCGG GAACGGTCGC GGCGCGGGTG CAGCGCCATC CCCTGCACGA TCAGCCGGGA
CGCATGGCGA GCGGCGAACT GGTGACGATC TGGGCGAATA TCAGCCATCC GCTGGAAGCG
CGCATCGCGC TTGAACGGGG CGCGGAGGGG ATCGGATTGT TTCGCACCGA GTTTCTGTTC
ATGGGACGAA ACACGCCGCC TGATGAGCAG GAGCAGTACG AAGCGTATCG GACAGTGGTG
GAAACCATGA AAGGGCAGGC GGTCATTATT CGCACGCTGG ACATCGGCGG CGATAAGCGG
GTGGAGTATC TCGAACTGCC GCGTGAACTC AACCCTTCGC TCGGCATTCG CGGGTTGCGC
CTCTCCATGC TGCATCCCGA TCTGTTCCAG ACGCAGATCC GCGCCATGCT CCGGGCGGCG
GTTCACGGCG ACCTGCGCAT CCTGCTCCCT ATGGTCACAA CCCCCGACGA AGTGACGTGG
GCGCGGGCGC AGGTCCGCGC CGCCGCCGAG TCGCTGGCGC GCGATCAGAT CCCGCACCGC
GCCGATGTGC CGGTCGGCGT TATGATCGAA ACGCCGGCTG CGGCGGTGAC CGCCGACCTG
ATCGCGCGTG AGGCGGCATT CTTCAGCATT GGCAGCAACG ATCTGGCGCA GTATACGCTC
GCTGCCGATC GTACCAGCGC CGATGTTTCG ACCCGCTATC CGCAGCACTC CGCTGCGGTG
CTGCGACTGA TCGCGCAGAC TGTCGGCGCT GCGGCGCGCG CCCATCTGCC GGTATGCGTC
TGCGGCGAGA TTGCCGGCGT CCCGGAACTG GCGTCGCTTC TGGTTGGACT TGGCGTGTTT
CAGTTGAGCA TGAATCCGGC AAGCATCCCT GGGGTCAAGG AGCGTCTCAG CGAAACCGCT
CTGGCGGAAG CGCGCGCCGC AGCACGCTCC GTATTGAATA TCTACGTATG A
 
Protein sequence
MAIYLRGAGG SPGVALGRAV RYLPDHHAWH VIDADVDAAI RRLVAAQATA AGQLRALAAI 
LREEGRLEEA RIFDAHALLV EDETLTQDVA RRMREGHVSL EQALTVAISA LRETIEAIDD
PYLRERSSDI DSVRRAILTA LRGEARHMHD LPIGAILVAS DLTPAEAVSL RDGRVAGFVT
AEGGPNSHTT ILARAFGIPA VVGLGAATLA IPDHAPLVLD GHAALVIVDP DTFEWSAYER
RASGTVAARV QRHPLHDQPG RMASGELVTI WANISHPLEA RIALERGAEG IGLFRTEFLF
MGRNTPPDEQ EQYEAYRTVV ETMKGQAVII RTLDIGGDKR VEYLELPREL NPSLGIRGLR
LSMLHPDLFQ TQIRAMLRAA VHGDLRILLP MVTTPDEVTW ARAQVRAAAE SLARDQIPHR
ADVPVGVMIE TPAAAVTADL IAREAAFFSI GSNDLAQYTL AADRTSADVS TRYPQHSAAV
LRLIAQTVGA AARAHLPVCV CGEIAGVPEL ASLLVGLGVF QLSMNPASIP GVKERLSETA
LAEARAAARS VLNIYV
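
The sequences above are printed in space-separated blocks of ten. As a rough sketch of how they could be checked against each other, the Python below strips the block formatting, computes a GC fraction, and translates the nucleotides codon by codon. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon table is the standard one, whose amino acid assignments translation table 11 shares; the alternative start codons that table 11 allows are not handled here.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGCGATTT ACCTGCGTGG CGCTGGCGGT"))  # MAIYLRGAGG
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is one way to confirm that the two listings agree.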