Gene Information |
Locus tag | Saro_2413 |
Symbol | |
ID | 3916732 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2585441 |
End bp | 2587951 |
Gene Length | 2511 bp |
Protein Length | 836 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640445168 |
Product | phosphoenolpyruvate--protein phosphotransferase |
Protein accession | YP_497683 |
Protein GI | 87200426 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria); [COG1925] Phosphotransferase system, HPr-related proteins; [COG2190] Phosphotransferase system IIA components |
TIGRFAM ID | [TIGR00830] PTS system, glucose subfamily, IIA component; [TIGR01003] Phosphotransferase System HPr (HPr) Family; [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
|
|
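The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database tooling.

```python
# Values copied from the Gene Information table above.
START_BP = 2585441
END_BP = 2587951
GENE_LENGTH_BP = 2511
PROTEIN_LENGTH_AA = 836


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 2511, matches Gene Length
print(coding_codons(GENE_LENGTH_BP))   # 836, matches Protein Length
```

Under these assumptions both derived values agree with the table, which suggests the record is internally consistent.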
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGGGT CGATTGTTCT CTCTGCGCCG ATTGCAGGTT GGGTCTCCGC GCTCGACGAG GTCCCTGACG GCGTCTTCTC CGCCCGCCTT CTGGGCGATG GCGTGGCCAT CGATCCGGTC GAAGGCCTGC TCTGCGCGCC TTGCGATGGC GAAATCCTTT CCGTCCACGC CGCGCGCCAT GCCGTCACCA TGGCAGGGGA GGGCGGCGTC GAACTGCTCA TGCACCTCGG CATCGACACG GTCGAGTTGA AGGGCGATGG CTTCGAGACG CTGGTCCGGG CAGGCCATCG CGTCGTGCGA GGGCAGCCTC TGCTGCGCTT CTCGCTCGAC GACCTTGCCG GCCGCGCGCC CTCGCTCGTC TCGCCGGTCA TCGTCACCAA TGGCGACCGT TTCACGATTT CCTCCTGCAC TGTCGACGCG CTCGTTGCCG TCGGAGACGA CCTGATCCGG CTCGATCCCG TCGGAGTCGC CCCTTCGCCG GATCGCGCGC CGGTGGGCGA GGCGGTCGGG GCCATTGTCG TCGTGCCCCT GCCCAACGGC CTTCACGCCC GGCCCGCCGC GCGCCTGGGC GAAGCGGCGC GCGCCTTCGC CGCGGAAACG CGCATCGTGA AGAACGGCCG GGCGGTTTCC ACCCGCTCGC CGGTCGGCCT GCTCGGTCTG TCCATCCAGC TGGGCGACGA AATTCTCGTC GAGGCCAGCG GACCGGACGC CTCACAGGCA GTGGCTGCGC TGGTCGACCT CATCCGCAGT GGGCTGGGCG AGGGCGCTGA ACACGATGCC GCCGCAGAGC CGCGCGCCTC GTCTGCCGGC ATGCCGCCCT TCGCTCCGGT GCCGGATGGC GCGCTGCGCG GCAATCTTGC CTCGCCCGGC TTCGCGCTCG GCACCTGCCA CCGCCTCGAC CGCTCCGAAC CGGAACTGCC GGTGCAGGGA AAGGGCAAGG AAGAGGAGCG CATTCGCCTG CTCTCGGCGA TGGACCATCT GCGGTCGCGG CTCGCGATCT CGTCCGAGGT CAGCCCTAAG GCAGCGATAT GCAGCGCTCA CCGCGCGCTG CTGGACGATC CCGAACTCGA AGCCGCGGCC CTTTCGCGTA TCGCCGAGGG CGAAGACGCC GCCCATGCCT GGCGCGAGGC TTGCCGCGCT TCGGCGGTGG TTCTGCGCGG CAGCGGTTCG GCCCGCTTTG CCGAGCGCGC GGACGACGTT CTCGACCTCG GGAGGCAGCT CGTCACCATC CTCATCGGCG AACTCGACGA GACGTTGGCG TTTCCTCCGG GGACGATCCT CCTGGCAGAC GAACTGCTGC CTTCCCAGAT CATGCAGCTC GGCGCCGAGG TTACCGGCAT CGCGCTCGCG AACGGTGGCC CGACCTCGCA TGTCGCGATC CTTGCGGCCA GCATGGGCAT TCCAATGCTT GTCGCCATCG GCGATGCGCT CGCAGGCGTT CGCGCGGGCG ACAAGGCCAT TCTCGATGCC GACGGCGGCT TTCTCCTCCC GGCGCCCGGC GCTGCCGCGC TTGCCGAGGC AGAAGCGGAA GTTGCCGCGC GTCAGGCGAG GCGCGCCGAT GCCCTCGCCA GCGCGAGCGA GCTTTGCCAC AGCCGCGACG GCGCGCGCAT CGAAGTCTTC GCCAATCTCG GCTCGCTCGA CGATGCCCGT CGGGCAGTCT CCGCCGGGGC CGAGGGCTGC GGATTGCTGC GCACGGAATT CCTGTTTCTC GAAAGGGAAA GCGCGCCGAC CGTCGCCCAG CAGGCCGAAC TCTACGCCGG GATTGCCGAT GCGCTGGGCG GTCGCCCCCT GATCGTCCGC 
CTGCTCGACA TCGGCGGTGA CAAGCCCGCA ACCTATCTGC CCATCGCGCC AGAGGCCAAT CCGGCACTCG GTTTGCGCGG CATCCGGGTG GGACTTGCCC ATCCCGATGT GCTGGAAGAC CAGTTGCGCG CCATTCTTTC GGTCGAACGC GGAGGGGCGC TGCGGATCAT GGTGCCGATG GTCACCGGCG TCGCCGAGGT CCGCGAGGTG CGCCAGCGGG TCGACCGGAT CAGCGCCGAA CTTGGCCTCG CGCAGCGCGT CGAAGTCGGC ATCATGGTCG AAACGCCTGC CGCCGCCGCC ACCGCGTTCC TGCTCGCGCC GCACGCCGAT TTCATGTCCA TCGGCACGAA CGACCTGACC CAGTACGTCC TCGCGATGGA CCGGGACAAT CCGGCCGTGG CGGGTGGGAT CGACGGGCTC CACCCGGCCG TGCTCAACCT GATCGCGCAG ACCGTGCGCG GCGCGCAATC GGTCGGTCGC TGGACCGGAG TTTGCGGCGG ACTTGCGGCA GATCGCCTTG CGGTGCCGCT GCTCCTCGGC CTCGGCGTGA CCGAGCTGTC GGTGCCGCTG CGTCAATTGC CCGAAATCAA GGCGCTCGTG CGCGGGCTGT CCATCTCGGA CTGCAAGACT CTCGCCCTCG AGGCGCTTCA GCTTGAATCC GCTGCCGAAA TCCGTGCCCT GTCGCGGGCA TTCGTGGAGA CACTGCCATG A
|
Protein sequence | MMGSIVLSAP IAGWVSALDE VPDGVFSARL LGDGVAIDPV EGLLCAPCDG EILSVHAARH AVTMAGEGGV ELLMHLGIDT VELKGDGFET LVRAGHRVVR GQPLLRFSLD DLAGRAPSLV SPVIVTNGDR FTISSCTVDA LVAVGDDLIR LDPVGVAPSP DRAPVGEAVG AIVVVPLPNG LHARPAARLG EAARAFAAET RIVKNGRAVS TRSPVGLLGL SIQLGDEILV EASGPDASQA VAALVDLIRS GLGEGAEHDA AAEPRASSAG MPPFAPVPDG ALRGNLASPG FALGTCHRLD RSEPELPVQG KGKEEERIRL LSAMDHLRSR LAISSEVSPK AAICSAHRAL LDDPELEAAA LSRIAEGEDA AHAWREACRA SAVVLRGSGS ARFAERADDV LDLGRQLVTI LIGELDETLA FPPGTILLAD ELLPSQIMQL GAEVTGIALA NGGPTSHVAI LAASMGIPML VAIGDALAGV RAGDKAILDA DGGFLLPAPG AAALAEAEAE VAARQARRAD ALASASELCH SRDGARIEVF ANLGSLDDAR RAVSAGAEGC GLLRTEFLFL ERESAPTVAQ QAELYAGIAD ALGGRPLIVR LLDIGGDKPA TYLPIAPEAN PALGLRGIRV GLAHPDVLED QLRAILSVER GGALRIMVPM VTGVAEVREV RQRVDRISAE LGLAQRVEVG IMVETPAAAA TAFLLAPHAD FMSIGTNDLT QYVLAMDRDN PAVAGGIDGL HPAVLNLIAQ TVRGAQSVGR WTGVCGGLAA DRLAVPLLLG LGVTELSVPL RQLPEIKALV RGLSISDCKT LALEALQLES AAEIRALSRA FVETLP
|
| |