Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0294 |
Symbol | |
ID | 5897568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 327313 |
End bp | 329835 |
Gene Length | 2523 bp |
Protein Length | 840 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641560778 |
Product | phosphoenolpyruvate-protein phosphotransferase |
Protein accession | YP_001681929 |
Protein GI | 167644266 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) [COG1925] Phosphotransferase system, HPr-related proteins [COG2190] Phosphotransferase system IIA components |
TIGRFAM ID | [TIGR00830] PTS system, glucose subfamily, IIA component [TIGR01003] Phosphotransferase System HPr (HPr) Family [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.640838 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCAATC TCGTACTGTC CTCGCCGCTC AAGGGCTGGG TCGCCCCGCT CGACGAGGCG CCCGACGCGG TGTTCGCCGA GCGCATGCTG GGCGACGGCC TGGCCATCGA TCCGCTGGGC TCCAGCCTTC ATGCGCCGTG CGACGGGCGG GTGATCGCCG TGCACGCCAC GCGTCACGCC GTCACCCTGC GGGCCGACAA CGGGGCCGAG ATTCTGATGC ATGTCGGGCT GGAGACCGTC GGCCTGGGCG GCGAGGGCTT CGAGGTCCAC GTCAAGGATG GCCAGGCGGT GAAGGCCGGC GACAAGCTGA TCAGCTTCGA CCTCGACCTG CTGTCGCAGC GCGCCAAGAG CCTGATCACC CCGGTGGTGA TCACCAATCC CGACGCCTTC CGGATCGTCC GCCGCGCGCA GGACCATGCC GCCTCGGTGG GCGATTTCCT GATGGAGCTG GCCCCGGTCG GCGGCGCGGG CGCGGTCCTG GCGGTCGCCG GGGGCGAGGT GTCGCGCGAG GTCGTCGTGC CTCTGCGGCA TGGCGTGCAC GCGCGTCCCG CCGCCCGTAT CGCGGAGGTC GCCCGCGAGT TCGCCGCGGA TGTGGCGATC GTGGCCGTCA ACCGCCGGGC CAGCGCCAAG AGCCCGGTCG GGGTGATGTC CCTGGCCATC CACCATGGCG ACCGCATCAG CGTCGTGGCC AGCGGCCCGG ACGCCGACTT CGCCGCCGCC GCCGTCGTCG CCCTGATCGA GAGCGGCATG GGCGAAAGCG CGGTCCTGCC GCTGACCGCC GCGCCGGCCT CCGTCGTCGA GGCCGTCCGC GCCGTGCCCG CCAGCGAGCC AGGCCTGCTG CGCGGGGTGA TGGCGGCGCC GGGCCTCGCC ATCGGCCAGG CCGTGCGCTT CGTCTCCGCC GACATCGTCG TGGCCGAGAC CGGGGCCGGC GCGGGCGCCG AGCGCGCGGC GCTGGAGGAG GCGCTGATCC ACGTCCGCGC CAAGATCGAG AAGGCCGCCG CTCAAAGCGC CTTGGGGGGC GACACCGCCC GAAAGGCCAT CCTCGGCGCC CACCTGGCCT TCCTGGAAGA TCCCGAATTG ACCGCCTCGG CCCACAGGCT GATCGCTGGC GGCAAGAGCG CGGGCTACGC CTGGCGCCGC GCCGTGGGCG GCTATGTCGA AGCCCTGCGC GGCCTGGGCG ACCGCCGCCT GGCCGAGCGC GTCGATGACC TGATCGATCT GGAGCGCCAG GTTCTGCGCG CGCTTTCCGG CGAGGAGGAC GAGGCTCGCG CCCTGCCGCC GGGTTCGATC CTGCTGGCCG ACGAACTGCT GCCCTCGCAG CTGATGGGCC TGGACGCCGC CCAGGTGGCG GGGCTGTGCA CCGCCAAGGG CGGGCCGACC TCGCATGTCG CCATCCTGGC CGCCGCCATG GGCATACCCG CCATCGTCGC GGCCGGTCCC GGCGTGCTGG ACGTCACCGA AGGCGCGGGC CTGATCCTCG ACGCCGAGAA CGGCGCGCTG CGGGTTGGAC CCGACGCTGG CCAGTTGGCC GCCGCCCAGA CCGCCATCGC CGCTCGCCAC GAACGCAAGG CGGCCGCCCA AGCCGCCGCC CATCAGGAAA GCCGCACCGC CGATGGGGTC CGCATCGAGG TGTTCGGCAA TGTCGGCTCG CTGAACGACG CTGTGGCCGC CGCCGCCAAC GGCGCTGAAG GCTGCGGCCT GCTGCGCACC GAGTTTCTGT TCCTTGAGCG CGAGACCCCG CCCGACGAGG ACGAGCAGGC CCGCCAGTAC CAGGCCATCG CCTCGGCCCT GGACGGCCGT CCGCTGATCA TCCGCACCCT CGACGTCGGC GGCGACAAGG CCGCGCCCTA CCTGCCGATC CCGGCCGAGG AGAACCCGGC CCTGGGGCTG CGCGGCGTTC GCGTGTCGCT GTGGCGGCCG CATCTGCTCA AGACCCAGCT GCGCGCCATC CTGCGGGTCA AACCGTTGGG CCAATGCAAG ATCATGGTTC CGATGATCGC CAGCCTCGAC GAGTTGCGCG CCGTCCGCGC GGTCCTCGAA GACGCCAAGC GCGAGATGGG GATCACCGAC CACGTCGAGC TCGGCGTCAT GATCGAGACC CCCGCCGCCG CCGTCACCGC CGACCTGCTG GCCGCCGAGG CCGACTTCCT GTCGATCGGC ACCAACGACC TGACCCAGTA TGTGCTGGCC ATGGACCGGG GCAATCCGGA GCTGGCCGCG CGCATCGACG CCTTGCACCC GGCGGTGCTG CGGATGATCG CCCAGACCTG CGCCGGCGCG GCCAAGCACC AGCGTTGGGT CGGCGTCTGC GGAGGCCTGG CCTCCGACCT TGTCGCCGTG CCGGTGCTGG TCGGGCTGGG CGTCACCGAG CTGTCCGCGA CCGCCGCCGC CGTGCCCGAG GTCAAGGCCC TGGTCCGGAC CCTGAACGTC CCAGCCTGCC AGGCCCTGGC CCGCCAAGCG CTGGACCTGA CCTCGCCCGA GGCTGTGCGC CAACTCTGCA AATCCTTCCA GGCGGGGGCC TGA
|
Protein sequence | MANLVLSSPL KGWVAPLDEA PDAVFAERML GDGLAIDPLG SSLHAPCDGR VIAVHATRHA VTLRADNGAE ILMHVGLETV GLGGEGFEVH VKDGQAVKAG DKLISFDLDL LSQRAKSLIT PVVITNPDAF RIVRRAQDHA ASVGDFLMEL APVGGAGAVL AVAGGEVSRE VVVPLRHGVH ARPAARIAEV AREFAADVAI VAVNRRASAK SPVGVMSLAI HHGDRISVVA SGPDADFAAA AVVALIESGM GESAVLPLTA APASVVEAVR AVPASEPGLL RGVMAAPGLA IGQAVRFVSA DIVVAETGAG AGAERAALEE ALIHVRAKIE KAAAQSALGG DTARKAILGA HLAFLEDPEL TASAHRLIAG GKSAGYAWRR AVGGYVEALR GLGDRRLAER VDDLIDLERQ VLRALSGEED EARALPPGSI LLADELLPSQ LMGLDAAQVA GLCTAKGGPT SHVAILAAAM GIPAIVAAGP GVLDVTEGAG LILDAENGAL RVGPDAGQLA AAQTAIAARH ERKAAAQAAA HQESRTADGV RIEVFGNVGS LNDAVAAAAN GAEGCGLLRT EFLFLERETP PDEDEQARQY QAIASALDGR PLIIRTLDVG GDKAAPYLPI PAEENPALGL RGVRVSLWRP HLLKTQLRAI LRVKPLGQCK IMVPMIASLD ELRAVRAVLE DAKREMGITD HVELGVMIET PAAAVTADLL AAEADFLSIG TNDLTQYVLA MDRGNPELAA RIDALHPAVL RMIAQTCAGA AKHQRWVGVC GGLASDLVAV PVLVGLGVTE LSATAAAVPE VKALVRTLNV PACQALARQA LDLTSPEAVR QLCKSFQAGA
|
| |