Gene Caul_0294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0294 
Symbol 
ID5897568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp327313 
End bp329835 
Gene Length2523 bp 
Protein Length840 aa 
Translation table11 
GC content73% 
IMG OID641560778 
Productphosphoenolpyruvate-protein phosphotransferase 
Protein accessionYP_001681929 
Protein GI167644266 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria)
[COG1925] Phosphotransferase system, HPr-related proteins
[COG2190] Phosphotransferase system IIA components 
TIGRFAM ID[TIGR00830] PTS system, glucose subfamily, IIA component
[TIGR01003] Phosphotransferase System HPr (HPr) Family
[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.640838 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCAATC TCGTACTGTC CTCGCCGCTC AAGGGCTGGG TCGCCCCGCT CGACGAGGCG 
CCCGACGCGG TGTTCGCCGA GCGCATGCTG GGCGACGGCC TGGCCATCGA TCCGCTGGGC
TCCAGCCTTC ATGCGCCGTG CGACGGGCGG GTGATCGCCG TGCACGCCAC GCGTCACGCC
GTCACCCTGC GGGCCGACAA CGGGGCCGAG ATTCTGATGC ATGTCGGGCT GGAGACCGTC
GGCCTGGGCG GCGAGGGCTT CGAGGTCCAC GTCAAGGATG GCCAGGCGGT GAAGGCCGGC
GACAAGCTGA TCAGCTTCGA CCTCGACCTG CTGTCGCAGC GCGCCAAGAG CCTGATCACC
CCGGTGGTGA TCACCAATCC CGACGCCTTC CGGATCGTCC GCCGCGCGCA GGACCATGCC
GCCTCGGTGG GCGATTTCCT GATGGAGCTG GCCCCGGTCG GCGGCGCGGG CGCGGTCCTG
GCGGTCGCCG GGGGCGAGGT GTCGCGCGAG GTCGTCGTGC CTCTGCGGCA TGGCGTGCAC
GCGCGTCCCG CCGCCCGTAT CGCGGAGGTC GCCCGCGAGT TCGCCGCGGA TGTGGCGATC
GTGGCCGTCA ACCGCCGGGC CAGCGCCAAG AGCCCGGTCG GGGTGATGTC CCTGGCCATC
CACCATGGCG ACCGCATCAG CGTCGTGGCC AGCGGCCCGG ACGCCGACTT CGCCGCCGCC
GCCGTCGTCG CCCTGATCGA GAGCGGCATG GGCGAAAGCG CGGTCCTGCC GCTGACCGCC
GCGCCGGCCT CCGTCGTCGA GGCCGTCCGC GCCGTGCCCG CCAGCGAGCC AGGCCTGCTG
CGCGGGGTGA TGGCGGCGCC GGGCCTCGCC ATCGGCCAGG CCGTGCGCTT CGTCTCCGCC
GACATCGTCG TGGCCGAGAC CGGGGCCGGC GCGGGCGCCG AGCGCGCGGC GCTGGAGGAG
GCGCTGATCC ACGTCCGCGC CAAGATCGAG AAGGCCGCCG CTCAAAGCGC CTTGGGGGGC
GACACCGCCC GAAAGGCCAT CCTCGGCGCC CACCTGGCCT TCCTGGAAGA TCCCGAATTG
ACCGCCTCGG CCCACAGGCT GATCGCTGGC GGCAAGAGCG CGGGCTACGC CTGGCGCCGC
GCCGTGGGCG GCTATGTCGA AGCCCTGCGC GGCCTGGGCG ACCGCCGCCT GGCCGAGCGC
GTCGATGACC TGATCGATCT GGAGCGCCAG GTTCTGCGCG CGCTTTCCGG CGAGGAGGAC
GAGGCTCGCG CCCTGCCGCC GGGTTCGATC CTGCTGGCCG ACGAACTGCT GCCCTCGCAG
CTGATGGGCC TGGACGCCGC CCAGGTGGCG GGGCTGTGCA CCGCCAAGGG CGGGCCGACC
TCGCATGTCG CCATCCTGGC CGCCGCCATG GGCATACCCG CCATCGTCGC GGCCGGTCCC
GGCGTGCTGG ACGTCACCGA AGGCGCGGGC CTGATCCTCG ACGCCGAGAA CGGCGCGCTG
CGGGTTGGAC CCGACGCTGG CCAGTTGGCC GCCGCCCAGA CCGCCATCGC CGCTCGCCAC
GAACGCAAGG CGGCCGCCCA AGCCGCCGCC CATCAGGAAA GCCGCACCGC CGATGGGGTC
CGCATCGAGG TGTTCGGCAA TGTCGGCTCG CTGAACGACG CTGTGGCCGC CGCCGCCAAC
GGCGCTGAAG GCTGCGGCCT GCTGCGCACC GAGTTTCTGT TCCTTGAGCG CGAGACCCCG
CCCGACGAGG ACGAGCAGGC CCGCCAGTAC CAGGCCATCG CCTCGGCCCT GGACGGCCGT
CCGCTGATCA TCCGCACCCT CGACGTCGGC GGCGACAAGG CCGCGCCCTA CCTGCCGATC
CCGGCCGAGG AGAACCCGGC CCTGGGGCTG CGCGGCGTTC GCGTGTCGCT GTGGCGGCCG
CATCTGCTCA AGACCCAGCT GCGCGCCATC CTGCGGGTCA AACCGTTGGG CCAATGCAAG
ATCATGGTTC CGATGATCGC CAGCCTCGAC GAGTTGCGCG CCGTCCGCGC GGTCCTCGAA
GACGCCAAGC GCGAGATGGG GATCACCGAC CACGTCGAGC TCGGCGTCAT GATCGAGACC
CCCGCCGCCG CCGTCACCGC CGACCTGCTG GCCGCCGAGG CCGACTTCCT GTCGATCGGC
ACCAACGACC TGACCCAGTA TGTGCTGGCC ATGGACCGGG GCAATCCGGA GCTGGCCGCG
CGCATCGACG CCTTGCACCC GGCGGTGCTG CGGATGATCG CCCAGACCTG CGCCGGCGCG
GCCAAGCACC AGCGTTGGGT CGGCGTCTGC GGAGGCCTGG CCTCCGACCT TGTCGCCGTG
CCGGTGCTGG TCGGGCTGGG CGTCACCGAG CTGTCCGCGA CCGCCGCCGC CGTGCCCGAG
GTCAAGGCCC TGGTCCGGAC CCTGAACGTC CCAGCCTGCC AGGCCCTGGC CCGCCAAGCG
CTGGACCTGA CCTCGCCCGA GGCTGTGCGC CAACTCTGCA AATCCTTCCA GGCGGGGGCC
TGA
 
Protein sequence
MANLVLSSPL KGWVAPLDEA PDAVFAERML GDGLAIDPLG SSLHAPCDGR VIAVHATRHA 
VTLRADNGAE ILMHVGLETV GLGGEGFEVH VKDGQAVKAG DKLISFDLDL LSQRAKSLIT
PVVITNPDAF RIVRRAQDHA ASVGDFLMEL APVGGAGAVL AVAGGEVSRE VVVPLRHGVH
ARPAARIAEV AREFAADVAI VAVNRRASAK SPVGVMSLAI HHGDRISVVA SGPDADFAAA
AVVALIESGM GESAVLPLTA APASVVEAVR AVPASEPGLL RGVMAAPGLA IGQAVRFVSA
DIVVAETGAG AGAERAALEE ALIHVRAKIE KAAAQSALGG DTARKAILGA HLAFLEDPEL
TASAHRLIAG GKSAGYAWRR AVGGYVEALR GLGDRRLAER VDDLIDLERQ VLRALSGEED
EARALPPGSI LLADELLPSQ LMGLDAAQVA GLCTAKGGPT SHVAILAAAM GIPAIVAAGP
GVLDVTEGAG LILDAENGAL RVGPDAGQLA AAQTAIAARH ERKAAAQAAA HQESRTADGV
RIEVFGNVGS LNDAVAAAAN GAEGCGLLRT EFLFLERETP PDEDEQARQY QAIASALDGR
PLIIRTLDVG GDKAAPYLPI PAEENPALGL RGVRVSLWRP HLLKTQLRAI LRVKPLGQCK
IMVPMIASLD ELRAVRAVLE DAKREMGITD HVELGVMIET PAAAVTADLL AAEADFLSIG
TNDLTQYVLA MDRGNPELAA RIDALHPAVL RMIAQTCAGA AKHQRWVGVC GGLASDLVAV
PVLVGLGVTE LSATAAAVPE VKALVRTLNV PACQALARQA LDLTSPEAVR QLCKSFQAGA