Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_2648 |
Symbol | |
ID | 4028430 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 2960801 |
End bp | 2963677 |
Gene Length | 2877 bp |
Protein Length | 958 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637967856 |
Product | phosphoenolpyruvate--protein phosphotransferase |
Protein accession | YP_574694 |
Protein GI | 92114766 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) [COG1925] Phosphotransferase system, HPr-related proteins [COG4668] Mannitol/fructose-specific phosphotransferase system, IIA domain |
TIGRFAM ID | [TIGR01003] Phosphotransferase System HPr (HPr) Family [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.530109 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGACAC TGACAAGTGA CGACGTTCTG CTCGACCGGC ACGCCGACGA CTGGCGCGAT GCCCTGGACC AGGCCGCCGA GGCGCTGGTC GAGGCCGGAC GCGTGGCGCC CGGTTACCGT GACGGCCTGC ACGCGCGCGA GGCGCAGTCG TCGACCTATC TCGGCAATGG CATTGCCATT CCCCACGGCA CCCCCGAGAG CCGCGAACAC GTCAAGACGA CCGGCGTACG GGTGTTGCAG TTTCCGCGTG GCATCGAGTG GCACGACGGT CAGCGGGTGA CGCTGCTGGT GACCATCGCC GCCCAGAGCG ATGAGCACCT CGATATCCTG CGTCAGCTCA CGCACGTGCT CGACCGCGAT GGCGTGGCCG AGCGCCTGGC GGCGGCGGAT AGCCGTGAAG AGGTCATTAC GCTGCTGTCC AGGGCGCCGG TCGAGGCCCG CCTGGATGCC GACACGCTCT GCCTGGGATT CCCCGCGCGT GATCGTTTCG AGCTGGCACT GGCCGCCGCG GCGCGCTTGC GCCAGGTGGG CAGCGTGGAC AGCAGCTTCG TGGCCGAGAT CAATACGCTG GAGCCGGTGG CGCTGGGCCA GGGGCTGTGG CTGGTGAGCA GTGCCCGTGG CGTGGTCACG CCGGCGCTGG GCGTGGCCAC GCCCGAGCAG GCATTCGTGG GCCCCAAGGG GCCGGTCAAT GCGGTGTTCT GCCTGGCGGC CCAGGGCGAT GCCCATCGTG GGCTGCTCGA ACGCCTGGGC ATCCTGCTCG ACGCGGGGGA CGGCGAAAGC CTGGCCGACG CCGACGCGGC GACATTGCTG GCGCGCCTGT CGGGGGAGTC CGCCGACGCC GATACCGCAC GCGCCACGGT GCTCAACGCA CATGGGCTGC ATGCGCGGCC GGCCAAGCAG TTGGTGCAGG TCGCTCGGCG TCAGCGCGTG CCGGTCAAGG TGCGCCTGCT GGAGGGCTCG GGTCAGGCGG TGTCGGCGGC GAGTCTCACC AAGGTCATCG GTCTGGGCGC GCGGCGCGGT CAAGTGCTGG TGTTTTCCGC CGAGGGCGAG GGGGCCGGGG AGGCGCTCGA CGCGCTCGTC GCGGCCGTCA AGGATGGGCT CGGCGAGCAT GTCACGCCAC TCCAGGAAAC GCCGCTCGAG TCGCCCGCCG CCGAGGACGA GCCTCTTCCC GCGCCCCTGG AAGACGATGT CGCGCATCCC GCGGTCCCCG CGTCGCCGGG GCTGGCGATC GCGCCGGTTT TCGTGATGCG TACGCCGCGT TTCGAGTACC CCGAGCGCGC CAGCGACCTG GATGACGCGC GCCGCGGCGA CAGCGACGCG CAACTGGCGC GTCTCGATGC CGCGATCGAG GAGGCCGCCG AGCAGTTGCG TGCCCTGGTA CGTACCGCGC AGGGAGGCGA GGTCGCCGAG ATCCTGTCGA TGCACGAGGA AATGCTCGAC GATCCCGAGC TGCGTGAAGC GGCGCACGAA TCGCTGAATA CCGGCATGAG TGCCGAGGCG GCATGGTGGT CGGCCATCGA TACCGCCGCC CGCGCCCAGG AGAATCTCGC CGATCGCTTG CTCGCCGAAC GTGCCGCCGA CCTGCGCGAT GTCGGCCGTC GCGTGCTGGG AATCCTGTGC GGCGTGCGCC TGCCCACGCC GCCGGGGACG CCCTACGTGC TGGTCACCGA CGATGTCGGC CCGTCCGATG TGGCGCGGCT GGATACCAGC AAGGTGCGGG GGCTGGTCAC CGCCCGCGGC GGCGCCACCT CGCACAGCGC GATTCTTGCC CGGGCACTGG GCATCCCCGC CGTGGTGGGG GCCGGCGAGC GGGTGCTGAC GCTGGTCAAC GATACCGAGA TCATCGTCGA TGGCGAGCGT GGCCGGGTGA TCCCGGCGCC TTCCGCCGAG CGTCGCTCGC GTACCGAGCT GCGCTTGAAG GAGCACGAGA TGCGCGAGCG CGAGGCGTAT GCCGCGCGCC ACGAGGAGGG CCGGACCCAG GATGGCCACC GTGTCGAGGT CGCCGCCAAC CTGGGCAACA CGGCCCATGC GGCGGATGCC GTCGAACGCG GCGCCGAGGG CGTGGGGCTG CTGCGTACCG AATTCGTGTT CATGGCGCAT CCCGACGCCC CGGATCTGGA CACCCAGATC GCCGAATATC GCCAGGCCAT CGACGCGCTC GACGGGCGCC CGCTGGTGGC ACGCACGCTG GACGTCGGCG GCGACAAGCC CCTGCCGTAC TGGCCACTGC CCCAGGAAGA CAATCCGTTC CTCGGCTTGC GCGGCATTCG TCTGGCCTTG ACGCGTCCCG AGGTGCTCGA GACGCAGCTG CGCGCCCTGC TGACCGCCGC CGGCGATCGG CCGCTGCGCA TCATGTTCCC GATGGTCAAG GACATCGCCG AGTTCCGCGC CGCTCGCGAG ATCTTCGACC GCGTGCAGGC CGAGGTTCAG GCCGGCGACG TGCAATTGGG GGTGATGATC GAGATTCCTT CGTGCGCGTT GCTGGCGCCG AGCCTGGCCG CCGAGGTGGA TTTCTTCTCC ATCGGCACCA ACGACCTGAC CCAGTACACC CTGGCCATCG ACCGCGGCCA TGCCGAGCTG TCGGCCCAGG CCGACGGTTT GCACCCGGCG GTGCTGGCGC TGATCCGCAT GACCGTCGAT GCGGCGCACG CCCAGGGCAA ATGGGTGGGC GTGTGCGGCG AACTGGCCAG CGATGCCCAG GCGGTGGCCG TCCTGGTGGG GCTGGGCGTC GACGAGCTGT CGGTGTCGGC AAGGCAGGTG CCCATGGTCA AGGCGCGGCT GCGCGAGTTG ACCCTCGCCA CCGCGCGCCA GCACGCCGAG ACCGCCTTGC GTCAGGCGAC CAGCGACGCC GTGCGCGAAG CGCTGGAGGC CCTCTGA
|
Protein sequence | MLTLTSDDVL LDRHADDWRD ALDQAAEALV EAGRVAPGYR DGLHAREAQS STYLGNGIAI PHGTPESREH VKTTGVRVLQ FPRGIEWHDG QRVTLLVTIA AQSDEHLDIL RQLTHVLDRD GVAERLAAAD SREEVITLLS RAPVEARLDA DTLCLGFPAR DRFELALAAA ARLRQVGSVD SSFVAEINTL EPVALGQGLW LVSSARGVVT PALGVATPEQ AFVGPKGPVN AVFCLAAQGD AHRGLLERLG ILLDAGDGES LADADAATLL ARLSGESADA DTARATVLNA HGLHARPAKQ LVQVARRQRV PVKVRLLEGS GQAVSAASLT KVIGLGARRG QVLVFSAEGE GAGEALDALV AAVKDGLGEH VTPLQETPLE SPAAEDEPLP APLEDDVAHP AVPASPGLAI APVFVMRTPR FEYPERASDL DDARRGDSDA QLARLDAAIE EAAEQLRALV RTAQGGEVAE ILSMHEEMLD DPELREAAHE SLNTGMSAEA AWWSAIDTAA RAQENLADRL LAERAADLRD VGRRVLGILC GVRLPTPPGT PYVLVTDDVG PSDVARLDTS KVRGLVTARG GATSHSAILA RALGIPAVVG AGERVLTLVN DTEIIVDGER GRVIPAPSAE RRSRTELRLK EHEMREREAY AARHEEGRTQ DGHRVEVAAN LGNTAHAADA VERGAEGVGL LRTEFVFMAH PDAPDLDTQI AEYRQAIDAL DGRPLVARTL DVGGDKPLPY WPLPQEDNPF LGLRGIRLAL TRPEVLETQL RALLTAAGDR PLRIMFPMVK DIAEFRAARE IFDRVQAEVQ AGDVQLGVMI EIPSCALLAP SLAAEVDFFS IGTNDLTQYT LAIDRGHAEL SAQADGLHPA VLALIRMTVD AAHAQGKWVG VCGELASDAQ AVAVLVGLGV DELSVSARQV PMVKARLREL TLATARQHAE TALRQATSDA VREALEAL
|
| |