Gene Information |
Locus tag | Rcas_2048 |
Symbol | |
ID | 5539526 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 2624253 |
End bp | 2626769 |
Gene Length | 2517 bp |
Protein Length | 838 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640894182 |
Product | phosphoenolpyruvate-protein phosphotransferase |
Protein accession | YP_001432153 |
Protein GI | 156742024 |
COG category | [G] Carbohydrate transport and metabolism; [S] Function unknown |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria); [COG1925] Phosphotransferase system, HPr-related proteins; [COG3412] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01003] Phosphotransferase System HPr (HPr) Family; [TIGR01417] phosphoenolpyruvate-protein phosphotransferase; [TIGR02364] dihydroxyacetone kinase, phosphotransfer subunit |
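
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2624253
END_BP = 2626769
STATED_GENE_LENGTH_BP = 2517
STATED_PROTEIN_LENGTH_AA = 838


def gene_length(start, end):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming one terminal stop codon is excluded."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)

# Compare the derived values with the values stated in the record.
print(length_bp == STATED_GENE_LENGTH_BP, length_aa == STATED_PROTEIN_LENGTH_AA)
```

Under those assumptions the derived values agree with the record: 2517 bp is 839 codons, and dropping the stop codon leaves 838 residues.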
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.488537 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAAGTA TTGTGCTCGT TTCGCACAGT TCGTTGCTGG CGGCCGGCAT CGTCGAAATG GCGCGCATGG TTATGCAACA GGCGCCGGTG GCGATTGCTG TGGCGGCTGG CGCCGATGAT CCAGGACATC CATTGGGGAC CGATGCTGCG AAGATTCGTC AGGCGATCGA AGAGGTGTAT AGCGACGACG GCGTGCTGGT GCTGATGGAT CTGGGCAGCG CGGTGTTGAG CGCAGAAATG GCGGTTGATT TTCTTCCTGA ACACAAGCGC GCCAATGTGA GATTATGCGC TGCGCCGATT GTCGAAGGAA CGATTGCCGC AGTGGTGCAG GCTAGCCTGG GCGCATCGCT GGATCGCGTC GCCGCCGAGG CGCTGGAGGC GCTGGCGGGA AAAGTCGAGA GCCTGAGCGA TCAGGGGCAG GCATCCGGCG CTGCGGCGCC ATCCCCATCG ACCGACGCAA CCGCCGAGGT GCTGCACGCA CAACTGACGG TGACGAACCG CCTGGGGTTG CACGCCCGAC CGGCGGCATT ATTGGTGCAG ACGGCAGGGC GCTTTTGTGC CGATGTTCGT CTCGCGCGCG TTGGGCAGGA AACTCGTCAG GTCAATGCAA AGAGTTTCAA TGCTGTCGCA TCGCTGGGCA TCCGCCAGCA TGAGATGATC ACCGTTTCGG CGCGCGGACC GGACGCCGCC GAGGCGCTTG CGGCATTGCA ACAACTCGCT GCGGATCAGT TCGGCGAAGC CGACGAACTG CTGGAAGCGC AGCCATCAGC GCCACCGTCA CCGATGGCAG AAGCGCTCAC CGGCGCGTTG CGTGGTGCTG CCGCCTCCCC AGGGTATGCC ATCGGTCCGG CAGTGGTGCT GCGTCAGGTT GAACCACAGA TCGAACGGCG CATCATCAGT GATCCAGATA CTGAAATGTC TCGTTTACAG GCGGTTTTGG ACGCAGTTCG CGAATCGACG CGCGTGCTGC GCGACCAGAT TGCGCGGCAG CATCCCTACG AGGCGGCGAT CTTTGATGCG TATCTGATGT TTTTGACCGA TCCCGATATT CTGGCGCGGG TGCGACAGAT TATCGCACGC GACCGCGTTT GCGCCGAATG GGCATGGCGC GAGGCGGTGA ATGAATCTGC CAGGGCATTC GAGTCTATCG AGGATGAATA CATGCGGGCG CGGGCAGTCG ATATTCGCGA CATTGGCAGG CAGGTGTTGA GCCGTCTGAC CGGGCAGACT CGATCATTCA GTCTGGATCG GTCGGGTATC GTGATTGCGT CCGATCTTTC ACCATCCGAT ACGGCGCACC TTGATCGGTC AATGGTGTTG GGCATCTGCA CAGAACGGGG TAGCCCGACC TCGCACAGCG CCATTCTGGC GCGTACCCTC GGCATCCCCG CCGTTGTGGG AGTAGGCGCC GCCATCACGC AGGTTGCTCC TGGTACGCCG CTGGTGATTG ATGGGTATGA GGGGTTGGTC TGGATCGCGC CCGATGAGTC GATTGTTGTG GCATATGCCA ATCGGGAAGC GCAATGGCGG GCAACGCAGG AACAGGCGCG ACAATCGAGC ACCGCACCGG CCGTGACGAA GGACGGCATG CACATCGAGA TTGCCGCCAA TATCGGTAGC CTGGCGGATG CGCGCGTCGC CGTTGAGAAC GGCGCCGATG GCGTGGGACT GCTCCGTACA GAATTCCTCT TTCTTGATCG CACGGCGGCG CCTGATGAAA ACGAGCAGTA TGAGGTGTAC GCTGCGATTG CGCGGGTGAT GGGGGAGCGT CCGGTGGTCG TGCGCACCCT CGATGTGGGA 
GGTGACAAGC CGCTTGCGTA CATTTCGCTG GAGCGTGAAG ACAACCCGTT TCTTGGCCAA CGCGCCATCC GGCTTTGCCT GAATCAACCA GATCTCTTTG CGACTCAACT GCGCGCTATT TTGCGCGCGA GCGCCGGACA TCGACTCAAG GTGATGTTTC CAATGATTGC GGATATCGGC GAGTTGCGCC GCGCACGTGC GGTCCTGGAG TCGGTGCTTG CCGGGTTGCA CACACAATCT GTGCCGGTGG CGGATGCTGT CGAGGTTGGG ATAATGGTCG AGGTGCCCTC AGCCGCATTG CTCGCACACG TCTTTGCGCC GGAAGTCGAT TTTTTCAGCA TTGGCTCGAA CGATCTGGTG CAGTATACGC TGGCAGCAGA ACGGGGCAAT GCGGCGGTTG CGCATTTGCA GGACGGTCTG CATCCGGCGG TGTTGATGCA AATCCAGCGC GTGGTTCAAA GCGCGCAACA TGCCGGGAAA TGGGTGAGCG TCTGCGGCGA ACTGGCTGCC GATCATGATG CTGTGCCGGT CCTGATCGGA TTAGGCGTGC AGAAACTGAG CATGGCGCCT GGCGCCATTC CGCACATCAA GGCGCTTATT CGGCGACTGA CGCTGCAAGA AGCGCGGCAG TGGGCGAGTC AGGCGCTGGC AATGGAGTCG GCGGAAACAG TGCGTCGGTT TATCCGTTCG CGGTTGGAGG CGCTTGTTGG TGAATAG
|
Protein sequence | MVSIVLVSHS SLLAAGIVEM ARMVMQQAPV AIAVAAGADD PGHPLGTDAA KIRQAIEEVY SDDGVLVLMD LGSAVLSAEM AVDFLPEHKR ANVRLCAAPI VEGTIAAVVQ ASLGASLDRV AAEALEALAG KVESLSDQGQ ASGAAAPSPS TDATAEVLHA QLTVTNRLGL HARPAALLVQ TAGRFCADVR LARVGQETRQ VNAKSFNAVA SLGIRQHEMI TVSARGPDAA EALAALQQLA ADQFGEADEL LEAQPSAPPS PMAEALTGAL RGAAASPGYA IGPAVVLRQV EPQIERRIIS DPDTEMSRLQ AVLDAVREST RVLRDQIARQ HPYEAAIFDA YLMFLTDPDI LARVRQIIAR DRVCAEWAWR EAVNESARAF ESIEDEYMRA RAVDIRDIGR QVLSRLTGQT RSFSLDRSGI VIASDLSPSD TAHLDRSMVL GICTERGSPT SHSAILARTL GIPAVVGVGA AITQVAPGTP LVIDGYEGLV WIAPDESIVV AYANREAQWR ATQEQARQSS TAPAVTKDGM HIEIAANIGS LADARVAVEN GADGVGLLRT EFLFLDRTAA PDENEQYEVY AAIARVMGER PVVVRTLDVG GDKPLAYISL EREDNPFLGQ RAIRLCLNQP DLFATQLRAI LRASAGHRLK VMFPMIADIG ELRRARAVLE SVLAGLHTQS VPVADAVEVG IMVEVPSAAL LAHVFAPEVD FFSIGSNDLV QYTLAAERGN AAVAHLQDGL HPAVLMQIQR VVQSAQHAGK WVSVCGELAA DHDAVPVLIG LGVQKLSMAP GAIPHIKALI RRLTLQEARQ WASQALAMES AETVRRFIRS RLEALVGE
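
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how one might strip the spacing, compute GC content, and translate the gene. It assumes the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the tables differ in permitted start codons, which does not matter here because the gene begins with ATG). The helper names are hypothetical, and only the first and last blocks of the gene sequence are used as sample input.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these codon assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three and last three blocks of the gene sequence in this record.
first_blocks = "ATGGTAAGTA TTGTGCTCGT TTCGCACAGT"
last_blocks = "CGGTTGGAGG CGCTTGTTGG TGAATAG"

print(translate(first_blocks))
print(translate(last_blocks))
```

Because the full gene is 2517 bp, a multiple of three, the final 27 bases are in frame, so both fragments should reproduce the corresponding ends of the protein sequence. Passing the complete gene sequence to `gc_fraction` would give the value to compare against the GC content field.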