Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4844 |
Symbol | |
ID | 5707623 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5500178 |
End bp | 5501806 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641274240 |
Product | phosphoenolpyruvate-protein phosphotransferase |
Protein accession | YP_001539585 |
Protein GI | 159040332 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) |
TIGRFAM ID | [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.487304 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0043079 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTGAGC TACTGCGTGG CATCGGCGTC AGCCCGGGGA GCGCAGCCGG CCCGGCGTAC CGGATGAGCC CACCACCACC ACCGCCGCCC GAGCCGGCCG CAGTGGTTGA TCCGGACGCC GAGGTCGACC GGGCGGTGGC CGCGCTGAGT ACCGTGGCCG CGGACCTGAC CCGCCGTGCC GAGGGTGCGG CAGCCCGGGC GGCGGCCGAT GTTCTGCGGG CACAGGCGAT GATGGCGCAG GATCCGGAGC TGTCGGCCGC TGTGGTCGCG CAGGTGCGGG CCGGGGCCAG CGCACCGGTC GCCGTCGACC GGGCACTGGC CGTGCACCGG GAGGCGTTCC TGGCCGCAGG GGGCTATCTC GCCGAGCGCG TCACCGATCT GGACGACATT CGGGACCGGG TCGTCGCCGC CTGCCTGGGG CTGCCGCCAC CCGGTATCCC CGATCCGGGC CACCCGTTTG TGCTGATCGC CCGTGACCTC GCGCCGGCGG ACACCGCCGG CCTGGATCCG GAGCAGGTGC TGGCGCTGGT CACCGAGGAC GGTGGGCCGA CCAGCCACAC CGCGATTCTG GCCCGAGCCG CTGGCCTACC GGCTGTGGTC CGGTGCCCCG GTGCGATGGC CGTTGCCGAC GGGGTCGAGG TCACCGTCGA CGGCTCGACC GGGCAGGTCG CGGTGGGTGT GGATCACGAC ACCGTCATCG CCACCCGGGT CTCCGAGCAG CGGCGTCGGC GACGACTCGC CACGACGCGG GGGCCCGGCC GCACCGCCGA CGGCCACCCC GTGGCCCTGT ACGGCAACAT CGGCTCGGCT GAGGATGTGG ACGGTGAGCT GGAAGGCGTC GGCCTGTTCC GGACCGAACT GCTCTACCTG CATCGCACCG ATCCGCCCGG ACGCGACGAG CAGGTAGCCG CCTACGCCGA GGTCTTCGCC GCGCTCCCCG GGCGCAGAGT CATCGTGCGG ACCCTCGACG CCGGTGCCGA CAAGCCCCTG CCCTTCCTCG CGGCCGGTGA GGAACCGAAC CCGGCGCTGG GCGTACGCGG CCTGCGGTTG GCCCGGCGGC GGGCGGATGT GCTCCATCTC CAGCTCGAGG CGATCGCACA GGCGGCCCGG GACACGGCAG CCGAGGTGTG GGTGATGGCG CCGATGGTGG CGACGGTCGC GGAGGCCGCC TGGTTCGCCG CCGCCTGCCG GGACGCCGGC CTTCCCACAG CGGGAGCGAT GGTCGAGGTG CCGGCGGCAG CCCTGCGGGC CCGCTCGTTG CTGTCGGTGG TGGATTTCCT CAGCATCGGC ACCAACGACC TGAGCCAGTA CACCTTCGCC GCCGACCGGC AGTGTGGCGA CCTGGCCGAC CTGCTCGACC CGGCACAGCC CGCGTTGCTC GAACTCATCT CCGGCTGTGC CGCCGCCGGC ATGGCCGCCG GCAAACCGGT CGGTGTCTGT GGCGAGGCCG CGGCCGACCC GAGGATCGCG CCGGTGCTCG TCGGCCTTGG CGTGACCAGC CTGTCCATGG CCCCACGGGC AGTGCCCGAC GTGCGGGAGG CGCTTGCCGC CCACACGCTC GCCGACTGTC GGCAACTCGC CGCCGAGGCG TTGTCCGCGG CCGGCACCGC ACCGCTCACC GTCCCCTGA
|
Protein sequence | MAELLRGIGV SPGSAAGPAY RMSPPPPPPP EPAAVVDPDA EVDRAVAALS TVAADLTRRA EGAAARAAAD VLRAQAMMAQ DPELSAAVVA QVRAGASAPV AVDRALAVHR EAFLAAGGYL AERVTDLDDI RDRVVAACLG LPPPGIPDPG HPFVLIARDL APADTAGLDP EQVLALVTED GGPTSHTAIL ARAAGLPAVV RCPGAMAVAD GVEVTVDGST GQVAVGVDHD TVIATRVSEQ RRRRRLATTR GPGRTADGHP VALYGNIGSA EDVDGELEGV GLFRTELLYL HRTDPPGRDE QVAAYAEVFA ALPGRRVIVR TLDAGADKPL PFLAAGEEPN PALGVRGLRL ARRRADVLHL QLEAIAQAAR DTAAEVWVMA PMVATVAEAA WFAAACRDAG LPTAGAMVEV PAAALRARSL LSVVDFLSIG TNDLSQYTFA ADRQCGDLAD LLDPAQPALL ELISGCAAAG MAAGKPVGVC GEAAADPRIA PVLVGLGVTS LSMAPRAVPD VREALAAHTL ADCRQLAAEA LSAAGTAPLT VP
|
| |