Gene Information |
Locus tag | Arth_4006 |
Symbol | |
ID | 4447269 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 4522849 |
End bp | 4524534 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639691837 |
Product | phosphoenolpyruvate--protein phosphotransferase |
Protein accession | YP_833481 |
Protein GI | 116672548 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) |
TIGRFAM ID | [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCAGAACT TCCAAGGAGT AGGCGTCAGC CCCGGCCGGA TCATCGGCAC CATCCGCCAA ATGCCCAAGC CCATCAGTGA GCCGCCGGCA GGCGAACAGC TGCCCGCCGG CGTCACGGCC GAGGACGCGA CAGCCGCCCT CAAGTCGGCG TCCCAGGCTG TGCACGACGA ACTGAAGGCC CGTGCCGCCC ACGCGACGGG CGACGGCAAG GCTGTCCTGG AAGCCACGGC ACTGATGGCC AAGGACACCA TGCTGATCAA GGGCGCGGCC AAGCTCGTTG CCCGAGGCGT CTCGGGTGAG CGCGCCATCT GGGAGTCCGG CTCCTCCGTC TCGGAAATGC TTCACAACCT GGGCGGCTAC ATGGCCGAGC GCGCCACGGA CGTCCTGGAC GTGCGCGCAC GGATCGTCGC CGAGCTGCGG GGCGTGCCCG CCCCCGGCAT CCCCGCCTCC AACACGCCGT TCGTCCTGGT GGCCGAGGAC CTGGCCCCCG CCGACACCGC CACCCTGGAC CCGAACAAGG TCCTGGCGCT CGTCACGGCA GGCGGCGGCC CCCAGTCCCA CACTGCCATC ATCGCCCGCT CCCTCGGCCT TCCTGCCGTC GTCGCCGCAG TAGGCGTGGA CGAGCTCCCG GACGGCATGG AGGTCTATGT GGACGGCGCC GCCGGCACCG TCACGTCCGA GCCCGACCAA TCCTTGCGTG CCGCAGCGGA CGCATGGGCG GCCACAGCTT CCCTGCTGGC CGAGTTCAGC GGCACGGGCG CGACGGCGGA CGGCCACCTG GTGCCTCTCC TCGCCAACGT AGGCGGCGGC AAGGATGCCG AGGCGGCCGC GAAGCTCGGC GCCCAGGGGG TGGGACTGTT CCGTACCGAA TTCTGCTTCC TGGAACGGGA CACCGAGCCC ACCGTGGAGG AACAGGCAGC AGCCTACAAG AGCGTCTTTG ATGCCTTCCC GGGCAAGAAG GTGGTGCTGC GGACGCTCGA TGCCGGCGCC GACAAGCCGC TCCCCTTCCT GACTGACTCC ACCGAGCCCA ACCCCGCACT GGGCGTCCGC GGCTACCGCA CGGACTTCAC CACGCCGGGC GTGCTGGACC GCCAGCTGGA AGCAATCGCC CTGGCCGAGA AGCAGTCCGA AGCGGACGTG TGGGTCATGG CCCCGATGAT CTCCACGGCG GAAGAGGCCG CCCGCTTCGC CTCCATGTGC GCCGATGCCG GGATCAAAAC CCCGGGCGTG ATGGTGGAGG TCCCCTCTGC CGCGCTGACG GCCGAGGCCA TCCTTCGGGA AGTGGAATTC GCCAGCCTGG GAACCAACGA CCTCACGCAG TACGCCATGG CCGCCGACCG CCAACTCGGC CCCTTGGCCG CGCTGAACAC GCCCTGGCAG CCCGCCGTGC TGCGCCTCGT CGGCCTCACC GTTGAGGGCT CGCGCGCAGA AGGTCACAAC AAACCCGTGG GCGTCTGCGG CGAGGCTGCC GCGGATCCGG CCCTCGCCGT CGTGCTGACC GGGCTGGGCG TCACCACACT GTCCATGACC GCACGCTCCC TTGCGGCCGT GGCCGCCGTG CTCAAGACCG TCACGCTGGC CGAAGCGCAG GACCTGGCCA AATTGGCGCT GTCCGCACCG AGTGCCACTG AAGCCCGGGC CTGGGTGCGC GAGAAGCTGC CCGTGCTCGC CGAACTCGGC CTCTAA
|
Protein sequence | MQNFQGVGVS PGRIIGTIRQ MPKPISEPPA GEQLPAGVTA EDATAALKSA SQAVHDELKA RAAHATGDGK AVLEATALMA KDTMLIKGAA KLVARGVSGE RAIWESGSSV SEMLHNLGGY MAERATDVLD VRARIVAELR GVPAPGIPAS NTPFVLVAED LAPADTATLD PNKVLALVTA GGGPQSHTAI IARSLGLPAV VAAVGVDELP DGMEVYVDGA AGTVTSEPDQ SLRAAADAWA ATASLLAEFS GTGATADGHL VPLLANVGGG KDAEAAAKLG AQGVGLFRTE FCFLERDTEP TVEEQAAAYK SVFDAFPGKK VVLRTLDAGA DKPLPFLTDS TEPNPALGVR GYRTDFTTPG VLDRQLEAIA LAEKQSEADV WVMAPMISTA EEAARFASMC ADAGIKTPGV MVEVPSAALT AEAILREVEF ASLGTNDLTQ YAMAADRQLG PLAALNTPWQ PAVLRLVGLT VEGSRAEGHN KPVGVCGEAA ADPALAVVLT GLGVTTLSMT ARSLAAVAAV LKTVTLAEAQ DLAKLALSAP SATEARAWVR EKLPVLAELG L
|
| |
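The numeric fields in this record can be cross-checked against one another. The following is a minimal Python sketch, not part of the source database, that assumes the start and end coordinates are 1-based and inclusive (which matches the reported gene length) and that the CDS ends in a single stop codon. The function names `gene_length`, `expected_protein_length`, and `gc_percent` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values copied from the record above.
length = gene_length(4522849, 4524534)
protein_len = expected_protein_length(length)
print(length, protein_len)  # prints: 1686 561
```

Passing the full "Gene sequence" field to `gc_percent` should give a value comparable to the reported GC content, since the function strips the spaces between the 10-base blocks before counting.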