Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2857 |
Symbol | |
ID | 4444689 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 3211502 |
End bp | 3214180 |
Gene Length | 2679 bp |
Protein Length | 892 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639690679 |
Product | phosphoenolpyruvate synthase |
Protein accession | YP_832336 |
Protein GI | 116671403 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0574] Phosphoenolpyruvate synthase/pyruvate phosphate dikinase |
TIGRFAM ID | [TIGR01418] phosphoenolpyruvate synthase [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.321665 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTTACA TCAATGATTT CAGCGAGGTC GGCCGGGAGG ACGTCGCGAC GGCGGGCGGC AAGGGTGCCG GGCTGGGCGA ACTCGTTCGG GCGGGCGCGC CGGTACCCCC GGGATTCCTG ATCAACACCG GTGCCTACGA GCTGTTCGTC CGGGACAACC AGCTGGCCGG GCGCATCCAG GAGTACGCCG CCTTGCCGGC GGCCGCCACG TCCCGCGACT ACGAGGAAGC TTCCGGGCAG ATCCGCGCAC TCTTTGCTGC TGGCACAATG CCGGAGGCGG TAGCCGCCGA AACTCGGGAT GCATACCGCC GGCTGGGCTC AGTCGCGGGC ACCGGTCCTG GTCCAGGCAC GGAGACAGCG GTGGCGGTGC GTTCCTCCGC CACCGCCGAA GACCTTGCCT CGGCCAGCTT CGCGGGCCAG CAGGACACCT ACCTCAACGT CCGTGGCGCC GAGGCCCTCC TGGATGCGGT CATCAACTGC TGGGGCTCCC TCTGGACCTC CCGGGCGATG GCCTACCGTG CCCGGGAGGG CATCCGCCCG GACCAGGTCC GGCTGGCAGT GGTGGTCCAG CACATGGTGG CCGCGGACGC CGCCGGAGTT ATGTTCACCG CCAACCCGGC CTCCGGCCGC CGCGACCAGA TTGTCCTCGC TGCCGCATGG GGGCTTGGCG AATCCGTGGT CAGCGGAGCT GTCTCCACCG ACGACGTCGT GGTCGAGGCT GCCACCGGTA AGGTGGTTTC CCGCCGGACA GCCGACAAAG CGGTCATGAC GGCCTACGCT GACCGAGGCA CGCGCGAGGA ACCCGTCCCC GAATCCCGCA GGCATCAACC CGTGCTGGAC GACGCTGCGG CGGCAACGCT GGCCGGCTAC GGAACCCGCA TCGCCCGGCA CTTTGGCTCC CCGCAGGACA TCGAGTGGGC ACGGGCCGGC GGCGAATTCT TCATCCTGCA GTCACGACCC ATCACTGCAT TGCCCGAGCC CGCGGGGGAA ACCCCGCGCG ACTGGAGCGT GCCGTACCCC AAGGGGATGT ACTTCCGGGC CAGCATCGTG GAGCAACTTC CCGATCCGCT TTCACCCTTG TTTGCCGACC TCATTGACGG TTCGGTGTCG CGGTCCCTGA ACGCCCTGAT GGACCAGGCG TTCGGTAAAA GCAGCCTCCG GGACGGCGAT CTCGGGCTGC CTGCCATCAA CGGATACGCC TATTACTACT ACCGCACGGC CGCGTTGTGG CGGCTGATGG GAAAGACGCC CGCCGCTGTC CGCGCCCTCA TCCGGGGCGA AGCGCACATG GGGGTCAAGG GCTGGCGGGA CTATTCGCAT CCGCGCTATG TGCACGTGGT GGAATCCTGG TCGGCGAAGC CGGTGGCAGA CCTGACCGGA GAGGAGCTTC TGGAGGGGGT GTCTGCCCTG CTCGATGCCG GCACGGTGTA CTACACCGCC GTCCAGTCGA TCATTCCGGT CGCGACGTCG AGCGAACTCG CCTTCCAGAA GTTCTATGAC AAGTTTGTCC GGCAGGACGG CGACCCGCCG GCTGCCACGT TCCTGCTCGG CTATGAAAGC GAACCGATCC GGGCGGAAAA ATCCCTCTTC GACCTGGCCG GGTGGTCCCG CGGAATCCCG GGACTCGCCG AGGCCATCCT GGCGACGCCG ACAGAATCGC TGGCCCGGGC CCAGCTGTCA GGGACGCTTC CGAACGGGAT GGATCAAAGC TTGTGGCAGC AGTGGGAACA CCGGTTCCAG CTCCACCTGG ACCGCTTCGG CCACACCGTA TACAACCTGG ACTTCATTAA TCCCGTGCCG GCAGACGACC CGTCCCCGCT GCTGGACACG CTGAAGTTCT ACCTGCGCGG GCAGGGCAAC GACCCGCACC GGCGGCAGGA GCTTGCTGCC GCCCGCCGTG AGGACGCCAC CGCCAAGGTA CTGGCCCGGC TGCACCCGGC CCGCAGGGCG GCCTTTACGC GTCTGCTCCG GTGGGCGCAG GACCCGGCGC CGATCCGCGA AGACGCGCTC GCCGACGTCG GGCTCGGGTG GCCCCTCATG CGGCGGATGC TGCGAGAACT GGGCGAGCGG CTGCTCGCGG CCGGACTCAT CGCGGACTCC AGCGATGTGT TCTGGCTTCG ACACCAGGAA ATCAGCAGCG CCGTCGAATT CGGCCTTGCG GCCCCGAACG GTCCGGCGGC AATTACAGGT GCCGGCCGGC CGGTCCTGGC GGAGGCAGTC GCTGAGCGAC AGTTGCGCTG GCGGGGCCAG CGCAACGCCA CCGCCCCGCA GATGCTGCCG GAGATCCGGT GGCTCCAGCG GGCGCTCGCC GGAATGATGC CGGAGGGGTC CCAGGACCAG CAGGGGAACA CGATCAAGGG AGTGGGCGCC AGCCAGGGAC GGGTCAGCGC GCCGGCCCGG GTGCTGTCCG GTCCCGAGGA CTTCTACCGG ATGCAGCCGG GGGAAGTGCT GGTGGCCCGC ATCACCACTC CCGCGTGGAC ACCGCTGTTC GCCATGGCGT CGGCCGTGGT CACTGATGTT GGCGGCCCGC TGAGCCACGG CTCCATTGTG GCCAGGGAAT ACGGCATCCC CGCCGTGCTG GGCACCGGCG TGGCGACGCG CCGGATCGTG AGCGGGCAGC GGGTAGAGGT CGACGGCGCC GCCGGCACTG TAACCATTCT CCAGTCAGGG CGGGACTAG
|
Protein sequence | MVYINDFSEV GREDVATAGG KGAGLGELVR AGAPVPPGFL INTGAYELFV RDNQLAGRIQ EYAALPAAAT SRDYEEASGQ IRALFAAGTM PEAVAAETRD AYRRLGSVAG TGPGPGTETA VAVRSSATAE DLASASFAGQ QDTYLNVRGA EALLDAVINC WGSLWTSRAM AYRAREGIRP DQVRLAVVVQ HMVAADAAGV MFTANPASGR RDQIVLAAAW GLGESVVSGA VSTDDVVVEA ATGKVVSRRT ADKAVMTAYA DRGTREEPVP ESRRHQPVLD DAAAATLAGY GTRIARHFGS PQDIEWARAG GEFFILQSRP ITALPEPAGE TPRDWSVPYP KGMYFRASIV EQLPDPLSPL FADLIDGSVS RSLNALMDQA FGKSSLRDGD LGLPAINGYA YYYYRTAALW RLMGKTPAAV RALIRGEAHM GVKGWRDYSH PRYVHVVESW SAKPVADLTG EELLEGVSAL LDAGTVYYTA VQSIIPVATS SELAFQKFYD KFVRQDGDPP AATFLLGYES EPIRAEKSLF DLAGWSRGIP GLAEAILATP TESLARAQLS GTLPNGMDQS LWQQWEHRFQ LHLDRFGHTV YNLDFINPVP ADDPSPLLDT LKFYLRGQGN DPHRRQELAA ARREDATAKV LARLHPARRA AFTRLLRWAQ DPAPIREDAL ADVGLGWPLM RRMLRELGER LLAAGLIADS SDVFWLRHQE ISSAVEFGLA APNGPAAITG AGRPVLAEAV AERQLRWRGQ RNATAPQMLP EIRWLQRALA GMMPEGSQDQ QGNTIKGVGA SQGRVSAPAR VLSGPEDFYR MQPGEVLVAR ITTPAWTPLF AMASAVVTDV GGPLSHGSIV AREYGIPAVL GTGVATRRIV SGQRVEVDGA AGTVTILQSG RD
|
| |