Gene Arth_4006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4006 
Symbol 
ID4447269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4522849 
End bp4524534 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content70% 
IMG OID639691837 
Productphosphoenolpyruvate--protein phosphotransferase 
Protein accessionYP_833481 
Protein GI116672548 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) 
TIGRFAM ID[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCAGAACT TCCAAGGAGT AGGCGTCAGC CCCGGCCGGA TCATCGGCAC CATCCGCCAA 
ATGCCCAAGC CCATCAGTGA GCCGCCGGCA GGCGAACAGC TGCCCGCCGG CGTCACGGCC
GAGGACGCGA CAGCCGCCCT CAAGTCGGCG TCCCAGGCTG TGCACGACGA ACTGAAGGCC
CGTGCCGCCC ACGCGACGGG CGACGGCAAG GCTGTCCTGG AAGCCACGGC ACTGATGGCC
AAGGACACCA TGCTGATCAA GGGCGCGGCC AAGCTCGTTG CCCGAGGCGT CTCGGGTGAG
CGCGCCATCT GGGAGTCCGG CTCCTCCGTC TCGGAAATGC TTCACAACCT GGGCGGCTAC
ATGGCCGAGC GCGCCACGGA CGTCCTGGAC GTGCGCGCAC GGATCGTCGC CGAGCTGCGG
GGCGTGCCCG CCCCCGGCAT CCCCGCCTCC AACACGCCGT TCGTCCTGGT GGCCGAGGAC
CTGGCCCCCG CCGACACCGC CACCCTGGAC CCGAACAAGG TCCTGGCGCT CGTCACGGCA
GGCGGCGGCC CCCAGTCCCA CACTGCCATC ATCGCCCGCT CCCTCGGCCT TCCTGCCGTC
GTCGCCGCAG TAGGCGTGGA CGAGCTCCCG GACGGCATGG AGGTCTATGT GGACGGCGCC
GCCGGCACCG TCACGTCCGA GCCCGACCAA TCCTTGCGTG CCGCAGCGGA CGCATGGGCG
GCCACAGCTT CCCTGCTGGC CGAGTTCAGC GGCACGGGCG CGACGGCGGA CGGCCACCTG
GTGCCTCTCC TCGCCAACGT AGGCGGCGGC AAGGATGCCG AGGCGGCCGC GAAGCTCGGC
GCCCAGGGGG TGGGACTGTT CCGTACCGAA TTCTGCTTCC TGGAACGGGA CACCGAGCCC
ACCGTGGAGG AACAGGCAGC AGCCTACAAG AGCGTCTTTG ATGCCTTCCC GGGCAAGAAG
GTGGTGCTGC GGACGCTCGA TGCCGGCGCC GACAAGCCGC TCCCCTTCCT GACTGACTCC
ACCGAGCCCA ACCCCGCACT GGGCGTCCGC GGCTACCGCA CGGACTTCAC CACGCCGGGC
GTGCTGGACC GCCAGCTGGA AGCAATCGCC CTGGCCGAGA AGCAGTCCGA AGCGGACGTG
TGGGTCATGG CCCCGATGAT CTCCACGGCG GAAGAGGCCG CCCGCTTCGC CTCCATGTGC
GCCGATGCCG GGATCAAAAC CCCGGGCGTG ATGGTGGAGG TCCCCTCTGC CGCGCTGACG
GCCGAGGCCA TCCTTCGGGA AGTGGAATTC GCCAGCCTGG GAACCAACGA CCTCACGCAG
TACGCCATGG CCGCCGACCG CCAACTCGGC CCCTTGGCCG CGCTGAACAC GCCCTGGCAG
CCCGCCGTGC TGCGCCTCGT CGGCCTCACC GTTGAGGGCT CGCGCGCAGA AGGTCACAAC
AAACCCGTGG GCGTCTGCGG CGAGGCTGCC GCGGATCCGG CCCTCGCCGT CGTGCTGACC
GGGCTGGGCG TCACCACACT GTCCATGACC GCACGCTCCC TTGCGGCCGT GGCCGCCGTG
CTCAAGACCG TCACGCTGGC CGAAGCGCAG GACCTGGCCA AATTGGCGCT GTCCGCACCG
AGTGCCACTG AAGCCCGGGC CTGGGTGCGC GAGAAGCTGC CCGTGCTCGC CGAACTCGGC
CTCTAA
 
Protein sequence
MQNFQGVGVS PGRIIGTIRQ MPKPISEPPA GEQLPAGVTA EDATAALKSA SQAVHDELKA 
RAAHATGDGK AVLEATALMA KDTMLIKGAA KLVARGVSGE RAIWESGSSV SEMLHNLGGY
MAERATDVLD VRARIVAELR GVPAPGIPAS NTPFVLVAED LAPADTATLD PNKVLALVTA
GGGPQSHTAI IARSLGLPAV VAAVGVDELP DGMEVYVDGA AGTVTSEPDQ SLRAAADAWA
ATASLLAEFS GTGATADGHL VPLLANVGGG KDAEAAAKLG AQGVGLFRTE FCFLERDTEP
TVEEQAAAYK SVFDAFPGKK VVLRTLDAGA DKPLPFLTDS TEPNPALGVR GYRTDFTTPG
VLDRQLEAIA LAEKQSEADV WVMAPMISTA EEAARFASMC ADAGIKTPGV MVEVPSAALT
AEAILREVEF ASLGTNDLTQ YAMAADRQLG PLAALNTPWQ PAVLRLVGLT VEGSRAEGHN
KPVGVCGEAA ADPALAVVLT GLGVTTLSMT ARSLAAVAAV LKTVTLAEAQ DLAKLALSAP
SATEARAWVR EKLPVLAELG L