Gene Achl_1233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_1233 
Symbol 
ID7292679 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp1357088 
End bp1358275 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content69% 
IMG OID643589639 
Productgalactokinase 
Protein accessionYP_002487313 
Protein GI220912004 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.000254056 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGAGCGCCG CACCCCACCC CACCGCCGAT TCAGCAGTCC CAGGCACACA GGACCTGGCT 
GCCCGCTTTA CCCGGGAGTT CGGCGCCGCC CCGGCAGGCG TCTGGCAGGC ACCGGGCAGG
GTCAACCTGA TCGGTGAGCA CACTGACTAC AACGAGGGCT TCGTGCTGCC CTTCGCCATC
GACCGGACTG CCCGGGTGGC TGTGGGCATC CGCCAGGACT CCACGGTTCG GCTGCTGTCA
ACGTACGGGG ACCAGGGCAT GGTTTCCGCC TCGCTCGACG CCCTGGAGCC AGGCTCCGCC
AAAGGGTGGA CCAAGTATCC CCTCGGCGTG ATGTGGGCAC TCCGCGAGCG CGGCATTGAC
GTTCCCGGAA TCGACCTGCT GCTGGACTCG GATGTTCCGC TCGGCGCAGG CCTGTCCTCG
TCACACGCGA TCGAGTGCGC GGTGGTCACC GCCCTCAACG AGCTCACCGG CGCAGGCCTG
GCAGCGCAGG ACATGGTCCT GGCCACGCAG CGGGCTGAAA ACGACTTCGT GGGGGCTCCC
ACCGGCATCA TGGACCAGTC CGCATCCCTT CGCGGCGCCA AGGGCCACGC GGTCTTCCTG
GATTGCCGTG ACCAGAACGC CACCCTGGTG CCGTTCGAAA CGGAACCCGC GGGGCTGGTC
CTGCTGGTCA TCGACACCAA GGTCTCGCAC TCTCACGCCG ACGGCGGGTA CGCCTCGCGC
CGCGCATCCT GCGAACTCGG CGCCGAGGTC ATGGGCGTCA AGGCACTGCG CGACGTCCAG
GTCGGTGACC TGGAGGAAGC CAGCGGGCTG CTGGACGAGG TGACGTTCCG GCGCGTGCGC
CACGTTGTCA CGGAGAACGA CCGCGTGCTG CAGACGGTCG AGCGCCTGGC CGCCGAGGGG
CCCGCTGCCA TCGGCACACT GCTGGATGCC AGCCACGCAT CCATGCGGGA CGACTTTGAG
ATCTCCTGCC CGGAGCTTGA CCTGGCGGTG GACACCGCCC GTGCCAACGG AGCCATCGGA
GCACGGATGA CCGGAGGCGG TTTCGGGGGT GCGGCGATTG CCCTGACCCC CGTCGCTTCC
GAAGCGAAGG TGCGCGCCGC CGTCGTCCGT GCCTTCGCCG AGGCAGGCTA TGCCGCACCG
GACATCTTCA CTGTCTCCCC GGCAGCGGGC GCCATGCGCG TCGCCTAG
 
Protein sequence
MSAAPHPTAD SAVPGTQDLA ARFTREFGAA PAGVWQAPGR VNLIGEHTDY NEGFVLPFAI 
DRTARVAVGI RQDSTVRLLS TYGDQGMVSA SLDALEPGSA KGWTKYPLGV MWALRERGID
VPGIDLLLDS DVPLGAGLSS SHAIECAVVT ALNELTGAGL AAQDMVLATQ RAENDFVGAP
TGIMDQSASL RGAKGHAVFL DCRDQNATLV PFETEPAGLV LLVIDTKVSH SHADGGYASR
RASCELGAEV MGVKALRDVQ VGDLEEASGL LDEVTFRRVR HVVTENDRVL QTVERLAAEG
PAAIGTLLDA SHASMRDDFE ISCPELDLAV DTARANGAIG ARMTGGGFGG AAIALTPVAS
EAKVRAAVVR AFAEAGYAAP DIFTVSPAAG AMRVA