Gene Achl_3124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_3124 
Symbol 
ID7294604 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp3470594 
End bp3471883 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content62% 
IMG OID643591534 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002489174 
Protein GI220913865 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.0621948 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCAACA GAAGGCATTT CCTTACAACC GTAGCCGTCG GCACCGCATC TGCCGGCGTA 
CTGGCGGCCT GCGGAACCGG ATCCAGCACC TCAGGACAGA CCGGTTCGGC GGACAACCCC
GTCACCATCA ACTACACCTG GTGGGGCAAC GACGACCGCG CCGAGCGCAC CCGCAAGGCC
ATTGCATTGT TCGAATCCAA GAACCCGGAC ATCAAGGTCA ACGGCAACTT CACCGACTTC
GCCGGGTACT GGCAGAAGCG TGCCACCGAA GCTGCCGGCG GTGGCCTGCC CGACGTGATG
CAGTGGGACC TGTCCTACCT GCGCGACTAC GGCCAGCGCA ACCAGCTGCT GGACCTGGGT
ACGGTCAAGA TCAATACGGA TGCCTTCGAA AAGTCCCTGC TGCCTTCCGG CCAGATCAAG
GGCAAGACCT ACGGAATCCC CACCAGCACC AATGCCTTCG CCGTCTACTA CGACCCCGCC
AAGCTGGCCT CCCTGGGTAT CGCCGAGCCG GACGGAAGCT GGACCTACAA GGAATTCAAC
GCCTTCCTCA CCGAGGTGGG CAGCAAGAGC AACGGCGCCC TCTTCGGCGG CACCGACTAC
ACGGGCGTCT GGTGGATGTT CAACGTCTGG CTGCGGCAGA ACAACATCGA AGCCTTCACC
TCCGAGGGCA AGCTCGGCTT CAGCAAGGAC GACCTGAAGA AGTGGTGGAA CCTCACGGCT
GATCTCCGCG GCACCCCGGC GATCGTCTCC GAGGAACGCG TCACCCAGCT GGCCCCGAAG
TCGCCGTTCG GCTCGAATGT CACCGCAACC GAAGTCACCT GGGACAACTT CATGGCCGGC
TACCTCGGCG ACAGCGGCGC GAAGGAACTC AAGCTCGTGC CGGTCCCCTC CGACGACGCG
GACAACCTCG GCCTGTTCCT GAAGCCGTCA ATGCTGATGG TGGCCAGCGC CAAGACCAAG
TTCAAGGACG CCGCAGCCCG CTTCATCGAC TTCATGGTCA ACGACCCCGA GGTAGGCCAG
ATCTTCAAGA CCTCCCGTGG CGTGCCCGCA TCGAAGACCC AGCGCGACGG CACCACCTTC
GAAGGCACGG ACAAGATCGT TGTCGATTAC GAAACGTCCA TCTCCCAGTA CCTCAAGGAC
GCCCCGGAGC CACCGATCGT CGGCTTCGGC ACGCTGGAGA CCTCCTTCAA GCGCATTGCT
TCGGACCTGA ACTACGGCAA GCTGGACATC AACGGTGCCA CCGACGCCTG GTTCAAGGAA
GCCGAAGACC TTATCAAGCA GAACGCCTGA
 
Protein sequence
MINRRHFLTT VAVGTASAGV LAACGTGSST SGQTGSADNP VTINYTWWGN DDRAERTRKA 
IALFESKNPD IKVNGNFTDF AGYWQKRATE AAGGGLPDVM QWDLSYLRDY GQRNQLLDLG
TVKINTDAFE KSLLPSGQIK GKTYGIPTST NAFAVYYDPA KLASLGIAEP DGSWTYKEFN
AFLTEVGSKS NGALFGGTDY TGVWWMFNVW LRQNNIEAFT SEGKLGFSKD DLKKWWNLTA
DLRGTPAIVS EERVTQLAPK SPFGSNVTAT EVTWDNFMAG YLGDSGAKEL KLVPVPSDDA
DNLGLFLKPS MLMVASAKTK FKDAAARFID FMVNDPEVGQ IFKTSRGVPA SKTQRDGTTF
EGTDKIVVDY ETSISQYLKD APEPPIVGFG TLETSFKRIA SDLNYGKLDI NGATDAWFKE
AEDLIKQNA