Gene Achl_3169 details


Gene Information

Locus tag: Achl_3169
Symbol:
ID: 7294649
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 3524828
End bp: 3526165
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 63%
IMG OID: 643591579
Product: extracellular solute-binding protein family 1
Protein accession: YP_002489219
Protein GI: 220913910
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
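
The start, end, gene length, and protein length values above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence on this page ends in TAG). The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of a complete CDS, assuming it ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(3524828, 3526165)
print(length)                  # 1338
print(protein_length(length))  # 445
```

Both results match the Gene Length and Protein Length fields of this record.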


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.187022
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCTAT TACTCAAGGG AGAGAACATG GCTTCACCGT TTGATGCGTC CGCAACTGCC 
TTCCCGAGCA GGCGGAGCAT CCTCAAGACC GCCGGCGTTG GCGCTGCCAG CCTGGCCGGC
ATCCCGTTCC TCGCAGCCTG CACAGGCGGC AGCGCACCGT CCGCAACAGG TACCGATTCC
GGCGGACTGA CCTTCGGCTC CGGCTCCTCC GACGATGTTC CCAAGCGGGC CTACCAGGCC
GTCACCGATG CGTTTACGGC CAAGACCGGC AAGAAGGTCA CCACCAACAC GGTCCCCCAC
AACGACTTCC AGAACAAGAT CAACTCCTAC CTCCAGGGCT CCCCGGATGA CACCTTCACC
TGGTTTGCCG GCTACCGGAT GCAGTACTAC GCCGGCAAGG GACTCCTTGC TCCCATCGAC
GACGTCTGGG AAACCATCGG CGCCAACTAC TCCGACGCGC TGAAGAAGGC CTCCACCGGA
CCCGACGGCA AGCTGTACTT CGTGCCCAAC TACAACTACC CGTGGGGTTT CTTCTACCGG
AAGAGCCTGT GGGCCGAGAA GGGGTACGAG GTTCCGGAAA CCTTTGACGC CCTCAAGACC
CTCGCCGCGA AGATGCAGGG AGACGGCATC ATCCCCATCG GCTTCGCGGA CAAGGACGGC
TGGCCCGCCA TGGGCACCTT CGACTACATC AACATGCGGC TGAACGGCTA CCAGTTCCAC
GTGGACCTGT GCGCCCACAA GGAATCCTGG GACCAGCAGA AGGTCAGCGC CGTCTTTGAC
ACCTGGTCCG CGCTGCTGCC GTTCCAGGAT CCCGGAGCCC TCGGCCAGAC CTGGCAGGAT
GCTGCCAAGT CGCTTGAAGC CAAGAAGACC GGCATGTACC TGCTGGGCTC GTTCGTCACC
CAGCAGTTCA CCGACGCTGC GGTGCTGGCC GACATCGACT TCTTCGCCTT CCCGGAGATC
GCCATGGAAG GCCGGGACGC CGTCGAAGCC CCCATCGACG GCCTCCTGCT GTCCAAGAAG
GGCGGCGAGA ACAAGGCTGC GCGGGACTTC ATGGCGTACC TGGGCACGCC CGAGGCGCAG
GACGCCTACG CCGCGGTGGA TGCCTCCAAC ATCGCCACCG CCAAGGGCAC CGACACCTCC
AAGTTCAGTC CGCTGAACAA GAAGTGCGCC GAGACCATCG CTGACGCCAA ATACATCAGC
CAGTTCTTCG ACCGTGACGC GTTGCCCGCC ATGGCCAACA ACGTGATGAT CCCTGCCCTG
CAGAGCTTCA TCAAGGACGG CAAGATGGAC GTCAAAAACC TTGAGGCGCA GGCCAAAACC
CTCTACGCGG CGCAGTAG
 
Protein sequence
MLLLLKGENM ASPFDASATA FPSRRSILKT AGVGAASLAG IPFLAACTGG SAPSATGTDS 
GGLTFGSGSS DDVPKRAYQA VTDAFTAKTG KKVTTNTVPH NDFQNKINSY LQGSPDDTFT
WFAGYRMQYY AGKGLLAPID DVWETIGANY SDALKKASTG PDGKLYFVPN YNYPWGFFYR
KSLWAEKGYE VPETFDALKT LAAKMQGDGI IPIGFADKDG WPAMGTFDYI NMRLNGYQFH
VDLCAHKESW DQQKVSAVFD TWSALLPFQD PGALGQTWQD AAKSLEAKKT GMYLLGSFVT
QQFTDAAVLA DIDFFAFPEI AMEGRDAVEA PIDGLLLSKK GGENKAARDF MAYLGTPEAQ
DAYAAVDASN IATAKGTDTS KFSPLNKKCA ETIADAKYIS QFFDRDALPA MANNVMIPAL
QSFIKDGKMD VKNLEAQAKT LYAAQ
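
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done and how a GC percentage like the one in the GC content field could be computed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first row of the gene sequence rather than the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First row of the gene sequence as shown on this page (six blocks of ten).
first_row = "ATGCTGCTAT TACTCAAGGG AGAGAACATG GCTTCACCGT TTGATGCGTC CGCAACTGCC"
seq = clean_sequence(first_row)

print(len(seq))             # 60
print(seq.startswith("ATG"))  # True: the CDS begins with a start codon
print(round(gc_percent(seq), 1))
```

Applying the same two functions to all rows of the gene sequence should give a length equal to the Gene Length field and a GC percentage close to the reported value, assuming the reported figure is rounded to a whole percent.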