Gene Achl_3600 details

Gene Information

Locus tag: Achl_3600
Symbol:
ID: 7295081
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 4002796
End bp: 4004073
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 65%
IMG OID: 643592006
Product: extracellular solute-binding protein family 1
Protein accession: YP_002489645
Protein GI: 220914336
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
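
The start, end, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(4002796, 4004073)
print(length, protein_length(length))  # prints: 1278 425
```

Under these assumptions the computed values match the listed Gene Length (1278 bp) and Protein Length (425 aa).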


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGCCAC CTTTGCCGTC CGCCGGGACA CGACTTTCCC GGCGAACCTT CCTAGCCGCC 
GCAGGAGGCT CGCTGGCCGC TTTCAGCCTT GCGGCGTGCA GCCCGGCCGG AGCCCAGCCC
ACCATCACGT TCCACCAGTC GAAGCCGGAA GCGGTTCCCT ACTTCCGCGA CCTCACAGCA
AAATTCACGG CGTCGCAGGA CCGCTTCAGC GTCCTGCATG ACATGGCAAC GAACCTTTCC
GCCAGCTTCG TCCGCAGCAG CCCGCCGGAC CTTGGCTGCC TCAACTACAA CCTGGAGATG
GCGCGGTTCA TGGAGCGCGG CGCCCTCTCG GACCTGGCCG ACCTTCCGGA AGCGGCAGCC
ATCCGCGGCG ACGTCCTCGA CCTCACCAAC TGGTACCCCA CCTACCCGGG CCGCACCAGC
GTCATCCCCT ACTCGGTCAT GGCGGCGTCG GTCATCTACA ACCGGCGTAT CTTCGAGGCG
AACGGCCTCT CGGTCCCCAC CACCTGGGAC GAGCTCATTG AGGTCTGCGA ACGCCTCAAG
GCTGTGGGGA TCACTCCGGT CTACGGCACG TTCCGGGATC CCTGGACCAT CGCGCAGGGC
CTGTTCGACT ACACCGTGGG CGGGATGGTG GATGTGCGCG GCTTCTACCA GTCCATGCAC
GAAGCGGGCG AGAAGGTGGG GCCGGATTCC GAGGCCTCCT TCCAGAAAAC ACTGCTGGAA
CCCGTCCGGC GCATGGTCCA GCTGAAGAAA TACGTCAACC CCGATGCCGC CAGCCGCGGC
TACGGGGACG GCAACACCGC CATGGCGCAG GGGCAGGCAG CCATGTACTT CCAGGGGCCG
TGGGCCTTCG GCGAAATCGA AAAGGCCGGC ACCGACGTCG ACCTCGGCAC CTTCCCCCTG
CCGATGACCG ACAATCCCGC CGACCTCAAG GTCCGCGTCA ACATCGACCT TTCACTCTGG
GTCCCCGAGG TCTCGAACGG ACAGCAGGGG GCACGCGCCT TCATCCAGTA CCTGATGCAG
CCGGAGATCC AGGACACCTA CAACGCCAAA TTCCTGGGCT TCGGAACGGT CAAGGATGCC
CCGCCGGTCA CCGACCCCAG GATCGTGGAA ATGCAGAAGT ACTACGACGA GGGCCGCTTC
TACATGGGCG CGTCACAGTT CATTCCCAAC ACGATTCCCG CTGCCAACTA CATCCAGTCG
ATCATCGGCG GCGCCGATGC CGAGGGCACC CTGCGCCGGA TGGACGCCGA CTGGGCGCGC
CTGGCGTTCC GCGCGTGA
 
Protein sequence
MLPPLPSAGT RLSRRTFLAA AGGSLAAFSL AACSPAGAQP TITFHQSKPE AVPYFRDLTA 
KFTASQDRFS VLHDMATNLS ASFVRSSPPD LGCLNYNLEM ARFMERGALS DLADLPEAAA
IRGDVLDLTN WYPTYPGRTS VIPYSVMAAS VIYNRRIFEA NGLSVPTTWD ELIEVCERLK
AVGITPVYGT FRDPWTIAQG LFDYTVGGMV DVRGFYQSMH EAGEKVGPDS EASFQKTLLE
PVRRMVQLKK YVNPDAASRG YGDGNTAMAQ GQAAMYFQGP WAFGEIEKAG TDVDLGTFPL
PMTDNPADLK VRVNIDLSLW VPEVSNGQQG ARAFIQYLMQ PEIQDTYNAK FLGFGTVKDA
PPVTDPRIVE MQKYYDEGRF YMGASQFIPN TIPAANYIQS IIGGADAEGT LRRMDADWAR
LAFRA
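
The sequences above are printed in blocks of 10 characters, so they need to be joined before any analysis. The following is a rough sketch in Python, not the database's own method, of removing the block spacing and computing a GC percentage; the helper names are hypothetical, and the example uses only the first line of the gene sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks that group the sequence into blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence as shown in the record.
first_line = "ATGTTGCCAC CTTTGCCGTC CGCCGGGACA CGACTTTCCC GGCGAACCTT CCTAGCCGCC"
dna = clean_sequence(first_line)
print(len(dna), round(gc_percent(dna), 1))
```

Applying the same helpers to the full gene sequence would give a value to compare against the listed GC content.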