Gene Achl_1750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_1750 
Symbol 
ID7293210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp1976703 
End bp1978049 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content65% 
IMG OID643590159 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002487819 
Protein GI220912510 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0000000348272 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGAAAA TGAAAGCAAC CGGCGCCCTT GCTGCGGCCG CCGCGGCAGT CCTCGCCCTC 
TCCGCCTGCG GCAGCGGCGG CGGTTCAGCC GAAGCCGGCA AGGGCGAAAT CAGCTACTGG
CTGTGGGACG CCAACCAGCT TCCGGCCTAC CAGCAGTGCG CCGACGACTT CACCAAGGCC
AACCCGGACA TCACCGTCAA GATCACCCAG CGCGGCTGGG ACGACTACTG GAGCACCCTT
ACCAACGGCT TCGTGGGAGG CACCGCCCCG GACGTCTTCA CCAACCACCT GGGCAGGTAC
GGCGAGCTGG CCGAGAACAA GCAGCTCCTG GCCATCGATG ACGCCGTGGC CAAGGACAAG
GTGGACCTCT CCGCCTACAA CGAAGGCCTC GCCGACCTCT GGGTGGGCCA GGACGGCAAG
CGGTACGGGC TGCCCAAGGA CTGGGACACC ATCGGCCTGT TCTACAACAA GGACATGCTT
TCCGCCGCCG GGATTTCCGA GGACCAGATG AAGGACCTCA CCTGGAACCC CAAGGACGGC
GGCAGCTACG AGAAGGTCAT CGCCCACCTG ACCGTGGACA AGAGCGGCAA GCGCGGCGAC
GAGCCCGGGT TCGACAAGAA CAACGTCCAG GTCTACGGCC TGGGCCTGAA CGGCGGCGGC
GACTCCTCCG GCCAGACTGA GTGGAGCTAC CTCACCAACA CCACCGGCTG GTCCCACACG
GACAAGAACC CGTGGGGCAC CCACTACAAC TATGACGACC CGAAGTTCCA GGACAGCATG
CAGTGGTTCG CGGGCCTGGC GGACAAAGGC TACATGCCCA AGCTCGAAAC CACCGTCGGC
GCCAGCATGG CTGACACCTT CGCCGCCGGC AAGTCCGCCA TCAACGCCCA CGGTTCGTGG
ATGATCGGCC AGTACACCGG GTACAAGGGC ATCCAGGTGG GTATCGCCCC CACCCCGGTG
GGCCCCGAAG GCGAGCGCGC CTCCATGTTC AACGGCCTGG CCGACTCCAT CTGGGCCGGC
ACCAAGAAGA AGGACGCCTC AATCAAGTGG GTTGAATACC TGGCCTCCTC AGCCTGCCAG
GACGTCGTAG CGTCCAAGGC CGTGGTCTTC CCCGCGCTGA AGGCCTCCTC GGACAAGGCC
GCCGCAGCGT TCCAGGCCAA GGGCGTGGAC GTCACCGCGT TCACCGAGCA CGTGAAGAAC
AAGACCACGT TCCTGTACCC GATCACGGAC AACACCGCCA AGGTCAAGGG CATCATGGAA
CCGGCCATGG ACGCCGTGGT GTCCGGCAAA GCGCCGGTCA GCTCGCTGAC TGCCGCCAAC
GATCAGGTCA ACGCCCTGTT CAAGTAA
 
Protein sequence
MKKMKATGAL AAAAAAVLAL SACGSGGGSA EAGKGEISYW LWDANQLPAY QQCADDFTKA 
NPDITVKITQ RGWDDYWSTL TNGFVGGTAP DVFTNHLGRY GELAENKQLL AIDDAVAKDK
VDLSAYNEGL ADLWVGQDGK RYGLPKDWDT IGLFYNKDML SAAGISEDQM KDLTWNPKDG
GSYEKVIAHL TVDKSGKRGD EPGFDKNNVQ VYGLGLNGGG DSSGQTEWSY LTNTTGWSHT
DKNPWGTHYN YDDPKFQDSM QWFAGLADKG YMPKLETTVG ASMADTFAAG KSAINAHGSW
MIGQYTGYKG IQVGIAPTPV GPEGERASMF NGLADSIWAG TKKKDASIKW VEYLASSACQ
DVVASKAVVF PALKASSDKA AAAFQAKGVD VTAFTEHVKN KTTFLYPITD NTAKVKGIME
PAMDAVVSGK APVSSLTAAN DQVNALFK