Gene Information |
Locus tag | Achl_1750 |
Symbol | |
ID | 7293210 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | - |
Start bp | 1976703 |
End bp | 1978049 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643590159 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002487819 |
Protein GI | 220912510 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0000000348272 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGAAAA TGAAAGCAAC CGGCGCCCTT GCTGCGGCCG CCGCGGCAGT CCTCGCCCTC TCCGCCTGCG GCAGCGGCGG CGGTTCAGCC GAAGCCGGCA AGGGCGAAAT CAGCTACTGG CTGTGGGACG CCAACCAGCT TCCGGCCTAC CAGCAGTGCG CCGACGACTT CACCAAGGCC AACCCGGACA TCACCGTCAA GATCACCCAG CGCGGCTGGG ACGACTACTG GAGCACCCTT ACCAACGGCT TCGTGGGAGG CACCGCCCCG GACGTCTTCA CCAACCACCT GGGCAGGTAC GGCGAGCTGG CCGAGAACAA GCAGCTCCTG GCCATCGATG ACGCCGTGGC CAAGGACAAG GTGGACCTCT CCGCCTACAA CGAAGGCCTC GCCGACCTCT GGGTGGGCCA GGACGGCAAG CGGTACGGGC TGCCCAAGGA CTGGGACACC ATCGGCCTGT TCTACAACAA GGACATGCTT TCCGCCGCCG GGATTTCCGA GGACCAGATG AAGGACCTCA CCTGGAACCC CAAGGACGGC GGCAGCTACG AGAAGGTCAT CGCCCACCTG ACCGTGGACA AGAGCGGCAA GCGCGGCGAC GAGCCCGGGT TCGACAAGAA CAACGTCCAG GTCTACGGCC TGGGCCTGAA CGGCGGCGGC GACTCCTCCG GCCAGACTGA GTGGAGCTAC CTCACCAACA CCACCGGCTG GTCCCACACG GACAAGAACC CGTGGGGCAC CCACTACAAC TATGACGACC CGAAGTTCCA GGACAGCATG CAGTGGTTCG CGGGCCTGGC GGACAAAGGC TACATGCCCA AGCTCGAAAC CACCGTCGGC GCCAGCATGG CTGACACCTT CGCCGCCGGC AAGTCCGCCA TCAACGCCCA CGGTTCGTGG ATGATCGGCC AGTACACCGG GTACAAGGGC ATCCAGGTGG GTATCGCCCC CACCCCGGTG GGCCCCGAAG GCGAGCGCGC CTCCATGTTC AACGGCCTGG CCGACTCCAT CTGGGCCGGC ACCAAGAAGA AGGACGCCTC AATCAAGTGG GTTGAATACC TGGCCTCCTC AGCCTGCCAG GACGTCGTAG CGTCCAAGGC CGTGGTCTTC CCCGCGCTGA AGGCCTCCTC GGACAAGGCC GCCGCAGCGT TCCAGGCCAA GGGCGTGGAC GTCACCGCGT TCACCGAGCA CGTGAAGAAC AAGACCACGT TCCTGTACCC GATCACGGAC AACACCGCCA AGGTCAAGGG CATCATGGAA CCGGCCATGG ACGCCGTGGT GTCCGGCAAA GCGCCGGTCA GCTCGCTGAC TGCCGCCAAC GATCAGGTCA ACGCCCTGTT CAAGTAA
|
Protein sequence | MKKMKATGAL AAAAAAVLAL SACGSGGGSA EAGKGEISYW LWDANQLPAY QQCADDFTKA NPDITVKITQ RGWDDYWSTL TNGFVGGTAP DVFTNHLGRY GELAENKQLL AIDDAVAKDK VDLSAYNEGL ADLWVGQDGK RYGLPKDWDT IGLFYNKDML SAAGISEDQM KDLTWNPKDG GSYEKVIAHL TVDKSGKRGD EPGFDKNNVQ VYGLGLNGGG DSSGQTEWSY LTNTTGWSHT DKNPWGTHYN YDDPKFQDSM QWFAGLADKG YMPKLETTVG ASMADTFAAG KSAINAHGSW MIGQYTGYKG IQVGIAPTPV GPEGERASMF NGLADSIWAG TKKKDASIKW VEYLASSACQ DVVASKAVVF PALKASSDKA AAAFQAKGVD VTAFTEHVKN KTTFLYPITD NTAKVKGIME PAMDAVVSGK APVSSLTAAN DQVNALFK
|
| |
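The derived fields in this record (Gene Length, Protein Length, GC content) can be cross-checked against the coordinates and sequence. The sketch below is only an illustration and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends with a single stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between 10-base groups is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Values taken from the Gene Information table above.
length = gene_length(1976703, 1978049)
print(length)                  # 1347
print(protein_length(length))  # 448
```

Passing the Gene sequence field to gc_fraction gives a value to compare with the reported GC content. The spaces between the 10-base groups do not need to be removed first.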