Gene Information |
Locus tag | Achl_3566 |
Symbol | |
ID | 7295047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | + |
Start bp | 3964303 |
End bp | 3965583 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643591972 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002489611 |
Protein GI | 220914302 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
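
The coordinate and length fields in this table are related by simple arithmetic, which makes a quick consistency check possible. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3964303, 3965583)
print(length, protein_length_aa(length))  # 1281 426
```

Under those assumptions the result agrees with the Gene Length (1281 bp) and Protein Length (426 aa) fields.
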
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 82 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGCA CCACCCTCGC CGCCATCGCA CTGGCAGTAA CCGCAGGCCT GGGCCTGTCC GGTTGTGCCG GCGCCGCCGG ACCGGCCGAA CCCCAGGGCC AGGAGGGCAA GACCCGCCTG ACCGTTTCGG TCTGGAACTA CGAAGGCACG CCGGAGTTCA AGGCCCTCTT CGACAGCTAC GAAGCAGCCA ACCCGGACAT CGACATCGAA CCCGTGGACA TCCTCGCCGA CGACTACCCG CAGAAGGTCA CCACCATGCT GGCCGGCGGA GACACCACCG ACGTCCTCAC CATGAAGAAC GTCATCGACT ACGCCCGCTA CGCCAACAAC GGCCAGCTAC AGGAAATCAA CGGCGTGGTG GACTCCGTTG GCAAGGACAA CCTCGCGGGC CTGGACGCCT TCGACATCGG CGGCAAGTAC TACGCCGCCC CCTACCGGCA GGACTTCTGG CTCCTGTACT ACAACAAGGA CCTGCTCAAG GCTGCAGGCG TCGAGAACCC CGCCGACCTG ACGTGGGACG AGTACACCGC GCTGGCCAAG AAGCTCACCA CCGAGGCCAA CGGCAAGAAG GTCTACGGCA CCTACCACCA CATCTGGCGT TCCGTGGTGC AGGCCATCGC GGCCGCCCAG GATGACGCCG ACCAGAACAG CGGCGACTAC GGCTTCTTCG AGGACCAGTA CAACACTGCC CTGGACCTGC AGAAGAGCGG CGCCACCCTG GACTTCGGCA CCGCCAAGAG CCAGAAGACC AGCTACCGCA CCATGTTCGA GACCGGACAG GCGGCCATGA TGCCCATGGG CACCTGGTAC ATCGCCGGCA TCCTGCAGGC CAAGAAGGAC GGCAAGTCCA CCGTTGACTG GGGGCTGGCT CCGATGCCGC AGAAGAACGA CGACGGCAAG GTCACCACTT TCGGTTCGCC CACCGCTTTC GCCGTCAACA AGAACGCCGC GCACTCGGAT GCAGCCAAGA AGTTCATCGA GTGGGCTGCG GGTGAGGAAG GCGCCAAGGC CATCTCCAAG ATCGGTGTTG TCCCCGCACT GCAGAACGAC GCCATCACTG CCGAGTACTT CAAGCTTGCC GGCATGCCCA CGGACGAGCT GTCCAAGAAG GCCTTCACCC CGGACAAGGT TGCCCTGGAA ATGCCGGTCA GCGACAAGTC TGCCGCCACG GACAAGATCC TCAACCAGGA ACACGACCTG GTCATGGTGG GTGAGCGCTC GGTGGCCGAC GGTGTTGCCG AGATGGGCAA GCGCGTCAAA AGCGAAGTCC TGGGCAAGTA A
|
Protein sequence | MKRTTLAAIA LAVTAGLGLS GCAGAAGPAE PQGQEGKTRL TVSVWNYEGT PEFKALFDSY EAANPDIDIE PVDILADDYP QKVTTMLAGG DTTDVLTMKN VIDYARYANN GQLQEINGVV DSVGKDNLAG LDAFDIGGKY YAAPYRQDFW LLYYNKDLLK AAGVENPADL TWDEYTALAK KLTTEANGKK VYGTYHHIWR SVVQAIAAAQ DDADQNSGDY GFFEDQYNTA LDLQKSGATL DFGTAKSQKT SYRTMFETGQ AAMMPMGTWY IAGILQAKKD GKSTVDWGLA PMPQKNDDGK VTTFGSPTAF AVNKNAAHSD AAKKFIEWAA GEEGAKAISK IGVVPALQND AITAEYFKLA GMPTDELSKK AFTPDKVALE MPVSDKSAAT DKILNQEHDL VMVGERSVAD GVAEMGKRVK SEVLGK
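
Both sequences are shown in space-separated blocks of 10 characters, so the spaces have to be removed before any computation. The following is a minimal sketch of that cleanup and of a GC-percentage calculation of the kind summarized in the GC content field; the helper names are hypothetical, and the exact rounding used by the source database is not known.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that splits the sequence into 10-character blocks."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(dna.count(base) for base in "GC")
    return 100.0 * gc / len(dna)


# First three blocks of the gene sequence from this record.
prefix = clean_sequence("ATGAAGCGCA CCACCCTCGC CGCCATCGCA")
print(len(prefix), prefix[:3])  # 30 ATG
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` gives a value that can be compared with the reported 64%.
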