Gene Information |
Locus tag | Achl_1199 |
Symbol | |
ID | 7292644 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | + |
Start bp | 1320192 |
End bp | 1321490 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643589604 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002487279 |
Protein GI | 220911970 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
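The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (which is consistent with the 1299 bp length listed in this record) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 1320192
END_BP = 1321490


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1299
print(protein_length(length_bp))  # 432
```

Both results agree with the Gene Length (1299 bp) and Protein Length (432 aa) fields of the record.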
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0000000829014 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGCGTC ACAGCACCCA CCGCCTCCGC AAATCCGTCG CAATGCTCAG CGCGGCATGC CTGCTTGGGC TGACAGCCGC GTGTTCCGCA CCGTCGGGCG GAGGAGGCGA CGGGCCGGTG GAGATCCGGT TCGCCTGGTG GGGCAATGCG GGCCGCGCAG AGCTCACCAA CAAGGCCATC GCCGAATTCG AAGCCGCGAA CCCTGACATC AAGGTGAAGC CCGAGTTCGG GGACATCGGC GGTTACTTCG ACAAACTCGC CACCCAGGTG GCCGCGAACG ATGCACCGGA CGTCATCACC ATGGGCGGTG CCTACCCGGC GGAGTATGCC AACCGTGGGG CGCTGCTGGA CCTCTCAACG GTCCGGGGTC AACTCGACCT CAGCAAGATG GACCAGGGGG CGCTGGACAA CGGCCAGGTG CAGGGCAAGC AGTACGGCAT TTCCACCGGT GCCAACGCCC TCGCCATCGT GGTGAACCCC GCCGTTTTTG CGGCGGCCGG CGTTCCCCTC CCGGACGATG CCACCTGGAG CTGGGAGGAC TTTGCTGAAA CCGCCGCGAG CGTGACGGCG AAGAGCCCCA AGGGCACCTC TGGGACGGCA ACGGTCCTCA CCCACGACTC CCTGGACGCC TTCGCACGGC AGCGGGGGGA GTCCCTCTAC ACCCAGGACG GCCAGCTTGG TCTCACCAAG GAGACAGTCC AGGACTACTT CGACTTCTCC CTCAAACTCA GCGAGTCCGG CGCTGCGCCC AACGCCTCCG AGACAGTGGA AAAGCTCAGC GTCAGCACCG AACAAACACT CATGGGCATG GGCCAGGCCG GCATGATGCT CACGTGGAGC AACTCTTTGA CGGCGCTCAG CAAGGCCTCC GGAGCCGAAC TGAAACTCCT CAAGCTCCCC GGCGAGAAGC CCACACCGGG CATCTGGCTC CAGTCATCGC AGTTCTACAC CATTTCCGCC CGGAGCAAGC ACACCGAAGC CGCGGCCAAG CTGGTGAACT TCCTGGTCAA TAACCAGGCC GCCGCCAAGA TCATCCAAAG CGACCGCGGC GTGCCCAGCA ACCCCGAAAT GCGCACGGCC ATCCAGGACC TCCTGACGCC GCAAGGCAAG GTCGAAGCTG CCTACATCGG TGAGGTCGGC AAGATGGACT TCGCGCCCAC CTACATCGGG CCCACGGGGT CGACGGCGGT CTCAGAGATC ACGGCGAGGA TCAACACCGA CGTCCTGTTC AAGCGGTTGA CCCCCGAAAA GGCGGCCGAA CAGTGGATCA GTGAAAGCAA GGCCGCTATC GGCAAGTAG
|
Protein sequence | MTRHSTHRLR KSVAMLSAAC LLGLTAACSA PSGGGGDGPV EIRFAWWGNA GRAELTNKAI AEFEAANPDI KVKPEFGDIG GYFDKLATQV AANDAPDVIT MGGAYPAEYA NRGALLDLST VRGQLDLSKM DQGALDNGQV QGKQYGISTG ANALAIVVNP AVFAAAGVPL PDDATWSWED FAETAASVTA KSPKGTSGTA TVLTHDSLDA FARQRGESLY TQDGQLGLTK ETVQDYFDFS LKLSESGAAP NASETVEKLS VSTEQTLMGM GQAGMMLTWS NSLTALSKAS GAELKLLKLP GEKPTPGIWL QSSQFYTISA RSKHTEAAAK LVNFLVNNQA AAKIIQSDRG VPSNPEMRTA IQDLLTPQGK VEAAYIGEVG KMDFAPTYIG PTGSTAVSEI TARINTDVLF KRLTPEKAAE QWISESKAAI GK
|
| |
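The gene and protein sequences above are related through translation table 11, and the GC content field is a simple base count. The following is a minimal sketch of both calculations, not the pipeline that produced this record. It assumes that table 11 shares the codon-to-amino-acid assignments of the standard genetic code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate` and `gc_percent` are hypothetical, and only the first 30 bases of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
# Assumption: identical to the standard code for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base groups of the gene sequence listed above.
GENE_PREFIX = "ATGACGCGTC ACAGCACCCA CCGCCTCCGC"

print(translate(GENE_PREFIX))  # MTRHSTHRLR
print(gc_percent(GENE_PREFIX))
```

The translated prefix matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to the same functions would allow the 432 aa protein and the 65% GC content fields to be checked in the same way.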