Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_2541 |
Symbol | |
ID | 7294016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | - |
Start bp | 2859480 |
End bp | 2861258 |
Gene Length | 1779 bp |
Protein Length | 592 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643590950 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002488595 |
Protein GI | 220913286 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.0000236938 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCAGTCA GGCGCCTGAT GCAGTACATC ACCGCAGCGG CAGTGGTCGC AGTGGCCCTT TCTGGCTGCT CAGGCGGCGG CAGCGCGCCG GTAGTAGTGG GGGAGGCCAA GCGCGGAGGC AGCGCCACCG TGGCCGAGGT CAACGCTTTT TCCTCCTTCA ACCCGTTCAG TACCGACGGC AACACGGACA TCAACTCCAA GATCGGCGCA GCCACCCACT CTGGCTTCTA CTCCCTGGAC GATAAATCCG TGGTGGTCCG GAACGACAAG TTCGGCCGCT ACGAAAAAGT CTCTGATGAC CCGCTTAAGG TGCGGTACAC AGTCAACGAA GGCGTCAAAT GGTCCGACGG CGCGCCGATC GACGCTGGCG ACCTCCTCCT GAGCTGGGCC GCGGGTTCGG GTTATTTCGA CGACGCGGAT CCTGCGGCCG GGACGGGCAC CAGGTATTTT TCGGCTGCGT CAGCCGCCGG CGGCCTCGCG GGCACGGCGT TCCCCGAGCT CGGCGACGAC GGCCGGTCCA TCACGCTGCA GTACGCCGCG CCTTACGCGG ACTGGCAGAC CGCGTTCGAC GTCGGCCTGC CAGCCCATGT GGTCGCAGCC AAGGCCGGGC TGAGCGACGA GGAGGACCTC GTGGACCTGA TCAAGGACGC GCCCAAGGGA AACCCCGGAA AACCGGCGGT AAATTCGGCG TTGAAGACGG TGGGCGATTT CTGGAACAAC GGATTCGATA CGAAATCCCT GCCCGACGAC CCCGCCCTGT ATCTTTCCAG CGGACCCTAC ATCGTGCGGG ACATCGTTCC GGAAGTATCC ATGAAACTCG TCCGGAACCG GGACTACGTG TGGGGGACCG AGCCGTGGCT TGACGAGATC AACGTCCGGT TCACCGGTGC CCTTCCTACC GCTGTTGATG CGCTCCGCAG CGGGCAGGCG GACATCATCT CGCCGCAGCC TTCCGCCGAC ACCGCGAACC TCTTCGCCGG CCTGGCGGAC CAGGGAAACA CGGTGGAGCA GTACAGCCAG TCGGGGTACG ACCACCTCGA CCTCAACTTC TCCGGGCCCT TCGCGGACGA GGACGTCCGC AAGGCCTTCC TGAAGGCCGT GCCCCGGCAG GCCATCGTGG ACGCGGTGGT GGGGGGCCTG ATTACGGACG CCAAACCGCT CGATTCGCAG GTCTTCCTTC CGGGCCAGCC CAAGTACGCG GATACTGTGA AAAACAACGG CTCGGCCGAA TACGCCGAGG TGGACATCGA CGCAGCCAAG GAACTCCTGG ACGGTGCCAC GCCGACCATC CGCATCCTGT ACAACCGGGA CAACCCCAAC CGCGCCAAGG CATTCACCCT GATCCGCGAT TCGGCGCAGA AGGCCGGTTT CCGAGTGGTC GATGCCGGCC AGGGAAATGC GGACTGGGCC AAGTCGCTCG GGGGCGCAGG GTACGACGCC GCTTTGCTGG GGTGGATCGG AACGGGCGCC GGAGTGGGCC GCATCCCGCA GATCTTCCGC ACCGGGGCGG GCAGCAACTT CAACGGATTC TCCGACGGCG ACGCGGACAA GGCAATGGAG CAGCTGGCAA CCACCACTGA CCTCGGCAAA CAGGACGAAC TGCTGGCGGG GATCGATAAG CGCGTCTGGG AGAAAGCGTA CGGACTGCCG CTTTACCAGA CGGTCGGAGC CATAGCCTTC AACGCCCGGG TGACCGGTGT GAAACCCAGC CCGGGACCCC TCGGCGTGTG GTGGAACGTC TCGGATTGGC GCCTTGCCGA GCAGGGGACC AAGAACTGA
|
Protein sequence | MPVRRLMQYI TAAAVVAVAL SGCSGGGSAP VVVGEAKRGG SATVAEVNAF SSFNPFSTDG NTDINSKIGA ATHSGFYSLD DKSVVVRNDK FGRYEKVSDD PLKVRYTVNE GVKWSDGAPI DAGDLLLSWA AGSGYFDDAD PAAGTGTRYF SAASAAGGLA GTAFPELGDD GRSITLQYAA PYADWQTAFD VGLPAHVVAA KAGLSDEEDL VDLIKDAPKG NPGKPAVNSA LKTVGDFWNN GFDTKSLPDD PALYLSSGPY IVRDIVPEVS MKLVRNRDYV WGTEPWLDEI NVRFTGALPT AVDALRSGQA DIISPQPSAD TANLFAGLAD QGNTVEQYSQ SGYDHLDLNF SGPFADEDVR KAFLKAVPRQ AIVDAVVGGL ITDAKPLDSQ VFLPGQPKYA DTVKNNGSAE YAEVDIDAAK ELLDGATPTI RILYNRDNPN RAKAFTLIRD SAQKAGFRVV DAGQGNADWA KSLGGAGYDA ALLGWIGTGA GVGRIPQIFR TGAGSNFNGF SDGDADKAME QLATTTDLGK QDELLAGIDK RVWEKAYGLP LYQTVGAIAF NARVTGVKPS PGPLGVWWNV SDWRLAEQGT KN
|
| |