Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_1103 |
Symbol | |
ID | 7292548 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | + |
Start bp | 1215643 |
End bp | 1217217 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643589509 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002487184 |
Protein GI | 220911875 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTTC TTCCTCAAGG ACGTGAGATT TCCCGCCGCC GCCTGCTGCA GTTCGGCACG GCCGCAGGGT TTCTGCTGGG CACCGGCAGC CTGGCCGGCT GCGCCGGTCC CACCGGCCTC CCCGGACCCA GCACCCTGAC CCTGGCCCTG AACCGCTCCC TGGTCAGCCT GGACAACAAG CTCAACCAGT TCGATGCCGC CGTCACCGTA CAGCGCTCCG TCCGCCAGGG CCTCACCGCC ATCGGCCCCG AAACCAAGCC CGTCCTGGTG CTGGCCGAAC GCTTCGAAAT GACCGGCCCC ACCGAATGGA CCGTCACCCT CCGCGAAGGC ATCCGCTACT CGGACGGCAG CCCCGTGCAG ATCGAGGATG TGGCCACCGC CCTGAAGATG TACAAGCAGG TGCAGGGCTC CTTCGTAGCA GGCTTCTTCC CCGAATTCCC CGAGGTTGTC CCCGTGGACA ACCGCACGTT CAAGATGGTG TCCAAGAACC CCGTCCCCAT CCTGGACAGC CTCATGAGCA TGATCCTGAT TACCCCGGCC GCACAGAACA AGCCGGAGGA ACTCCAGGAA GGCGTGGGCA CCGGCCCCTA CAAGGTCACC AAGTTCAACC GCGGCGCCGG CACCTACAGC CTGGCACGCA ACGAGAACTA CTGGGGCCCG GCGCCGGAGA TCGAGAACGT GGAAGTCCGG TTCCTCCCCG AGGAATCCAG CCGCGTCATC GCCCTGCGCA GCGGCGAGGT GGACATCATC GACTCCATCA CCCCGGACTC CCGCGAACAG CTGGCCGGCC TCCCCGGCGT CCAGCTGGCC GAGGCGTCCA GCCTGCGGCT CAACCAGATC TTCTACAACT TCCGCAAGCC CGCCGGCCAC CCCCTGGCCG ATGTCCGCGT CCGTGAAGCC CTCAGCTGGG CCATCGACGG CGAATCCCTG GTCAAGGACG TGCTGGTGGA CTCCGTCAGC GCCGCCGAGG GCGTCACGCC CGGCAGCCTC ACCGGCTACC ACAAGACCGG CACCTACACC TACGATCCGG AGAAGGCCAA GGCCCGGCTC GCCGAGCTCG GCGTCAAGGA CCTCACCCTG AAGATCATCT GGGAAACCGG AGAATTTGCC TCCGACACCT CCGTGATGGA AGCCCTGGTG GAAATGTTCG GCAAGATCGG CGTCAAAACC GAACTCCAGC AGTTCGAACC CGGCGGCAAC ATCCTGGCCT GGCGCCAGGG CAAGCAGGGC GACTGGGACC TGCTGGGCAA CGGCTTCTCC AGCCCCACCG GCCTGGCCAT CACCATGATG CAGGGCATGT ACGCCGGCAC CCCGGAGAAG GAAAAGACCC GCGACACCTA CCAGGGCTAC GTCATCCCCG AGGTGCAGGC CAAGATCCAG GCCGCCTCCT CCGAGGTGGA CGCCACCCGC CGGCAGGAAC TGCTGGCCGA CGCGCAGCAG GCCATCTGGG ACACCTGGCC CTGCGCCTGG GCGTTCGTGC CCAAGTCCGT CTTGGCCCAC CGGAACCGGG TCTCCGGCAT CAACCTGGCA CCCACCAACT CCTACCCGCT CGTCGACGCA CGGCTGGAGG CCTAA
|
Protein sequence | MTVLPQGREI SRRRLLQFGT AAGFLLGTGS LAGCAGPTGL PGPSTLTLAL NRSLVSLDNK LNQFDAAVTV QRSVRQGLTA IGPETKPVLV LAERFEMTGP TEWTVTLREG IRYSDGSPVQ IEDVATALKM YKQVQGSFVA GFFPEFPEVV PVDNRTFKMV SKNPVPILDS LMSMILITPA AQNKPEELQE GVGTGPYKVT KFNRGAGTYS LARNENYWGP APEIENVEVR FLPEESSRVI ALRSGEVDII DSITPDSREQ LAGLPGVQLA EASSLRLNQI FYNFRKPAGH PLADVRVREA LSWAIDGESL VKDVLVDSVS AAEGVTPGSL TGYHKTGTYT YDPEKAKARL AELGVKDLTL KIIWETGEFA SDTSVMEALV EMFGKIGVKT ELQQFEPGGN ILAWRQGKQG DWDLLGNGFS SPTGLAITMM QGMYAGTPEK EKTRDTYQGY VIPEVQAKIQ AASSEVDATR RQELLADAQQ AIWDTWPCAW AFVPKSVLAH RNRVSGINLA PTNSYPLVDA RLEA
|
| |