Gene Information |
Locus tag | Achl_3387 |
Symbol | |
ID | 7294868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | + |
Start bp | 3756132 |
End bp | 3757784 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643591794 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002489433 |
Protein GI | 220914124 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
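
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the gene ends with a single stop codon that is not counted in the protein length; the function names `span_length` and `coding_codons` are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3756132
END_BP = 3757784
GENE_LENGTH_BP = 1653
PROTEIN_LENGTH_AA = 550


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range (assumption)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the record is consistent: the coordinate span matches the stated gene length, and the gene length divided into codons, minus the stop codon, matches the stated protein length.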
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 100 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGAGGC ACATGAACAA CATCAAGAAC CTGCCGCTGG TCAATGACGC CAGCCGCCGG AACTTCCTGA AGCTCGGCGG GGCCATGGGC CTGGCCGCAG CCTTCGCCAC CTCCCTGTCC GCCTGCGGCG GCCCTGCCGC GACCACCACG GGCGCCACCG AAAACACCTC GCCCATCAAC AAGGACCTCA CCATCGAGGC CGGCATCTCC TACGCGCTGT CCACCGGCTT TGACCCCATC TCCTCCTCCG GCGCGACGCC GATGGCCGCC AACCTGCACG TCTTCGAAGG CCTCATCGAA CTTCACCCCG CCACCCGTGA GCCGTACAAC GCCCTGGCTG CCGCAGACCC CAAGAAGGTC AGCGACACCC AGTACCAGGT GTCCATCCGG GACGGCGCAG TGTTCCACGA CGGCACGCCG GTCACCGCCG AGGACGTGGT GTTCTCCTTC ACCCGCGTGA TGGACCCGGC CAACAAGTCG CTGTTCTCGC AGTTCATCCC GTTCATCCAG GAGGTCAAGG CCCTGGACGC CAAGACCGTG GAATTCACCC TTAAGTACGC GTTCCCCGGC TTCGGCCCGC GCATTTCCGT GGTGAAGGTG GTGCCCAAGG CGCTGACCAA TTTCCCGTTC GGCTCCGATC AGCTGAAGTC CTTCGACGCC AAGCCCGTGG GCACCGGCCC GTACAAGCTC ATCTCCGCCG TCAAGGACGA CAAGATCGTC TTCGAGGCCA ACCAGGCCTA CAACGGCCCC ATGCCGGCCC TGGCCAAGGG CATGACCTGG CTGCTGCTCT CCGATGCCGC CGCCCGCGTC ACCGCCATGC AGTCCGGCCG TGTCCAGGCC ATCGAGGACG TCCCCTACCT GGACATGGAG GGACTCAAGT CTGCCGCCAC AGTTGAGTCC GTGCAGTCCT TCGGCCTGCT GTTCATGATG TTCAACTGCA ACGCCGCACC CTTCAACAAC AAAAAGGTCC GCCAGGCCCT CCACTACGGC CTGGACAAGG ACTCCATCAT CAAGAAGGCC CTGTTCGGCA ACGCCAAGCC GGCCAGCTCC TACTTCCAGG AAGGCCACCC GGACTACGTC AAGGCCAAGA ACGTCTACGG CTACGACGCC CAGAAGGCCG AGGACCTCCT CAAGGAAGCC GGCGTCACCA GTCTGGAATT CGAGCTGCTC ACCACGGACA CCGCCTGGGT CAAGGATGTG GCACCGCTGA TGCTCGAATC CTGGAACAAG ATCCCCGGCG TCAAGGTCAC CCTCAAGAAC CTGCAGTCCG GCGCCCTGTA CACGGACCGC GTGGCCAAGG GTGACTACAA GGTTGTTGCC GCACCGGGCG ATCCCTCGGT GTTCGGCAAC GACGCGGACC TGCTGCTGAG CTGGTTCTAC GCCGGCGACA CCTGGATGAA GAACCGCGCC TTCTGGGCCG ACACCCCGGA ACGTGCGCAG CTGGTGGACC TGATGTCCAA GGCCGGCCGC GCCGCCAAGG AAGATGCCAA GAAGCTGACC GGCGAGATTG TGGACCTGGT GTCCGAGGAA GTCCCGCTGT ACCCGCTGTT CCACCGCCAG CTTCCCAGCG CCTGGGATTC CAAGAAGCTC AGCGGCTTCA AGCCCCTCCC CACCACCGGA GTGTCCTTCG TTGGTGTGGG CCGCACCGCC TAG
Protein sequence | MVRHMNNIKN LPLVNDASRR NFLKLGGAMG LAAAFATSLS ACGGPAATTT GATENTSPIN KDLTIEAGIS YALSTGFDPI SSSGATPMAA NLHVFEGLIE LHPATREPYN ALAAADPKKV SDTQYQVSIR DGAVFHDGTP VTAEDVVFSF TRVMDPANKS LFSQFIPFIQ EVKALDAKTV EFTLKYAFPG FGPRISVVKV VPKALTNFPF GSDQLKSFDA KPVGTGPYKL ISAVKDDKIV FEANQAYNGP MPALAKGMTW LLLSDAAARV TAMQSGRVQA IEDVPYLDME GLKSAATVES VQSFGLLFMM FNCNAAPFNN KKVRQALHYG LDKDSIIKKA LFGNAKPASS YFQEGHPDYV KAKNVYGYDA QKAEDLLKEA GVTSLEFELL TTDTAWVKDV APLMLESWNK IPGVKVTLKN LQSGALYTDR VAKGDYKVVA APGDPSVFGN DADLLLSWFY AGDTWMKNRA FWADTPERAQ LVDLMSKAGR AAKEDAKKLT GEIVDLVSEE VPLYPLFHRQ LPSAWDSKKL SGFKPLPTTG VSFVGVGRTA
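
The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch of how such a block-formatted string might be cleaned and its GC fraction computed is shown below; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between display blocks and uppercase the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence shown above.
    example = "ATGGTGAGGC ACATGAACAA"
    seq = clean_sequence(example)
    print(len(seq), round(gc_fraction(seq), 2))
```

Pasting the full Gene sequence field into `example` would give a value to compare against the GC content field of the record, which is presumably rounded to a whole percent.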