Gene Information |
Locus tag | Achl_0028 |
Symbol | |
ID | 7291454 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | + |
Start bp | 31837 |
End bp | 33399 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643588427 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002486120 |
Protein GI | 220910811 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
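The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS length includes the terminal stop codon (the gene sequence in this record ends in TAA). The function names `gene_length` and `protein_length` are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Achl_0028.
length_bp = gene_length(31837, 33399)
print(length_bp)                  # 1563
print(protein_length(length_bp))  # 520
```

Both results agree with the Gene Length (1563 bp) and Protein Length (520 aa) fields listed in the record.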
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.0183012 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTCTT CTTCAAAGTT TCCCTTTTCC ACTGCAGCGG TGAAGTGGCC GGCCCTGGCC GCCGTGGCAG TCCTCGGCCT CAGCGGCTGC TCCGTCGCAA ACTCCGATGG CGCAGGCGGC GGTGCCGCTG ACACGGTGCG GGTGGTGCTG GGCCAGGAAC CGCCCACGCT CGAAGCCTGC GAATCCAACC TCACCTCCAC GGGCGTTGTG GTGCGATCCA ATGTGACTGA GCCGCTGATC GAGCGAAACC CCCAGAGCGG GGAACTCGAA CCCAAGCTCG CCACCGAGTG GAAGGCCACC AGTGACACCG AATGGACACT GAAGCTCCGC GAGGGAGTCA ACTTCCAGGA CGGAACTCCT TTCAACGCCG AGGCGGCAGC CTTCACGATT GACCGTGCCG TGAACTCCAA GCTCGGGTGC AACGTTGAAG GCTACGTCTT CGGCGACGCC GACCTCGACG TCAAGGCCGT TGATGCCACC ACCCTGACCG TGACCACCCC GGAACCGGAT CCCATCCTTC CGTTGCGGCT CTCCTTCCTC GAAGTGGTGC CGACGTCCAC CAGCACCACC GAAAAGGTCC GTGAACCGAT TGGCACCGGC CCCTACAAGA TCGACAGCTG GGACGCCGGC CAGAAGATCT CGCTCAGCAG CTGGGACGGA TACTGGGGGA ACAAGCCCGC CTACGCCAAG GCTGAGTACC AGTGGCGCTC CGAAAGCTCG GTGCGTGCCG CCATGATCAC CAGCGGTGAA GCGGACGTTG CCATGGGGCT GAGCCCGGAT GACAACATCG GCGACCTCGG CGTTGACTAC CCCAACAACG AAACCGTTGC GCTGCGGATG GACGCGAACG AAGCGCCGCT CAATGACATC CGGATCCGCC AGGCCGTTAA CTACGCCATC GACAAGGAAG GCATCGTCAA CTCCCTCTAC CAGGGCAAGC ACCAGGTGGC CGCCCAACTG GTTCCCGAGG GGATCGTCGG CCACAACGAC GAACTGAAGG GCTGGCCCTT CGACCTCGAG AAGGCGAAGT CACTGGTTGC CGAGGCCAAG GCCGACGGCG TTGATACCTC CAAGCAGATT TCCCTGGTGG TCCGCAGTGC CCAGTTCCCC AAGATCACCG AACTGGCACA GGTACTCCAG GAACAGCTCA GCCAGGCAGG CCTGAACGTC AAGCTGAAGA TGCTTGAGAC CAGCCAGCAC CTGACCTACC AGGTCCGCCC CTTCGCCGAG GATGACGGCG CGGTTGCCCT GATGACCCAG CATGGCAACC AGGCGGGAGA CGCGGCCTTC ACGGTGGACC AGTACATGCT CTCCACCGGT GCCCAGAGCT ACTTCGGCAC CCCCGAGTTC GACGCAATGA TCAAGAAGGC AGACGCCGCG TCCGGCGATG ATCGCCAGAA GGCCTTCGAG GAGATCTTCG CCTACCAGAA CGACAAGGTG GTCCAGTTCG CCCACATTTC GCACCAGACG GGCATCCTGG GCAAGGCCAA GTCCGTCAAC TACACGCCCA ACTCCTCCAG TGGTGACGAG CTGCGCATCT CCGAGATGAC CCCGGCCAGC TAA
|
Protein sequence | MTSSSKFPFS TAAVKWPALA AVAVLGLSGC SVANSDGAGG GAADTVRVVL GQEPPTLEAC ESNLTSTGVV VRSNVTEPLI ERNPQSGELE PKLATEWKAT SDTEWTLKLR EGVNFQDGTP FNAEAAAFTI DRAVNSKLGC NVEGYVFGDA DLDVKAVDAT TLTVTTPEPD PILPLRLSFL EVVPTSTSTT EKVREPIGTG PYKIDSWDAG QKISLSSWDG YWGNKPAYAK AEYQWRSESS VRAAMITSGE ADVAMGLSPD DNIGDLGVDY PNNETVALRM DANEAPLNDI RIRQAVNYAI DKEGIVNSLY QGKHQVAAQL VPEGIVGHND ELKGWPFDLE KAKSLVAEAK ADGVDTSKQI SLVVRSAQFP KITELAQVLQ EQLSQAGLNV KLKMLETSQH LTYQVRPFAE DDGAVALMTQ HGNQAGDAAF TVDQYMLSTG AQSYFGTPEF DAMIKKADAA SGDDRQKAFE EIFAYQNDKV VQFAHISHQT GILGKAKSVN YTPNSSSGDE LRISEMTPAS
|
| |
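The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned and its GC percentage computed is shown here. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the short `raw` string is only the first three blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three ten-base blocks of the gene sequence, for illustration only.
raw = "ATGACGTCTT CTTCAAAGTT TCCCTTTTCC"
seq = clean_sequence(raw)
print(len(seq))  # 30
print(round(gc_percent(seq), 1))
```

Run on the complete Gene sequence field, the same two functions could be used to compare the cleaned length and GC percentage with the Gene Length and GC content fields of the record.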