Gene Information |
Locus tag | Achl_3131 |
Symbol | |
ID | 7294611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | + |
Start bp | 3480280 |
End bp | 3481548 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643591541 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002489181 |
Protein GI | 220913872 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
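The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the listed Gene Length), and that the protein length excludes the terminal stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 3480280
END_BP = 3481548
GENE_LENGTH_BP = 1269
PROTEIN_LENGTH_AA = 422


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1269
print(protein_length(length))  # 422
```

Both results agree with the Gene Length (1269 bp) and Protein Length (422 aa) fields in the record.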
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.00803878 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGGGGCGG CTGCTGCAAC AGCCGTACTG GCCTTGGCGC TCACAGGTTG TGGCAGCAGT CCGCAGGCCG GAAAGGTCGG CACGGCGGAG GATCCCGTGA CCATCCGTTT CGCCTGGTGG GGCAACGATT CCCGCGCCAA AACCACGCTT GAAGTCATCA AGGACTTCGA AGCGGCCAAC CCCACCATCA AGGTGCAGGG CGAGAACACT GAGTTCAGCT CCTACTGGGA CAAGATGGCC ACGCAGATTG CGGGCGGCAC CACCCCGGAC GTTTTCGCCA TGAGCGGTTC CTACCCCAGC GAATACGCGT CGCGCGGGGT GCTGCTGGAC CTGGACAAGG TCAAGGACCA GATCGATACC TCCAAGTTTG CCGACGGAAC CGTGGAGTTG GGCCAGCTGG ACGGCAAGCA GTACACCATC ACCGCCGGCG TCAATGCCAT GTCCATGGTC CTTGATCCCA CAGTGTTTGA AGCCGCCGGC GTACCACTGC CGGACGACGA AACCTGGACC TGGGACGACT ACGTCGATAT TGCCGCCAAG ATCAGCAAGA ACTCCCCCGC CGGCACCTTC GGCACCACGC CGATGTCCAA CGATTCGTTC GTTGCCGTCT GGGCACGCCA GAGCGGCGAA GAGCTGTACA CGGACGACGG AAAGAAGATG GGTATCAGCG AGGGCACCCT CGCCAAGTGG TTTGAGTTCA ACAAGAAACT CATGGACACC GGCGGCGCAC CTTCCGCGTC GCAGACCGTC GAAGACGGCT CAGCGCAGCC GGAACTGACG CTGATGGGCC AGGGTAAGCA GGCGATGAAG GTGTCGTGGA GCAACCAGAT GACCTCTTAC TCGGGTGCGC CCCTGACCAT GGTGAAGCTG CCCGGGGAAA GCAAGCAGCC GGGAACCTGG CTGCGCTCCT CCATGGAGTA CGCCATCTCG TCCAAGTCCG CCCAGTCCAA GGAAGCTGCC CTTTTCATCA ATTATTTGGT GAACAACATG GATGCTGCCA GCAAGATCAA GAGTGACCGC GGCATGCCCG CCAACACCGA TCTCAAGGCG GGCATCACCC CCCTGCTGAA GGAAACCCAG CAGAAGGAGG CGGGATACCT GGACCGCATC GCCGAGCTGG ACGTCAAGCC GCCCCAGCCG TTCCCGGCAG GTTCTTCTTC CACCCTGGAA GTTTTGAACC GATACAACAC GGATGTACTC TTCGGGAAGA TCTCGCCGCA GGATGCGGCA AAGGGCGTCA TCAGTGAGGT CAATTCGAAC CTGGGGTAG
|
Protein sequence | MGAAAATAVL ALALTGCGSS PQAGKVGTAE DPVTIRFAWW GNDSRAKTTL EVIKDFEAAN PTIKVQGENT EFSSYWDKMA TQIAGGTTPD VFAMSGSYPS EYASRGVLLD LDKVKDQIDT SKFADGTVEL GQLDGKQYTI TAGVNAMSMV LDPTVFEAAG VPLPDDETWT WDDYVDIAAK ISKNSPAGTF GTTPMSNDSF VAVWARQSGE ELYTDDGKKM GISEGTLAKW FEFNKKLMDT GGAPSASQTV EDGSAQPELT LMGQGKQAMK VSWSNQMTSY SGAPLTMVKL PGESKQPGTW LRSSMEYAIS SKSAQSKEAA LFINYLVNNM DAASKIKSDR GMPANTDLKA GITPLLKETQ QKEAGYLDRI AELDVKPPQP FPAGSSSTLE VLNRYNTDVL FGKISPQDAA KGVISEVNSN LG
|
| |
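The sequences above are printed in space-separated blocks of ten characters. As a rough sketch, the snippet below shows one way to strip that spacing, compute GC content, and translate the gene sequence for comparison with the listed protein sequence. The helper names (clean_sequence, gc_percent, translate) are hypothetical. The codon table is written out by hand on the assumption that translation table 11 assigns the same amino acids to codons as the standard code, and the example uses only the first three blocks of the gene sequence rather than the full record.

```python
# Codon table in NCBI order (bases T, C, A, G). Assumption: translation
# table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
first_blocks = clean_sequence("ATGGGGGCGG CTGCTGCAAC AGCCGTACTG")
print(translate(first_blocks))  # MGAAAATAVL
```

The output matches the first block of the protein sequence (MGAAAATAVL). Passing the full gene sequence through the same functions would allow the GC content and Protein Length fields to be checked in the same way.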