Gene Information |
Locus tag | Achl_3534 |
Symbol | |
ID | 7295015 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | - |
Start bp | 3915675 |
End bp | 3917012 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643591940 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002489579 |
Protein GI | 220914270 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
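
The coordinate and length fields above are consistent with one another, and a short sketch can confirm this. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable names are illustrative and are not part of the database record.

```python
# Values copied from the record above.
start_bp = 3915675
end_bp = 3917012

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values match the Gene Length (1338 bp) and Protein Length (445 aa) fields.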
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.582574 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGGCC ACATCATGAA GAAAGCACCA AGCCCGGCTG GCCGGAGGTT TCTCTCCGTT GCGGCACTGG CATCCGTTAC CGCATTGGCG TTGAGCGCCT GTGGTGGGGG CGACACCAAC AGCAACGCAC CCATCGCTGA AGAAACCGGC CCTGTTGAGA TCACCCTCGC AACCCCTGCC TTTACAGGTG GTGGCGCAGG AAACCCTTAC CTGACATTGA TCGACGCCTT CAAGGCCAAG AACCCAAACA TCACGGTCAA GCTGGTGGAG TCGCCCAATG ACCAGCACGG TCAGACCATG CGCACCCAGC TCCAGGCAGG CAACGCGCCG GACATTTTCT ACGTCACGGC AGGGCGGGGA AACAACCAGT CGTTCGCCTC GCTGGCCGAG GCAGGCTACC TGCAGGACCT GACGGACCAA AAGTGGGCAG CCGACGCGAT CCCCGCTTCC GCCAAGAACC TGTACTACGA CGACGACAAG GTCTTTGCCG TTCCGGCCGA CCTCGCACCG ATCACCATGC TTCAAAACAC GGGTGTCTTG AAGGAGCTTG GGCTCCAGGA ACCCGCCACC CTGGATGAGC TGATCACCCA GTGCAAGACT GCGCGGGCAG CCGGCAAGTC GTACTTCGCA GTGGCAGGAA CCTCCGGCGC CAACACGGGC CTCCAGGCAA TGCAACTCGC GGCATCACTG GTTTACGCCA AGGACCCCCA GTGGGACGCC AAACGCGCCA AGGAGGAAAC CACCTTCGCT GACTCGGACT GGAAGAAGGT GCTGGAGCAG ATCGTGAAGT TCAAGGACGC CGGCTGCTAC CAGGACGGTG CCGCGGGTGC CGGCTTTGAC CAGCTTTTCC CGTCAGTTGC CCAGGGCAAG GTGGCGGCAG CATTCGCCCC CGCCGGCGCT GTTGCCGCAC TCCGGGCGCA GGTCAAGGAC GGATCCTTCG ATGTAGCTGT TCTGCCTGGC GAAACGGCTG AAGACAGCCG GCTGATCGCA AGCCCGGGCA ACGCCATGGC CGTCAACGCT GCCGGCAAGC ACAAGGGTTC CACCCTGAAG TTCCTGGAAT TCCTGGCCCA GCCCGCCAAT CAGGATGCCT TGGCGAAAGC AAACGGAAAC GTATCTGTCA CCTCGGCACT TTCAGGTACC GTGCCCGAGC AGTTCCCGCT GCTGGAACCG TACTTCTCTG AGCCCGAGAA AAAGATCGTC ACCCAGCCCA ACTACCTCTG GCCCAACAGC GGCGTCTACG ACTCGCTCGG TACCGGCATC CAGGGACTTC TGACCGGGCA GGCCACCCCT GACCAGGTTC TGAAGACCAT GGACGAGCAG TACGACCGCG GCGCCTAG
Protein sequence | MKGHIMKKAP SPAGRRFLSV AALASVTALA LSACGGGDTN SNAPIAEETG PVEITLATPA FTGGGAGNPY LTLIDAFKAK NPNITVKLVE SPNDQHGQTM RTQLQAGNAP DIFYVTAGRG NNQSFASLAE AGYLQDLTDQ KWAADAIPAS AKNLYYDDDK VFAVPADLAP ITMLQNTGVL KELGLQEPAT LDELITQCKT ARAAGKSYFA VAGTSGANTG LQAMQLAASL VYAKDPQWDA KRAKEETTFA DSDWKKVLEQ IVKFKDAGCY QDGAAGAGFD QLFPSVAQGK VAAAFAPAGA VAALRAQVKD GSFDVAVLPG ETAEDSRLIA SPGNAMAVNA AGKHKGSTLK FLEFLAQPAN QDALAKANGN VSVTSALSGT VPEQFPLLEP YFSEPEKKIV TQPNYLWPNS GVYDSLGTGI QGLLTGQATP DQVLKTMDEQ YDRGA
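
The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The sketch assumes the gene sequence is already shown in coding orientation (it begins with ATG and ends with TAG, even though the gene lies on the minus strand), and that the spaces in the sequence are display grouping only. For elongation codons, translation table 11 uses the same amino-acid assignments as the standard code, so the standard codon table is used here.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base and drop trailing stop codons."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore an incomplete final codon
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))
    return protein.rstrip("*")


# First two 10-base groups of the gene sequence above.
print(translate("ATGAAAGGCC ACATCATGAA"))  # MKGHIM
```

Passing the full gene sequence to `translate` and `gc_content` would allow the result to be compared against the Protein sequence and GC content fields of this record.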