Gene Information |
Locus tag | Arth_3379 |
Symbol | |
ID | 4444108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3800103 |
End bp | 3801440 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639691202 |
Product | extracellular solute-binding protein |
Protein accession | YP_832854 |
Protein GI | 116671921 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
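The coordinate and length fields in the gene information above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 3800103
end_bp = 3801440

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1338, matching "Gene Length"
print(protein_length)  # 445, matching "Protein Length"
```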
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCTAT TTTTCAAGGG AGAGAACATG GCAGCACAGT TTGATGCGTC GGCAACGGCT TACCCGAGCC GGCGGACCAT CCTCAAGACC GTCGGCGTCG GCGCGGCAGG CCTGGCCGGC ATTCCGTTCC TCGCCGCCTG CACGGGCGGG AGCGGGCCGT CGGCCACGGG TTCCGACTCC CCCGGCCTCA CCTTCGGTTC AGGATCCTCG GACGACGTCC CCAAGCGCGC GTACCAGGCC GTGACCGATG CCTTCACCGC CAAGACCGGC AAGAAAGTCA CCACGAACGT GGTGCCGCAC AACGATTTCC AGAACAAGAT CAATTCCTAC CTGCAGGGCT CCCCGGACGA CACCTTCACG TGGTTCGCCG GCTACCGCAT GCAGTACTAC GCGGGCAAGG GACTGCTCGC GCCGATCGAC GACGTCTGGG AGAGCATCGG CGCGAACTAC TCCGACGCGC TGAAGAAGGC ATCCACCGGC CCGGACGGCA AGCTGTACTT CGTCCCCAAC TACAACTACC CGTGGGGCTT CTTCTACCGG AAGAGCCTCT GGGCGGAGAA GGGCTACGAG GTTCCGGAAA CCTTCGACGC CCTCAAGGCC CTGGCCACCA AGATGAAGGC CGACGGCATC ATCCCGATCG GCTTTGCCGA CAAGGACGGC TGGCCGGCCA TGGGCACCTT CGACTACATC AACATGCGGC TCAACGGCTA CCAGTTCCAT GTGGACCTCT GCGCGCACAA GGAATCCTGG GACCAGCAAA AAGTGAGCGC GGTCTTTGAC ACGTGGGCGG AACTCCTTCC CTTCCAGGAC CCGGCAGCAC TCGGCCAGAC ATGGCAGGAC GCGGCAAAGG CGCTGGAGGC CAAGAAGACC GGCATGTACC TGCTCGGGTC CTTCGTGACC CAGCAGTTCA CCGATCCCGC AGTCCTGGCT GACATCCAAT TCTTCGCCTT CCCCGAGATC GCCATGGAAG GCCGGGACGC CGTCGAAGCA CCCATCGACG GACTCCTGCT GTCCAAGAAG GGCGGCGAGA ACAAGGCAGC CCGCGACTTC ATGGCATTCC TGGGAACGGC CGAAGCCCAG GACGCCTACG CCGCCGTGGA TTCCTCGAAC ATCGCCACGG CCAAGGGCAC CGACACGTCC AAGTTCACGC CGCTCAACAA GACGTGCGCT GACACCATCG CAAACGCAAA ATACATCAGC CAGTTCTTCG ACCGCGACGC CCTTCCGGCC ATGGCCAACA ACGTGATGAT CCCGGCCCTG CAGAGCTTCA TCAAGGACGG CAAGATGGAC GTCAAGAACC TCGAGGCACA GGCCAAGACG CTGTACGCCG CGCAGTAG
Protein sequence | MQLFFKGENM AAQFDASATA YPSRRTILKT VGVGAAGLAG IPFLAACTGG SGPSATGSDS PGLTFGSGSS DDVPKRAYQA VTDAFTAKTG KKVTTNVVPH NDFQNKINSY LQGSPDDTFT WFAGYRMQYY AGKGLLAPID DVWESIGANY SDALKKASTG PDGKLYFVPN YNYPWGFFYR KSLWAEKGYE VPETFDALKA LATKMKADGI IPIGFADKDG WPAMGTFDYI NMRLNGYQFH VDLCAHKESW DQQKVSAVFD TWAELLPFQD PAALGQTWQD AAKALEAKKT GMYLLGSFVT QQFTDPAVLA DIQFFAFPEI AMEGRDAVEA PIDGLLLSKK GGENKAARDF MAFLGTAEAQ DAYAAVDSSN IATAKGTDTS KFTPLNKTCA DTIANAKYIS QFFDRDALPA MANNVMIPAL QSFIKDGKMD VKNLEAQAKT LYAAQ
| |
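The protein sequence follows from the gene sequence under translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code. The sketch below is illustrative only: the helper `translate` and the way the codon table is built are assumptions made for this example, not the method used to generate the record, and only the first 30 and last 18 bases of the gene sequence are translated.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (TTT, TTC, TTA, TTG, TCT, ...).
# Amino-acid assignments are the same for table 11 as for the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0; spaces are ignored, '*' marks a stop."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 18 bases of the gene sequence in the record.
head = translate("ATGCAGCTAT TTTTCAAGGG AGAGAACATG")
tail = translate("CTGTACGCCG CGCAGTAG")

print(head)  # MQLFFKGENM
print(tail)  # LYAAQ*
```

Both fragments agree with the start and end of the listed protein sequence, with the final TAG codon read as the stop.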