Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3465 |
Symbol | |
ID | 4443775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3897763 |
End bp | 3898998 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639691289 |
Product | extracellular solute-binding protein |
Protein accession | YP_832940 |
Protein GI | 116672007 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTCAG CATTCACCAA GAGCCGTCCT CTCGCCGTTG CCGTCACCGC CCTCATCGGG CTGCCGGCCA TGCTCTCCGG CTGCGGTACC GCCGCCTCAT CATCATCCAC GGAGTCAGTC TCGGAGATCT CCGTCATGGA CTACTACAAC AATGAACCTG ACAAGACCTT CATCGGCGAC GCCCTCACCG CCTGCGGCAC CAAGGCCGGC GTCAACATCA AGCGGGAAAC GGTGCCCGGT AAATCCCTGA TCTCCAAGGT CCTTCAGCAG TCCTCATCCA AGACCTTGCC GGACGTGCTG ATGCTGGACA ACCCGGACCT GCAGCAGATT GCGGCCACGG GCGCCCTGGC TCCCCTGGCC GATTTCAACA TCAGCACCGC CGACTTCGCC CCCGGCGTGC TCAGTGCGGG CACGTACAAG GACAAGGTCT ACGGGCTGGC CCCCACTGTC AATACCATCG CCCTCTTCTA CAACAAGGAC ATCCTGACGA AGGCCGGAGT GACCCCGCCG GCCACGTGGG ACGAGCTGGA AGCGGCCGCC GCCAAACTGA CGTCCGGGGA CCAATACGGG CTGGCGTTCA ATGCCAACCC CACTTATGAA GGCACCTGGC AGTTCCTGCC GGTCATGTGG TCCAACGGCG GCAACGAAAA GAACATCGAC ACGGAGGAAA CGGCACAGGC CCTGCAGCTC TGGACCGACC TGGTGAAGGA CGGCTCGGTG TCCTCCTCCG CCCTGAACTG GACGCAGGCC GATGTCAAGG ACCAGTTCCT GGCCGGCAAG GCCGCCATGA TGGTCAACGG ACCGTGGCAG ATCCCCTCCC TCGACAAGCA GGCCTCCCTG CAGTACGGCG TGGTGAAGAT ACCCGTCAGG GAGGCCGGCC AGACCGTTGT TGCCCCGCTC GGCGGAGAAG TCTGGACTGT TCCGCAGACC GGGAACAAGG CCCGGCAGGC CAAGGCCGCA GAGGTGGTCT CCTGCCTCAA CAGCGACGAA AACCAGCTGG CCATGGCCAA GGTCCGGAAC ACTATCCCGT CCAAAACGAC CCTGGCAGCC AAGTTTGCCG AAGAAAACCC AAAACTCGCC ACGTTCACCG AACTTGTGAA AACCGCCCGC GCCCGCACGG GACAGCTGGG TGAGGAATGG CCCGCGCAGG CCACCAAGAT CTACACCGCC ATCCAGACGG CCCTCACCGG TAAGGCGACC CCGTCCGAGG CCCTGAAGCA AGCGCAGGGA CAGTAG
|
Protein sequence | MKSAFTKSRP LAVAVTALIG LPAMLSGCGT AASSSSTESV SEISVMDYYN NEPDKTFIGD ALTACGTKAG VNIKRETVPG KSLISKVLQQ SSSKTLPDVL MLDNPDLQQI AATGALAPLA DFNISTADFA PGVLSAGTYK DKVYGLAPTV NTIALFYNKD ILTKAGVTPP ATWDELEAAA AKLTSGDQYG LAFNANPTYE GTWQFLPVMW SNGGNEKNID TEETAQALQL WTDLVKDGSV SSSALNWTQA DVKDQFLAGK AAMMVNGPWQ IPSLDKQASL QYGVVKIPVR EAGQTVVAPL GGEVWTVPQT GNKARQAKAA EVVSCLNSDE NQLAMAKVRN TIPSKTTLAA KFAEENPKLA TFTELVKTAR ARTGQLGEEW PAQATKIYTA IQTALTGKAT PSEALKQAQG Q
|
| |