Gene Information |
Locus tag | Arth_0744 |
Symbol | |
ID | 4446749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 802157 |
End bp | 803455 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639688549 |
Product | extracellular solute-binding protein |
Protein accession | YP_830242 |
Protein GI | 116669309 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
| |
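The coordinate and length fields above are linked by simple arithmetic. The sketch below checks that link, assuming Start bp and End bp are 1-based and inclusive (the record does not state the convention, but its values are consistent with it) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length_bp = gene_length(802157, 803455)
print(length_bp, protein_length(length_bp))  # prints "1299 432"
```

The result agrees with the Gene Length (1299 bp) and Protein Length (432 aa) fields.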
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0587531 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGTGC AGAACACCCT CACGAGCGGG ACCAGCCGCC GCGTCTTCAC GGCGGGCGCG CTCTCCGTCG TGGCAGCCTT GGCGCTCAGC GCCTGCGGCG GCGCATCGGC CACCGCGCCG GCAACGGCTA CCGCCAAGCC GGACCTGAAG TCTGCCGGTG CAGGCACCAT CACCATGTGG GTCGACGCCG AACGGTCACC GGCCCTGAAG GACATCACCG CCAAGTTCCA GGCTGACACC GGCATCGAGG TCAAGCTCGT CGTCAAGGAC TTCGCGGCCG TCCGCGATGA CTTCATCACC CAGGTACCCA CGGGCAAGGG CCCGGACCTG ATCGTCGGAC CGCATGACTG GGTCGGCAAG TTCGTCCAGA ACGGCGTCAT TGCTCCGATC GAACTCGGGG ACAAGTCGTC GGCCTTCCAG GATTCGTCCA TCAAAGCCAT GACCTTCAAC GGTTCCGTCT ATGGTGTCCC GTACGCCATC GAAAATATCG CGTTGCTGCG CAACACCGAC CTGGTGGCCG AGAGCGCGGC AACGCTGGAC GAGGTAGTGG CCAACGGCAA GAAGGCAGTG GCGGACGGCA AGGCCAAGTT CCCGTTCCTG GTGGGCATGG ACCCCAAGCA GGGTGACCCC TACCACCTCT ACCCGCTGCA GGCCTCCATG GGTTCGCAGG TCTTCGGCCA GGCTGCCGAC GGTGGCTACG ATCCCAAACA GCTCCTCATC GGTGACGCCG CCGGTGTTGA GTTCGCCAAG AAGCTCGTGG CCTGGGGCGA CGCAGGTGAA AAGATCATCA ATTCCAACAT CACCGGAGAC ATCGCCAAGG AGAAGTTCCT GGCAGGCGAA TCCCCTTACT TCCTGACGGG CCCCTGGAAC GTTCCGGATG TCCAGAAGAA GGGCATCAAG TTTGCCGTGG ACGCCCTGCC GACCGCCGGC GACAAGCCCG CCCAGCCGTT TATCGGCGTC AACGGTTTCT TCATCAGCGC CAAGAGCGCT AACGCCCTTG CCACCAATGA ATTCGTAACC AACTACCTCA CCTCCGAAGC GGCGCAGGAT TCCATGTACA AGGCCGGCGG CCGTCCGCCG GCGCTGAAGG CCTCCTTCGA GAAGGCAGCC AGCGACCCCG TGGTGGAAGC ATTCGGCAAG ATCGGCGCCA CCGGCGTGCC CATGCCTGCC ATTCCGGAAA TGAGCGCGGT CTGGGCCGAC TGGGGAGCCA CTGAACTCGC GCTGATCAAG GGCCAGGGCG ATCCCGCTGC CGAGTGGGCC AAGATGGCTG CCAGCATCAA GGCGAAGATC GCCGGCTGA
Protein sequence | MKVQNTLTSG TSRRVFTAGA LSVVAALALS ACGGASATAP ATATAKPDLK SAGAGTITMW VDAERSPALK DITAKFQADT GIEVKLVVKD FAAVRDDFIT QVPTGKGPDL IVGPHDWVGK FVQNGVIAPI ELGDKSSAFQ DSSIKAMTFN GSVYGVPYAI ENIALLRNTD LVAESAATLD EVVANGKKAV ADGKAKFPFL VGMDPKQGDP YHLYPLQASM GSQVFGQAAD GGYDPKQLLI GDAAGVEFAK KLVAWGDAGE KIINSNITGD IAKEKFLAGE SPYFLTGPWN VPDVQKKGIK FAVDALPTAG DKPAQPFIGV NGFFISAKSA NALATNEFVT NYLTSEAAQD SMYKAGGRPP ALKASFEKAA SDPVVEAFGK IGATGVPMPA IPEMSAVWAD WGATELALIK GQGDPAAEWA KMAASIKAKI AG
| |
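The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a minimal sketch, not the database's own pipeline. It assumes the sequence is pasted in with its 10-base spacing, and it uses the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs only in which codons may act as start codons, which this sketch does not model). All function names are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two 10-base groups of the gene sequence in this record.
print(translate("ATGAAGGTGC AGAACACCCT"))  # prints "MKVQNT"
```

The printed residues match the start of the protein sequence (MKVQNT...). Passing the full gene sequence to `translate` and `gc_fraction` would allow comparison with the Protein sequence and GC content fields.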