Gene Information |
Locus tag | Arth_3329 |
Symbol | |
ID | 4444058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3739225 |
End bp | 3740520 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639691152 |
Product | extracellular solute-binding protein |
Protein accession | YP_832804 |
Protein GI | 116671871 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
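The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 3739225
end_bp = 3740520


def gene_length(start, end):
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp):
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1296
print(protein_length(length_bp))  # 431
```

Both values agree with the Gene Length (1296 bp) and Protein Length (431 aa) fields of the record.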
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0332045 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGTTGG CTCTCAAGAA GTCTGTAATG GGTGTCGCCG GCGCCACGGC TGCGCTGGCG TTGGTACTGA CGGGCTGCGG CAACAGCCCG CAGGCCGGGA AGGTGGGAAC GGCGGAGGAC CCGGTGACCA TCAGGTTCGC ATGGTGGGGC AATGACTCCC GCGCCAAGAC CACCCTGGAA GTGATCAAGG ACTTCGAGGC TGCCAACCCC ACCATCAAGG TGCAGGGTGA GAACACCGAG TTCAGCTCCT ACTGGGACAA GATGGCAACC CAGATCGCCG GCGGAACGAC GCCGGACGTG TTCGCCATGA GCGGCGCCTA CCCCAGCGAA TACGCCAGCC GCGGTGTGCT CCTGGACTTG GACAAGGTCA AAGACCAGAT CGATACCTCC AAGTTCGCCG AGGGAACGGT GGACCTGGGC AAGATCGACG GCAAGCAGTA CACCATCACG GCAGGCGTGA ACTCGATGTC CATGGTCATC GACCCCCAGG TCTTCGAGGC TGCCGGGGTG CCGCTGCCGA ACGACGAAAC CTGGACTTGG GACGACTACG TGGACATTGC CGCGGAGATC GCGAAGAAGT CGCCGGCGGG CACGTTCGGC ACCACGCCCA TGGCCAATGA TTCCTTCCTG GCCGTCTGGG CACGCCAGAA CAACGAGGCC CTCTACACGG ATGACGGCAA GAAGATGGGA ATCAGCGAAG GCACCCTGAC CCGCTGGTTT GAACTGAACA AGAAGCTCAT GGACACCGGC GGCGCCCCGT CGGCCTCCCA GACCGTGGAG GACGGCTCGG CCCAGCCCGA ACTGACCCTC ATGGGCCAAG GCAAACAGGC CATGAAGATT TCGTGGAGCA ACCAGATGAC GTCCTATTCC GGTTTCCCGC TGGTCATGAT GAAGATGCCC GGTGAAAGCA AGCAGCCCGG AGCCTGGCTG CGTTCCTCCA TGGAATATGC CATTTCCTCG AAATCCGCGC AGTCGAAGGA AGCCGCACTG TTCATCAACT ACCTGGTCAA CAACATGGAC GCCGCCACCA AGATCAAGAG CGACCGTGGC ATGCCCGCCA ACACGGACTT GAAGGCCGGT ATCACCCCGT TGCTGAAGGA AGGCCAGCAG AAGGAGGCCG CCTACCTGGA CCGGATCGCG GAGCTGAATG TGGCCCCGCC CAAGCCGTTC CCGGCCGGTT CTTCCGCCAC CCTTGAGGTG CTGAATCGCT ACAACACGGA CGTTTTGTTC GGGAAGATAT CCCCGCAGGA CGCTGCGAAG GGCTTCATTT CCGAGGTCAA CCAGAACCTG GGCTGA
|
Protein sequence | MRLALKKSVM GVAGATAALA LVLTGCGNSP QAGKVGTAED PVTIRFAWWG NDSRAKTTLE VIKDFEAANP TIKVQGENTE FSSYWDKMAT QIAGGTTPDV FAMSGAYPSE YASRGVLLDL DKVKDQIDTS KFAEGTVDLG KIDGKQYTIT AGVNSMSMVI DPQVFEAAGV PLPNDETWTW DDYVDIAAEI AKKSPAGTFG TTPMANDSFL AVWARQNNEA LYTDDGKKMG ISEGTLTRWF ELNKKLMDTG GAPSASQTVE DGSAQPELTL MGQGKQAMKI SWSNQMTSYS GFPLVMMKMP GESKQPGAWL RSSMEYAISS KSAQSKEAAL FINYLVNNMD AATKIKSDRG MPANTDLKAG ITPLLKEGQQ KEAAYLDRIA ELNVAPPKPF PAGSSATLEV LNRYNTDVLF GKISPQDAAK GFISEVNQNL G
|
| |
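The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own pipeline. It assumes the sequence is given 5' to 3' on the coding strand, as shown above, with spaces between 10-base groups. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons it allows. This sketch does not handle alternative start codons, which is sufficient here because the gene starts with ATG. All helper names are hypothetical.

```python
from itertools import product

# Codon-to-residue assignments in TCAG order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the whitespace between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence above.
fragment = "ATGAGGTTGG CTCTCAAGAA GTCTGTAATG"
print(translate(fragment))  # MRLALKKSVM
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the same comparison against the GC content and Protein sequence fields.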