Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2948 |
Symbol | |
ID | 4444470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3314283 |
End bp | 3315629 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639690771 |
Product | extracellular solute-binding protein |
Protein accession | YP_832427 |
Protein GI | 116671494 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTATGA ATCTTGACCG CAGGCATTTC CTAGGGCTTG CCGGCGCAGG AGCCGGTGCC GCTGCGCTGG CAGCGTGCGG CGGCCCGTCC ACCGGCGGAA CAACGCCTGC AAGCGAAGCC GCCGAGATCG ACTTCAGCGG CGTCAAGCCT GCCGCCTCCA TCGACTTCTG GACGAGCCAC CCGGGCAAGT CCCAGGACGT CGAAAAATCC ATCATCGCCA AGTTCCACGC CAAGTTCCCG GACATCAAGG TGAACCTGGT CACCGCCGGT GCCAACTATG AGGAGATTGC ACAGAAGTTC CAGACCTCGC AGGCCGCCAA GGAGGCACTG CCGGGCCTTG TGGTGCTCTC CGATGTGTGG TGGTTCCGCT ACTTCACGAA CGGCAACATC ATTCCGCTGG ACGGACTGGT GAAACAGCTG GATATCAAGG TGGACGACTT CCAGAAGTCC CTCGTGGCCG ACTACCAGTA CGACGACAAG CAGTGGGCCC TCCCCTACGG CCGTTCGACG CCGCTCTTCT ACTACAACAA GGACCACTTC AAGGCGGCCG GCCTCCCGGA CCGGGCACCG AAAACCTGGC AGGAATTCGC CGAGTGGGCG CCCAAGCTGA AGGCAAGCTC CGGCGCGCAG TACGCCTACA TCTACCCGGC GCTGGCCGGC TATGCGGGCT GGACCCTGCA GAACAACCTC TGGGGATGGG GCGGCAGCTG GTCCAACGAG TGGACCATCA ACTGCGACTC GGCGGAATCG GTGGAGGCCC TGCAGTGGGC CCAGGATTCC ATCTACAAGG ACGGCTGGGC GGGTGTTTCC TCGAAGGAGG CCGCTGACGA CTTCGCCGCG GGCATCACAT CCTCCACCAT CTCGTCCACA GGGTCCCTGC TCGGTGTGCT GAAGTCCGCC AAGTTCAACG TGGGCGTGGG CTTCCTGCCG GGCGGCCCCA AGGTGGAAAG CGGCGTGTGC CCCACCGGTG GTGCCGGCCT GGGCATTCCC AGCGGTGTGA GCAAGGAAGT GCAGCTGGCT GCGGGCACCT TCCTGAAGTT CATGACCGAG CCGGAAAGCA CCGCGGAATT CTCTGCGGCA ACGGGCTACA TGCCTACGCG TGTTTCGGCC GACATGACGT CGGTACTGGC CAAGACGCCG CAGATCAAGA CGGCCATGGA CCAGCTCGCG GTCACCCGGG TCCAGGACAA CGCCCGCGTG TTCCTGCCCG GCGCAGACCA GGAAATGGCC AAGGCCGCAG CGAAGATCCT CACCCAGCAG GGCGACGTGA AGGCCACCAT GACCGCGTTG AAGTCCACGC TGGAGGGCAT CTACACGAAG GACGTCAAGC CCAAGCTCAA GAGCTGA
|
Protein sequence | MGMNLDRRHF LGLAGAGAGA AALAACGGPS TGGTTPASEA AEIDFSGVKP AASIDFWTSH PGKSQDVEKS IIAKFHAKFP DIKVNLVTAG ANYEEIAQKF QTSQAAKEAL PGLVVLSDVW WFRYFTNGNI IPLDGLVKQL DIKVDDFQKS LVADYQYDDK QWALPYGRST PLFYYNKDHF KAAGLPDRAP KTWQEFAEWA PKLKASSGAQ YAYIYPALAG YAGWTLQNNL WGWGGSWSNE WTINCDSAES VEALQWAQDS IYKDGWAGVS SKEAADDFAA GITSSTISST GSLLGVLKSA KFNVGVGFLP GGPKVESGVC PTGGAGLGIP SGVSKEVQLA AGTFLKFMTE PESTAEFSAA TGYMPTRVSA DMTSVLAKTP QIKTAMDQLA VTRVQDNARV FLPGADQEMA KAAAKILTQQ GDVKATMTAL KSTLEGIYTK DVKPKLKS
|
| |