Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0048 |
Symbol | |
ID | 4447483 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 53340 |
End bp | 54881 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639687842 |
Product | extracellular solute-binding protein |
Protein accession | YP_829549 |
Protein GI | 116668616 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTTCGGT TCCGTTGTCA CGGATGGACC CGGCGGATGC CAAAACCTGC CAAATCGGCC AAAATATGCC AAACCGTTTC TTCTGCCTTC GACGTCGACT CTAGGGCTGC GCCCGCCCAT GCGTCAAGGC CCCAAATAAT TTGCATTGTG ACGCTTGACA CGCCTGAACG GACGCCCCTA GGGTGTTGCC AAGTCGTTTT GGCAAAACGA TTCTTCACCC CACAAACATT CTTGCCATTC CGTTCCGAGA GAAAGTCGAC AGCGATGTCC ACTCGAAAGA CCATCTCCAG GCTCGCCGCC ATCGGCGGCC TTTGCACGGC CGTGGCCCTG ACGGCCACCG CATGCGGCGC TGGGGGTCCG GCGTCGTCCG GCAGCGCCGC AAGCTCCGTC AACGTCCTTG TCGAAGCCGG CGGGCACGCC GAGCTCGCCG GCGTTGCCGA GGCCTGCAAA AAGGACACCG GCCTCGACGT CAACTTCGTC GAACTGCCCT ACGACGGCTT GTTCAACAGG CTCTCCAGCG AATTTTCCTC CGGCACTGTC TCCTTCGATG TCGCCGCTCT GGACTCCGTC TGGCTCCCCA GCTTCAAGGA TGCGGTCCAG CCCATCGACG AGCTCTTCAC CGATGAGGCC AAGAAGGACA TCTTCCCCGC ACTGGTCAAG GAAGCCAACG TTGATGGCCA CTTCATCGGC ATGCCCGCCT GGACCAATGC CGAAATCATC CTGTACCGCA AGGACCTCTT CGAGGACGCC AAAAACAAGG CCGACTTCAA GGCAAAGTAC GGATACGAAC TTGCAGCCCC CACCACCTGG AAGCAGTACC AGGACATCTC CGAGTTCTTC ACCAAGGATG GCATGTACGG CACCGACGTG AAGGGTGCCG TCGAAACCGA ATGGCTGGCC CATGTTCTCC AGGCCGGGTC CCCGATGGTC CTGGACGACC AGAACAACGT CGTGGTCGAC AACGCAGCCC ACAAGGAAGC CCTCGATTTT TACACGAGCC TTGTTAAGTC CGCGCCGTCC GGAGCGGCAC AGGTCGACTG GGCTGCCGCG CAGAACCTCT TCAACCAGGG CAAGACCGCG ATGACCAGGT TCTGGGCCCA CGCCTACCGC CAGATCCCCG CCGACGCCGC TGTCTACGGC AAGGTGGGCG CGGCTCCCAT GATCGGCGGA TCCGCCGGCG TCGCCGGCGT CCCGGGACCG TGGTACCTCT CCGTCCCCAA GGCGACGAAG AACGCAGACG CCGCCAAGAA GTTCATCAAG TGCGCCTACG ACCACAACGA CCTGGGCATC GAGTCCAAAC TGGGGCTCGC CGCCCGCATC TCGGCCTTCG AGAAGTACCA GGACAAGCCC GGTTATGAGA GCTTCAAGCC GTTGATCGAG ACGCTCAACG GCGAGGCCAC GGCGACCCGC CCGGCAACGG CGAAGTGGCA GCAGATTGTG GACACGGTCC TGGTACCGAC GCTGCAGAAG GCAGTGGCCG GAGGGGACAG CGCGTCCCTC CTGGCTGAAG CCAAGACCAA GATCCAGGCC CTCGTCAAAT GA
|
Protein sequence | MLRFRCHGWT RRMPKPAKSA KICQTVSSAF DVDSRAAPAH ASRPQIICIV TLDTPERTPL GCCQVVLAKR FFTPQTFLPF RSERKSTAMS TRKTISRLAA IGGLCTAVAL TATACGAGGP ASSGSAASSV NVLVEAGGHA ELAGVAEACK KDTGLDVNFV ELPYDGLFNR LSSEFSSGTV SFDVAALDSV WLPSFKDAVQ PIDELFTDEA KKDIFPALVK EANVDGHFIG MPAWTNAEII LYRKDLFEDA KNKADFKAKY GYELAAPTTW KQYQDISEFF TKDGMYGTDV KGAVETEWLA HVLQAGSPMV LDDQNNVVVD NAAHKEALDF YTSLVKSAPS GAAQVDWAAA QNLFNQGKTA MTRFWAHAYR QIPADAAVYG KVGAAPMIGG SAGVAGVPGP WYLSVPKATK NADAAKKFIK CAYDHNDLGI ESKLGLAARI SAFEKYQDKP GYESFKPLIE TLNGEATATR PATAKWQQIV DTVLVPTLQK AVAGGDSASL LAEAKTKIQA LVK
|
| |