Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0577 |
Symbol | |
ID | 4446928 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 616591 |
End bp | 617913 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639688375 |
Product | extracellular solute-binding protein |
Protein accession | YP_830076 |
Protein GI | 116669143 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCTACCA AACCGTCCAT CGCCTCTTCC GCCTTCTCCC GGCGGGGGTT CCTGGGCTTC GCAGCCGCCG CGGCCTCTGC TCCCCTCCTG GCCGCCTGCG GCGGCGGTTC CGCTTCGCAG GGCGGAGGCG GCGCGGGCGG GACGATCAAG TTCTGGGACA TGCCGTGGGC CACCCCCGCT TACAACGACG CCGCCAAGAA GCTCGCAGAA GGGTTCTCCG GATCCGGCAG CAAAGCCAGC TACCAGATCA TCCAGTGGAA CAACTTCTAC CAGACGTTCT CCTCGGCCAT CGCGTCGAAG ACCGGTCCCG CGGTGTCCAC CGGCGGCGGC TTCCAGGCCT TCCAGTTTGA GCAGCAGGGC CAGATCGCGT ACGCGGACAA GGTCATTGAC AAGCTGAAGG AGAACGGCCA GTTCGATGAC TTCCTTCCCG GCGTCCTCGA CCCCTTCAAG TCCGACAAGG GCTACGTCGC TGTCCCCTGG CAGCTGGACA TGCGCGTGTT CTGGTACCGC AAGTCGCTGT TCGAGAAGGC GGGCGTAGGA CTCCCCACCG ACTGGCCGTC CCTCCTGGAA GCCGGCAAGG CGCTCAAGAA GGTCGGAGCC TTCGGCTTCA CCACCGGTGC AGGCGCCGGC AACAACTACG GCAACCACTC CATGATCATG ATGATGGTCA ACAATGGCGG CGGCGTCTGG AACAAGGACG GCGAACTGGA CCTGATGAAC GACCGCAACG TGGAAGCCAT GGAATTCGTC CTTGAGCTGG TGTCCAACGG AATCGTCGAT CCGGCCGCCG TCAGCTACAC CACGGACAAC ATGTCCGCGC AGTGGAAGGA CGGCAAGGCC GGGTTCGGCC TGTTCCAGGT CAACGTGCCG CAGCGCGTCG GCGACACTTC CGGAGATCTG CTCGTTGCGG ACCCCATCAC CGGACCGCAC GGCGACAAGG CCACCATCGT CTTCCCGAAC AACATCATGA TGTACACGAA CACTCCGTCT CAGGAAGCAT CCGAGGCCTT CCTGGTGTAC TACCTGGGCC AGCTCAAGGA ACTGTGGCGC CAGAAGCTGA TGTCTGCCCT CCCGGTCTTC AAGTCGATTA CGGAATTGCC CGAATTCGCC AATGATCCCA ATAACGTAAA GATCGTCAAG GACTGGCAGC CGATCGCCAA GACCTTTGCC GCCCAGGGCA AGACCTTGAA CGCTAACCTC GCGGCGCTGG ACGGCGGACA GGCCCTGAAC CAGTTCAGCC AGACGATCCT CACCGGCAAG GCTGACGCCA AGACTGCCCT GCAGACGTTC CAGTCCGGCC TCGAATCGGT CCTGAAGAAG TAA
|
Protein sequence | MSTKPSIASS AFSRRGFLGF AAAAASAPLL AACGGGSASQ GGGGAGGTIK FWDMPWATPA YNDAAKKLAE GFSGSGSKAS YQIIQWNNFY QTFSSAIASK TGPAVSTGGG FQAFQFEQQG QIAYADKVID KLKENGQFDD FLPGVLDPFK SDKGYVAVPW QLDMRVFWYR KSLFEKAGVG LPTDWPSLLE AGKALKKVGA FGFTTGAGAG NNYGNHSMIM MMVNNGGGVW NKDGELDLMN DRNVEAMEFV LELVSNGIVD PAAVSYTTDN MSAQWKDGKA GFGLFQVNVP QRVGDTSGDL LVADPITGPH GDKATIVFPN NIMMYTNTPS QEASEAFLVY YLGQLKELWR QKLMSALPVF KSITELPEFA NDPNNVKIVK DWQPIAKTFA AQGKTLNANL AALDGGQALN QFSQTILTGK ADAKTALQTF QSGLESVLKK
|
| |