Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0482 |
Symbol | |
ID | 4447037 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 513005 |
End bp | 514336 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639688279 |
Product | extracellular solute-binding protein |
Protein accession | YP_829981 |
Protein GI | 116669048 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00268905 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGATTCCA AGATCAGGTT CCGAGCACGG GTTGCCGTGG CCCTGTCCAT CGCTTCAGCG GCTGCATTGA CCGGCTGCGG GAGCGGGCCG TCCGCCCCGG CCGCCACGGA CGACGGCCAG CCCATTGAAG TGTGGGCACG CGCTGGCACC GACGCCGCCA CCACCTACGC CGCCATGTTC AAGGAATTCA CGGCGAAGAC GGGTGTGAAG GTCAATTTCC AGGGGGTTCC GGACCTGGAC CAGCAACTGC AGACCCGCGC GGCATCCAAG AAGCTCCCGG ACATCGTCAT CAACGACTCC GCCGCGCTGG GCAACTACAC GGCGCAGGGC TACCTCCAAA AGATCGACAG GTCCTCAGTG ACCGGTAGCG ACCGGATTGC CGATTCCCTG TGGAAGGAGA CTGAAGGCCT GGACGGAGCA AGCTACGGAG TTCCGTTCTC CCGCCAGACC ATGGTCACCA TGATCCGCAA GGACTGGCGC GAAAAGCTGG GACTTCCCGT CCCCAAGACG CTTGAGGACC TGGCGAAGCT CGCCACGGCC TTCGCCACCC AGGATCCGGA CGGCAACGGC CAGGCAGACA CATACGGCAT GGTCGTTCCC GGCTCCACCG AACGCGGATA CCTGGGCTGG TGGGCTTCCT CGTACATCTG GCAGGACGGC GGCAGCTACC TGAAGGATGA GGGCAACGGA AAGTACTCGT CCAACGCCTC CTCCCCGGAA ACCCAGGCAG GTGTCAGCTG GGTCAAGCAG CAGTTCTGCA CTCCGGGAAA CACGCAGCCC GGTGCCCTGA CCGCAGCCAC CAGCGTGGCC TCCCCGTTCT TCCAGACCGG CAAGGCCGGC ATCATCCTCA CCGGCCCCTA CAACTTCTCA TCGTTCGACA AGACTCCCGG CAAGGACGTC TACGAGGTCA TCGAAGCGCC CAAGGGCTCC AAGGACAACA CCGTGCTGGC CGAGGGCGAG AACATCTATG TCACGGCAAG CAACGCCAAG AAGGACCAGA CCAAGCAGGT CATCGACTAC CTGGTTTCCC AGGACGGCCA GAAGGCCGGC ATGACGGCCG GCAAGCAGCC GATCGTCCGC GTACCGGTGA ATTCCGACGT CGATGCTGCC GCCGTCTACA ATGATCCGCG CTGGTCAGTG GTCCAGGAGG CACTGAAGTC GTCGTCCAAG GCCTTCCCGT CCGCCATCAA CTTCGTCCCG TTCCGCCAGG CCGCGGCTGA AGCGCTGAAC AAGATCGTGG CCGATTGCCC TGCAGACAAC GTGGCCTCCG GTTTGCAGGC CCTGGACACC GCGATCAATG ACGAACTGAA CAGCCAGAAC GCCAAGTCAT GA
|
Protein sequence | MDSKIRFRAR VAVALSIASA AALTGCGSGP SAPAATDDGQ PIEVWARAGT DAATTYAAMF KEFTAKTGVK VNFQGVPDLD QQLQTRAASK KLPDIVINDS AALGNYTAQG YLQKIDRSSV TGSDRIADSL WKETEGLDGA SYGVPFSRQT MVTMIRKDWR EKLGLPVPKT LEDLAKLATA FATQDPDGNG QADTYGMVVP GSTERGYLGW WASSYIWQDG GSYLKDEGNG KYSSNASSPE TQAGVSWVKQ QFCTPGNTQP GALTAATSVA SPFFQTGKAG IILTGPYNFS SFDKTPGKDV YEVIEAPKGS KDNTVLAEGE NIYVTASNAK KDQTKQVIDY LVSQDGQKAG MTAGKQPIVR VPVNSDVDAA AVYNDPRWSV VQEALKSSSK AFPSAINFVP FRQAAAEALN KIVADCPADN VASGLQALDT AINDELNSQN AKS
|
| |