Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0426 |
Symbol | |
ID | 4447086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 455919 |
End bp | 457529 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639688225 |
Product | extracellular solute-binding protein |
Protein accession | YP_829927 |
Protein GI | 116668994 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTACCA GCCGCAAGCT CGCCGTCCTT GGCGCCCTGA TGGCAGGAAC GATGCTGTTC ACCGCCTGCT CGGGCAGCTC CAACGGTTCC GCCGCCATCA AAGACTCGTC CGCGGAGTTC GGCTTCCAGG AGACCGGCTT CCCGATCGTC AAGGACACGC TGACGCTCAA GTTCTCCGGA ACCAAGTCGG CGCTCGCCCC CGATTACAAC ACCATGTCCC TGGTGCAGCA GTGGGAAAAG GACACCAACA TCCACATCGA CTGGGAGAAC CTCCCGGAGA CGGTGTTCAA GGAAAAGAAA AACCTCATCC TGGCCAGCGG CGACCTGCCC GACGCCTTCT TCAACAGCGG GCTCACCGAC GCGGAAATCG CCACCTACTC GGCCAGCGGA ACACTGATCC CCCTCGAAGA CCTCATTCAG AAAAATGCCC CCAACCTGTC CAAGCTGCTC GCCGACCGGC CGGACATCAA AGCGGCCATC ACCTCCTCCG ACGGGCACAT CTACTCCCTC CCCTCCATCG AAGAACTGGG ACTCGTCCAG TTCCCCAACG AGATGGCGAT CAACACCGCG TGGCTGAACA AGCTGGGCCT CCCGATGCCC AAGACCGTGG ACGAACTGCA TGATGCCCTG CTCGCCTTCA AGACCAAGGA CGCCTCAGGC ACCGGTAAAA CCATCCCGCT GAGCTTCATG CCCGGCTCCT GGTGCGGTGA CATCGTTGAC CTCATCGCCG CCTTGGGCGG AGTCCCGGAC AACATGGACC ACAGGATCGT CCAGGACGGC AAGGTCATCT ACACCGCCAC CCAGGACGGC TACAAAAAGG CCCTCCAGAC CCTGCATACC TGGTATCAGG AAGGCCTGAT CGATCCAGAA TCGTTCTCCC AGGATGACAA GGCCTACCTG GCCAAGGGCA AGGCCAGCAC CGAAAACCTG GGCTCCTTCG TCTGGTGGGA AGTCAAGGAA ATGGTCGGCG CCGACCGCGC CGGCGACTAC AAACTGCTCC CCGTACTTGA GGGCGTGGAC GGCAAGCGGC TCGCCAGCCA GTCCAACAAC CAGGAAATCG CCCGCGGCGC CTTCGCTGTG ACCCGAACCA ACAAATACCC TGCCGCCACC ATCCGCTGGG CAGACAACCT GTACGATCCC ATCCAGTCCG CCCAGGCCAA CTGGGGCCCC ATCGGTGAAA CCCTGCAGAA GGACCCCGCC ACCGGGCTGC TGACCCAGAT ACCCGCGGCC GCGGGAACCA GTGAAGGCGA ACGCCGCCAG AAGGTTGCCC CGGGCGGCCC GAAGGCCAAC ACCGCGGAGA ACTTCGAGAA GGTCGTGGCA CCCGAGCCGC GCGCGGCCGA GCGGCAGAAG ACCGTCGAGG AGAACTACAA GCCTTTCGCA GCCAACGACG GCTACCCCCC GGTGGCACTG TCCAACGAGG AAGTGCAGCA GATCAGCACC ATCGAGACGG ACGTGGCCGC CATCGTCAAG CAGACCACGG CGAAATGGAT CGTCTCCGGC GGCATCGAGG CGGAGTGGGA CGGCTACGTC TCGCAGCTGA AGAACATCGG CCTGGACAAG ATGGTGGACG TCTACCAGCA GGCCTACGAC AGGTACCAGA AGAACTCCTG A
|
Protein sequence | MATSRKLAVL GALMAGTMLF TACSGSSNGS AAIKDSSAEF GFQETGFPIV KDTLTLKFSG TKSALAPDYN TMSLVQQWEK DTNIHIDWEN LPETVFKEKK NLILASGDLP DAFFNSGLTD AEIATYSASG TLIPLEDLIQ KNAPNLSKLL ADRPDIKAAI TSSDGHIYSL PSIEELGLVQ FPNEMAINTA WLNKLGLPMP KTVDELHDAL LAFKTKDASG TGKTIPLSFM PGSWCGDIVD LIAALGGVPD NMDHRIVQDG KVIYTATQDG YKKALQTLHT WYQEGLIDPE SFSQDDKAYL AKGKASTENL GSFVWWEVKE MVGADRAGDY KLLPVLEGVD GKRLASQSNN QEIARGAFAV TRTNKYPAAT IRWADNLYDP IQSAQANWGP IGETLQKDPA TGLLTQIPAA AGTSEGERRQ KVAPGGPKAN TAENFEKVVA PEPRAAERQK TVEENYKPFA ANDGYPPVAL SNEEVQQIST IETDVAAIVK QTTAKWIVSG GIEAEWDGYV SQLKNIGLDK MVDVYQQAYD RYQKNS
|
| |