Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0496 |
Symbol | |
ID | 4447016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 526933 |
End bp | 528294 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639688293 |
Product | extracellular solute-binding protein |
Protein accession | YP_829995 |
Protein GI | 116669062 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGTTTA GATCGTCTTC CGGGCTTGCG GCAATCGTTA CGGCTGCGGC TTTGGCATTG ACCGGCTGCG GCGCGGGCGC AGGGACTACT GGCAGTTCAA CAAACGCCGA CGGAAAGGTG GACGGCACGG GCAAGACGCT CAACGTCCTG GTCGGCGTCC TCAGCCAGTA CCCGGAGCAG CAGAAGAAGT GGCAGAGCGA CATCGCGGCC AAGTTCAAGG CAGAAACCGG GGCCGACGTG AAGTTTGAAA CCTTCGCCTC AGCCAACGAC GAGCTGACGC GCATCCAGAC CTCCGTGGTT TCAGGGCAGG GCCCGGACAT CTATGGGCTC GGCACCACGT TCACTCCGAC CGCCTTCGCC ACCAAAGCCT TCGTGACCCT GTCCGACGAC GATTGGAAGA AGGTCGGCGG CAAGGACCGC TTCAACCCTG CAGCATTGGG CATCTCCGGC CCCGACGAGG GGCACCAGGC CGGCATCCCG TTCGTAAGCC GCCCCTTCGT GATGGCTTAC AACAAGGAGC TGTTGGCGGC TGCAGGCATT GAGAAGCCTG CCACCAGCTG GGACGAGCTT GCCGAACAGG CGAAGAAAAT GACCAAGGAC GGCACGTTCG GCATCGCCAC CGGGTACAAA GACTCCTACG ATCCGTGGAA GTTCATCTGG GCCATGTCCG TCCAAGCCGG CAATCCGCTG GTGGACGGAA ACAGCCTCAA GATGGATGAT CCCACCGTCA AGAAGGCTTA CGAGACTTAT TTCGGCTGGT TGACCGATGA CAAAGTTGTG GACCCTGCCT CCGTCGGGTG GAGCAACAGC AACGCGGTTG CTGCCTTCGC CAGCGGAAAA GCCGGTTATC TGATGATGAC GACGTCGAGC TCCATCCCAA CGCTGGACAA GTCGGCCGTG GCAGGCAAGT ACGAATACGC ACTGATGCCC ACTACCGCTC CGGGTGAATC CAGCCCCAAG AGTGACGGCG CGGAAGCCGC GAGCATCCTC TCCGGGGATA ACCTCGTGGT GGCGGACTAC TCGAAGGAGA AGGATCTCGC CTTCGCCTAC ATCAAGCTGA TCACCTCGAA AGAGGAACAG CTGAACTACC AAAAGACCTT CGGCGACCTG CCCGCAAACG CCGAGGCGTT GGCCAGCCTC ACTGATCCCA AGCTCAAGCC AATCGCGGAT GCCGCCGCCA AGTCCAAAGC CACCCCGTTT ACAGGTGCTT GGGGCGACAT CCAGCTCGGC TTGCTCAACG TCACTGTTCA GTCGATTCCG GACCTTTCCA GCGGCAGGCT CGACGAGTCG GCCCTCGAGG CTCGAATCAA GGACGCCCAG ACCAAGGGGC AGGCGTCCCT TGACCGGGCC GCCAAGGGAT AA
|
Protein sequence | MRFRSSSGLA AIVTAAALAL TGCGAGAGTT GSSTNADGKV DGTGKTLNVL VGVLSQYPEQ QKKWQSDIAA KFKAETGADV KFETFASAND ELTRIQTSVV SGQGPDIYGL GTTFTPTAFA TKAFVTLSDD DWKKVGGKDR FNPAALGISG PDEGHQAGIP FVSRPFVMAY NKELLAAAGI EKPATSWDEL AEQAKKMTKD GTFGIATGYK DSYDPWKFIW AMSVQAGNPL VDGNSLKMDD PTVKKAYETY FGWLTDDKVV DPASVGWSNS NAVAAFASGK AGYLMMTTSS SIPTLDKSAV AGKYEYALMP TTAPGESSPK SDGAEAASIL SGDNLVVADY SKEKDLAFAY IKLITSKEEQ LNYQKTFGDL PANAEALASL TDPKLKPIAD AAAKSKATPF TGAWGDIQLG LLNVTVQSIP DLSSGRLDES ALEARIKDAQ TKGQASLDRA AKG
|
| |