Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1745 |
Symbol | |
ID | 4445714 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 1952695 |
End bp | 1953588 |
Gene Length | 894 bp |
Protein Length | 297 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639689565 |
Product | extracellular solute-binding protein |
Protein accession | YP_831237 |
Protein GI | 116670304 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR02995] ectoine/hydroxyectoine ABC transporter solute-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00227146 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAGTC ACATTTCACG GCGGAACCTC CTGCGCGGCG CCGGTGCGGC AGCCTTGGGA ATCTCGGTGG CAGGCTGGGT AACCAGCTGT TCCACCGTTC CCGTCGGCGG CCCTGCAACA GGGGCCACCA CCAACCTGCT GGACACTGCG AAGGAGCAGG GCTTCATCCG GGTTGGCATC GCCAATGAGC CGCCGTACAC CCAGGTCAGC CCGGACGGGA AAGTCACCGG TTGTGAGCCT GATGTCCTTC GCGCGGTCTG CAAGCGCCTG GGCATCGACG AGGTCCAGGG CATCATCACG CCGTACGAGT CCATGATTCC GGGGCTCAAT GCCAACCGCT GGGATGTCAT TGCGGCGGGC CTCTTTATGA AGCAGTCGCG GTGTTCCCAG GTTCTCTACT CGGAGCCGGT TATCGTTTCC ACCGAGTCCT TCGCCATGCC GAAGGGCAAC CCGAAGGGCA TCCTGACGGT CGCTGACATC ATTGCCAACC CCGCGCTGCG CATTGCCGTC CTGCCGGGCG GGTTCGAGGA AGGGGTCCTG AAGGCGGCCA AAGTTCCCGC CAGCCAGCAG GTCAAGGTCA ATGACGGCCG CAGCGGCCTT GAGGCGCTCA CGGCAAACCG GGCGGACGCC TTCATGCTCC CCACCCTGTC CCTTAAGTCA CTTGCAGAGA ATGACGGCAG CTTCGATATC ACAGCACCGA TCAAAGACGC TCCCCGCACG GGCTCGGGTG CTGCTTTCCG CAAGGCTGAC ACGTCCTTCC ACGAGGCTTA CAACAGGGAG CTTGCCGCGT TCAAGGCCAC TCCTGAGTTT GGCGCGATCC TCACCAAGTG GGGCTTCGAT CCGACCGTAG TCGAAGGGGC CACTGCGGAG GAACTATGCA AGACCGAGGG CTGA
|
Protein sequence | MSSHISRRNL LRGAGAAALG ISVAGWVTSC STVPVGGPAT GATTNLLDTA KEQGFIRVGI ANEPPYTQVS PDGKVTGCEP DVLRAVCKRL GIDEVQGIIT PYESMIPGLN ANRWDVIAAG LFMKQSRCSQ VLYSEPVIVS TESFAMPKGN PKGILTVADI IANPALRIAV LPGGFEEGVL KAAKVPASQQ VKVNDGRSGL EALTANRADA FMLPTLSLKS LAENDGSFDI TAPIKDAPRT GSGAAFRKAD TSFHEAYNRE LAAFKATPEF GAILTKWGFD PTVVEGATAE ELCKTEG
|
| |