Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3901 |
Symbol | |
ID | 4444541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 4394521 |
End bp | 4395501 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639691726 |
Product | extracellular solute-binding protein |
Protein accession | YP_833376 |
Protein GI | 116672443 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCGTT TTTCACTCAA ACCCGGCGTC ACCGCAGCGC TGGCAGCCAC CACTTTGCTG GTGCTTGCGG CCTGCTCCGA CCCCGGCGCC GCAGCGGCCC CGGCGTCCGG TCCGGCGACG TCCGCCGCAG GCGTCAAGCA GTTCAACCTC TCCCCCGAGC AGAACCGGGA GGCAGCCGCC GTTGACCCGG CAGCCGCGGC TCTGGTGCCG GAAGCCATCA AGAAAGACGG CAAGCTCACG GTGGCCGTCA GCCCCTTCGC AGCGCCGTTG GCCGTCTACG CCACGGACAA CAAGACTCCG GTGGGCAACG AGGTGGATAT CGCCGTCGCA CTGGCCCAGA CGCTGGGGCT GGAGGCTGAC ATCGTTCCCA CCGCCTGGGC GGACTGGCCC CTCGGCGTCG AGTCCGGCAA GTACGAGGCC GTGCTATCCA ATGTCACGGT GACCGAGGAG CGGAAACTCA AGTTCGATTT CGCCAGCTAC CGGGACGACA AACTGGGTTT CTACACCAAG AGCGACAGCT CCATCAGCAA GGTGGAATCC GCTCCGGATG TTGCCGGGAA GCGCGTGATT GTGGGTTCGG GGACCAACCA GGAGTCCATC CTGCTGCGCT GGGATGAGGA GAACAAGAAG AACGGCCTGC CAGCGGTGGA GTTCCAGTAT TACGACGACG ACTCCGCCTC CTCGCTCGCC CTCCAGTCCG GACGCGCGGA CCTGACGTTC GGGCCGAACG CTACGGCCGC CTTCAAGGCG GCGTCCGACG CAAAGACCAA GCTGGTAGGC CTGGTGGACG GCGGCTGGCC ACTGAAGGCC AGCATCGCGG CCACCACCAA GAAGGGCAAC GGCTTTGCCG CTGCCGCCCA GGCCGGCCTG AACCACCTCA TCGAAGACGG CAGCTATGCC AAAATCCTGG ACCGCTGGGG CCTTAGCGCG GAAGCCGTCC CGAAGTCCGA ACTGAACCCG GCGGGACTGC CGAAGAAGTA G
|
Protein sequence | MARFSLKPGV TAALAATTLL VLAACSDPGA AAAPASGPAT SAAGVKQFNL SPEQNREAAA VDPAAAALVP EAIKKDGKLT VAVSPFAAPL AVYATDNKTP VGNEVDIAVA LAQTLGLEAD IVPTAWADWP LGVESGKYEA VLSNVTVTEE RKLKFDFASY RDDKLGFYTK SDSSISKVES APDVAGKRVI VGSGTNQESI LLRWDEENKK NGLPAVEFQY YDDDSASSLA LQSGRADLTF GPNATAAFKA ASDAKTKLVG LVDGGWPLKA SIAATTKKGN GFAAAAQAGL NHLIEDGSYA KILDRWGLSA EAVPKSELNP AGLPKK
|
| |