Gene Information |
Locus tag | Arth_4014 |
Symbol | |
ID | 4447815 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 4530973 |
End bp | 4532682 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639691845 |
Product | extracellular solute-binding protein |
Protein accession | YP_833489 |
Protein GI | 116672556 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
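
The coordinate, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check. It assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the values listed) and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(4530973, 4532682)
print(length_bp)                           # 1710
print(expected_protein_length(length_bp))  # 569
```

Both results match the Gene Length (1710 bp) and Protein Length (569 aa) fields of the record.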
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAATC TGACCAAGAT TGGCGGGGCG GCCGCCCTAA CGGCAGCGCT TGCGCTCACA GGCTGCGGCG GAGGCGGGAC CTCCTCCGGC CCCGAGGCAG GCAAAGGCCA GGACTCCGGC AGCGACCTCA CCAAGCTGAT CAGCATCAAT GCCAAGGACG CCAAGGACCT GGAACCGGGC GGCACCGTGA CCCTTCCGCT GGGCAACATC GGTCCGGACT TCAATGGCTT TTCCAACAAC GGCAACAGCG CGGACAACAC CGCGCTGCAC CGCCCCATCG ACGGTGCGGG AACGTGGGGC TGCTGGAACT TCGACTTCGA CGGCACAGCC ACGCCGAACA AGGACTTCTG CGAAGACGTC AAGAGCGAGG TCAAGGACGG CAAGCAGACC ATCACCATCA AGGTGAACGA GAAAGCCACC TACAACGACG GCACGCCGAT CGACGTCAAG GCCTTCCAGA ACACCTGGAA CATGCTCAAG GGTGAAAACA AGGACATCGA CATCGTCAGC TCCGGCGCGT ACGAGTTCGT TGATTCCGTG AAGGCCGGTT CCAGTGACAA GGAAGTTGTG GTCACCACCA CCCAGCCGGT ATTCCCGCTG GATGCCCTCT TCACTGGCCT GATCCACCCG GCAGTGAATA CGCCGGAACT CTTCAACACC GGCTTTACCG GCGACATGCA CCCGGAGTGG ATGGCCGGCC CGTTCAAGCT GGACCAGTAC GACAGCGCCG CCAAGACCGT GACCCTGGCC CAGAACGACA AGTGGTGGGG CACCAAGCCG GTCCTGGACA AGGTGGTGTT CCGCCAGCTG GAGACCAGCG CCCAGATTGC AGCGTTCAAG AACGGTGAAA TCGACGGCGT CTCGGCCAAC ACCATCGCGC TGTACAAGCA GCTCGACGGC ACCAAGAATT CAGAGGTCCG CCGCGGCCAG CGCCTGTTCG CAGGTGGCCT GAACCTGAAC GCCCAGAAGG CTCCGATGAC CGACGTCGCC ATCCGCAAGG CGATCTTCAC CGCCGTGGAC CGCGAAGCAC TCCGGAAGGT CCGCTTCAAC GGCCTGAACT GGGAAGAGAC CAGCTCCGGC TCAATGATGC TGCTGCCGTT CTCCAAGTAC TACCAGGACA ACTATCCGGC CACGGAATCC GGTGCCGAAG CAGCCAAGAA GGTACTGACC GATGCCGGCT ATAAGCCCAA CGCAGCGGGC ATCATGGAGA AGGACGGAGT CCCGGCCGCC TTCAAGATCA GCAACTTCGG TGACGACCCC ACCACCCTGG CGTTCGTGCA GACCCTGCAG AAGCAGCTCC AGGCCGGCGG CATGGACGTA GGGATCGACC AGCGCGCCTC CGCCGACTTC GGCAAGGTAC TGGGAAGCCG CGACTTCTTC CTGAGCGTTT CCGGCTACAC CGTCGGCGCT GATGCGACCG ACGCCGTCAA GCAGTTCTAC GACTCCAAGA CCAACGAGAA CGGGTTGGGC GACGCGGAGC TGGACGCCGA GATCAAGGCC CTCAGCAGCA TCGAGGACAA CGCCGAGCGC AACAAGGCGG CCATGGAGGT TGAAAGGAAG CACATGGCCA AGTACTTCTC CATGGGTGTT GTGATGAACG GCCCGCAGAT CTCGTTCGTC CGCACGGGCC TGGCAAACTA CGGCCCGTCC CTGTTCAAGA GCCTGTCCCA GGTTCCGGAC TGGACCAGCC TCGGCTGGGA AAAGAAGTAA
|
Protein sequence | MKNLTKIGGA AALTAALALT GCGGGGTSSG PEAGKGQDSG SDLTKLISIN AKDAKDLEPG GTVTLPLGNI GPDFNGFSNN GNSADNTALH RPIDGAGTWG CWNFDFDGTA TPNKDFCEDV KSEVKDGKQT ITIKVNEKAT YNDGTPIDVK AFQNTWNMLK GENKDIDIVS SGAYEFVDSV KAGSSDKEVV VTTTQPVFPL DALFTGLIHP AVNTPELFNT GFTGDMHPEW MAGPFKLDQY DSAAKTVTLA QNDKWWGTKP VLDKVVFRQL ETSAQIAAFK NGEIDGVSAN TIALYKQLDG TKNSEVRRGQ RLFAGGLNLN AQKAPMTDVA IRKAIFTAVD REALRKVRFN GLNWEETSSG SMMLLPFSKY YQDNYPATES GAEAAKKVLT DAGYKPNAAG IMEKDGVPAA FKISNFGDDP TTLAFVQTLQ KQLQAGGMDV GIDQRASADF GKVLGSRDFF LSVSGYTVGA DATDAVKQFY DSKTNENGLG DAELDAEIKA LSSIEDNAER NKAAMEVERK HMAKYFSMGV VMNGPQISFV RTGLANYGPS LFKSLSQVPD WTSLGWEKK
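
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch using only the Python standard library. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the record gives the strand as "-"), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. Alternative start codons, which table 11 also permits, are not handled here. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
first_codons = "ATGAAGAATC TGACCAAGAT TGGCGGGGCG"
print(translate(first_codons))  # MKNLTKIGGA
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.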