Gene Information |
Locus tag | Arth_2724 |
Symbol | |
ID | 4444589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 3062826 |
End bp | 3064460 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639690544 |
Product | extracellular solute-binding protein |
Protein accession | YP_832203 |
Protein GI | 116671270 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
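
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the stated gene length, and that the final codon of an unspliced CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3062826, 3064460)
print(length)                  # 1635, matching the Gene Length field
print(protein_length(length))  # 544, matching the Protein Length field
```

The strand does not affect this check: on the minus strand the gene is read from End bp back toward Start bp, but the span covered is the same.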
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.117295 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAATGA ACAAAAAGGC CCTGCACAGC GCTATCGCGC TCGCGGGTGT TTCCGCGTTT GCACTGACAG CCTGCACAGG TCCGTCCGGC GGCGGCGGAA CTTCCACCGG CGGCGCCGGA GGCGGAACCA TTACCTACGG CACCACGGAC AAGGTCGTTA CCCTCGATCC TGCGGGCTCG TACGACGCCG GTTCCTTCAT GGTGATGAAC CAGATCTACC CGTTCCTGCT GAACGCCAAG CCCGGCACGG CGGACGCCAC ACCCGATATC GCAGAGTCCG CGGAATTCAC GAGCCCCACG GAGTACACCG TCAAGCTCAA GTCGGACCTC AAATTTGCCA ATGGACACGC GCTCACCTCC TCCGACGTGA AGTTCTCCAT CGACCGCGTG GTCAAGATCG CGGACGACAA CGGCCCTGCC TCGCTTCTGG GCAACCTGGA GTCGGTTACC GCCAAGGACG ACTCCACGGT GGTCTTCAAG CTCAAGGCCG GCAATGACCA GGTCTTCCCG GGCGTCCTTG CTGCCAATGC AGGACCCATC GTCGATGAAG AGGTCTTCCC GGCGGACAAG CTCATGAGCG ACGACGAAAT CGTCAAGGGC AAGCCGTTCG CCGGCCCCTA CACGATCGAG AGCTACAAGA AGAACGAGCT TGTGAGCCTG AAGGTCAACC CGGACTACAA GGGCCTGCTG GGCAAGCCCG CCAATGACGG CGCGAGCATC AAGTACTACG CCGATTCGAA CAACCTCAAG CTCGACGTCC AGCAGGGCAA CATCGACGTT GCCGGCCGCA GCCTGACCGC TACGGACGCC GCTGACCTCG AAAAGGACTC CAAGGTCACC GTCCACAAGG GTCCCGGCGG CGAGCTGCGC TACATCGTGT TCAACTTCGA CACCATGCCG TTCGGAGCGA AGACCGCCGA GGCAGATCCC GCCAAGGCGC TCGCCGTCCG CCAGGCCATG GCGAACGTCG TTGACCGCGA CGCCATCGCA ACCCAGGTCT ACAAGGGCAC CTACCTGCCC GCGTACTCCG TAGTCCCCGA CGGGTTCGTC GGAGCCATCC AGCCGCTCAA GGAAATGTAC GGCGACGGCA GCGGCAAGCC CAGCCTGGAC AAGGCCAAGA AGGCATTCTC CGAGGCAGGC GTAACGGCCC CGGTCAACAT TAAGCTGCAG TACAACCCCG ACCACTACGG CAAGTCCTCG GGCGACGAAT ACGCCATGAT CAAGGAACAG CTGGAGAAGT CCGGCCTCTT CAAGGTGGAC CTGCAGTCCA CTGAATGGGT GACCTACTCA AAGGACCGCA CCAAGGACGT CTACCCGGTC TACCAGCTCG GCTGGTTCCC GGACTACTCG GACGCGGACA ACTACCTGAC CCCGTTCTTC GTACCGGGCA ACTTCCTGAA GAACCACTAC GAAAACCCGT CCGTGACGGA CCTGATCACC AAACAGCTCA CCACTGTTGA CAAGGCAGAG CGCGAGAAGG TCCTGGGTGA AGCCCAGACG TCAGTTGCCA AGGATCTCTC CACGCTGCCG CTGCTGCAGG GCGCCCAGCT CATGGTCGCC GGAAAGGACG TCAAGGGTGT TGAAAAGACC CTGGACGCGT CCTTCAAGAC CCGTCTTGGC GTGATTTCCA AGTAG
Protein sequence | MAMNKKALHS AIALAGVSAF ALTACTGPSG GGGTSTGGAG GGTITYGTTD KVVTLDPAGS YDAGSFMVMN QIYPFLLNAK PGTADATPDI AESAEFTSPT EYTVKLKSDL KFANGHALTS SDVKFSIDRV VKIADDNGPA SLLGNLESVT AKDDSTVVFK LKAGNDQVFP GVLAANAGPI VDEEVFPADK LMSDDEIVKG KPFAGPYTIE SYKKNELVSL KVNPDYKGLL GKPANDGASI KYYADSNNLK LDVQQGNIDV AGRSLTATDA ADLEKDSKVT VHKGPGGELR YIVFNFDTMP FGAKTAEADP AKALAVRQAM ANVVDRDAIA TQVYKGTYLP AYSVVPDGFV GAIQPLKEMY GDGSGKPSLD KAKKAFSEAG VTAPVNIKLQ YNPDHYGKSS GDEYAMIKEQ LEKSGLFKVD LQSTEWVTYS KDRTKDVYPV YQLGWFPDYS DADNYLTPFF VPGNFLKNHY ENPSVTDLIT KQLTTVDKAE REKVLGEAQT SVAKDLSTLP LLQGAQLMVA GKDVKGVEKT LDASFKTRLG VISK
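
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any computation. The following is a minimal sketch, not the database's own method, of how one might strip that spacing, compute a GC fraction, and confirm that a coding sequence ends in a stop codon. The helper names are hypothetical, and the input shown is a short toy string in the same layout rather than the full gene; the stop codon set is the one used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the whitespace that splits the sequence into 10-character blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in a nucleotide sequence that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a whole number of codons and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Toy input in the same block layout; substitute the full "Gene sequence" field.
toy = clean_sequence("ATGGCAATGA ACAAAAAGTA G")
print(len(toy), round(gc_fraction(toy), 3), ends_with_stop(toy))
```

Run on the full gene sequence, `gc_fraction` can be compared against the GC content field, and `len` against the Gene Length field.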