Gene Information |
Locus tag | Arth_2822 |
Symbol | |
ID | 4444618 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 3172511 |
End bp | 3174154 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639690644 |
Product | extracellular solute-binding protein |
Protein accession | YP_832301 |
Protein GI | 116671368 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
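The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, that the CDS is unspliced (as stated), and that the gene length includes the terminal stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Arth_2822
start_bp, end_bp = 3172511, 3174154
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # 1644 547
```

Under these assumptions the computed values agree with the listed gene length (1644 bp) and protein length (547 aa).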
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.630772 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTTTTT CGCGCACTTC CAAAGCACTG GGCATGGTTG CCATCGCGGC CCTGGCCCTG ACCGGATGCG GAGCCGGAGA CGGCTCCACC ACCGGTAGCG GCAGTACCGG TGGCGACACC AGCAAAGTAA TCCTGGCCGA CGGCTCCGAG CCCCAGCGTC CCCTCATGCC GGCCGACACC AACGAGGTCG GCGGCGGCAA GGTCATCGAC ATGATCTTCG CCGGCCTTGT CAGCTATGAC CCCAGCGGCA AGCCCGTCAA CGAACTGGCC GAGTCCATTG AAGGCAAAGA CGGCCAGCAC TTCACCATCA AGATCAAGAA GGACCAGAAG TTCACGAACG GCGAGGCCAT CACGGCCAAG TCATTCGTCG ATTCCTGGAA CTTCGGGGCT GCTGCCAAGA ATGCCCAGTT GAGCGGGGGC TTCTTTGAAA GCATTGCCGG CTACGACGAG GCCAGTGCAG AAGGCTCCAC CGTGGAGACC ATGTCCGGCC TCAAGGTCGT TGACGACCAG ACGTTCACGG TTGAACTGAA GCAGCCCGAA TCCGACTGGC CGCTTCGACT CGGCTACACC GCCTTCGTTC CGGTTCCCTC CGGCGCGCTG AAGGACCCGA AGGGTTTCGG CGAGAAGCCG GTCGGCAACG GCCCGTACAA GCTGGCGGAC GGCGGTTGGC AGCACAACGT GCAGATCCAG CTCGTGCCGA ACCCGGACTA CAACGGCCCG CGCAAGGCCA AGAACGCCGG CGTGACCTTC AAGATCTTCC AGAACGACGA CGCCGCGTAC CAGGACCTGC TGTCCAACAA TCTGGACATC CTGCAGACTA TCCCCACCAG CGCCTTGAAG AACTTCAAGA CCGACCTGGG TGACCGCACC ATCAACAAGC CGTACGCCGG CAACCAGACC ATTGCCATCC CGGAATACCT GCCGGAATGG AGCGGTGAGG CAGGCAAGCT TCGTCGCCAG GCCATCTCCA TGGCCATCAA CCGGGAAGAG ATCACCAAGG TGATCTTCAG CGGTGCACGC CAGCCCGCCA AGGACTTCAC TGCTCCCGTC CTTGACGGCT ACAGCGACTC GATCACGGGT TCCGAGAACC TGACGTTCGA CGCCACGAAG GCAAAAGAAG CCTGGGCCAA GGCCGACGCC ATCCAGAAGT GGGACTCCAA CGAGACCTTC ACTATTGCCT ACAACGCCGA CAAGGGCGGA CACAAGGCCT GGGTCGAAGC CGTAGTGAAC CAGCTCAAGA ACACGCTCGG CATCAAGGTT GAGGGCAAGC CGTACGCCAC CTTCAAGGAA GCCCGCAACG ACGCCACCGC CAAGACGCTG ACCGGCTCCA TCCGCGCCGG CTGGCAGGCG GATTACCCGT CGCTGTACAA CTTCCTCGGA CCGATCTACA AGACCGGTGC AGGCTCTAAC GACGCCAAGT ACGCCAACCC GACGTTCGAC AAGGCCATCT CTGAAGGACT GGCCGCTTCC TCCGTCAGCG AAGGCAACAA GGCCATGAAC AAGGCCCAGG AAATCCTCCT GGCCGACCTT CCGGCCATCC CCCTGTGGTA CCAGGTTGCA CAGGGCGGCT GGAGCGACAA GGTCACCAAC GTTGACTACG GCTGGGACGG CGTCCCGCTG TACTACAACA TCACTGGCAA GTAA
|
Protein sequence | MRFSRTSKAL GMVAIAALAL TGCGAGDGST TGSGSTGGDT SKVILADGSE PQRPLMPADT NEVGGGKVID MIFAGLVSYD PSGKPVNELA ESIEGKDGQH FTIKIKKDQK FTNGEAITAK SFVDSWNFGA AAKNAQLSGG FFESIAGYDE ASAEGSTVET MSGLKVVDDQ TFTVELKQPE SDWPLRLGYT AFVPVPSGAL KDPKGFGEKP VGNGPYKLAD GGWQHNVQIQ LVPNPDYNGP RKAKNAGVTF KIFQNDDAAY QDLLSNNLDI LQTIPTSALK NFKTDLGDRT INKPYAGNQT IAIPEYLPEW SGEAGKLRRQ AISMAINREE ITKVIFSGAR QPAKDFTAPV LDGYSDSITG SENLTFDATK AKEAWAKADA IQKWDSNETF TIAYNADKGG HKAWVEAVVN QLKNTLGIKV EGKPYATFKE ARNDATAKTL TGSIRAGWQA DYPSLYNFLG PIYKTGAGSN DAKYANPTFD KAISEGLAAS SVSEGNKAMN KAQEILLADL PAIPLWYQVA QGGWSDKVTN VDYGWDGVPL YYNITGK