Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0715 |
Symbol | |
ID | 4076992 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 769931 |
End bp | 771628 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638006012 |
Product | extracellular solute-binding protein |
Protein accession | YP_612710 |
Protein GI | 99080556 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.109389 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATTGA AATCCCTTCT GCTGGGTGCT GTGGCAACTG CCGCTGTCGC CCCTGCGGCT TTTGCCGAGC GTGGCTCGGA CGGCCAGGTC AACATTATTT ATTGGCAGGC TCCGTCCATC ATGAACCCGT TCCTGTCCGG CGGCACCAAG GACGTTGAAG CGGCCTCGCT CGTGATCGAG CCTCTGGCGC GCTACAACTC CTCCGGCGAG ATGGTTCCGT GGCTGGTCGA AGAAGTCCCC ACCGTTGAAA ACGGCGGCGT GAGCGAAGAC CTGACCCAGA TCACCTGGAA ACTGAAGCCG GGCATCAAGT GGTCCGATGG TTCCGACCTT ACGTCTGCGG ACGTGAAGTT CACATATGAG TACTGCACCC ACCCCGAGGG CGGCTGCGCA CAGGTCACCA AGTTCGAAGG CGTTACCTCT GTCGAGACCC CCGACGACTC CACCGTGGTG GTGACCTTCG ATAAAGCGAC CCCCTTCCCC TATGGCCCCT TCGTCGGCGG TGAAAGCCCG ATCATTCAGG CAGCACAGTT CGCTGAGTGC CTTGGTGCCA AAGCACCTGA GTGTACCGAA GCCAACTTCA ACCCGATCGG CACCGGCCCG TTTGTGGTCG ACGAGTTCAA GCCGAACGAC GTGATCACCC TCTCCGCGAA CCCGAACTAC CGTGACCCGG CCAAGCCCGC GTTCGCGAAG GTTCTGTTCA AAGGTGGCGG CGATGCAACC GCAGCCGGTC GCGCCGTGAT GGAAACCGGC GAATTCGACT ACGCATGGAA CCTCCAGCTG GCCCCCGATG TCATCGCGCA GATGGAAGCA GGCGGCAAAG GCCAAGCGGT TGCAGGCTTT GGTCCGCTCC TCGAGCGCAT CATGCTCAAC AACACCAACC CCTCCGCGGA TCTCTCCCCG GAAGAGCGTT CGGTGATCCG TCCGCACCCG TTCCTGTCCG ATCCGGCGGT TTACAAAGCC ATGTCCATGG CAATCGACCG TCCGCTTCTG GTGGAAATCG GCTATGGCAA AGCAGGTCGC GTGTCCTGCT CCTGGGTTCC GGCCCCCGAA GCCTTTGCGA TCAGCCCCGA AGGCTGTGAG ACTCAGGACA TCGCCGGTGC AAACGCCATG CTCGACGCAG CAGGTATCGT TGACACCGAT GGTGACGGCA TCCGCGAAAA AGACGGTGTT CCGCTGAAGA TCCTGTACCA GACCTCGACC AACGCCGTCC GTCAGGACTT CCAGGCGCTG ATCAAACAGT GGTGGAGCGA GATCGGCATC GAAACCGAAC TGCGTAACAT CAACTCCTCC GTGTTCTTCG GCGGCGACCC GGGCTCCCCG GACACCTTCC AGAAGTTCTA CGCAGACGTT GAAATGTACG CCAACACCTT CAACGGCACT GACCCGCAGT CCTACTTCGG CAACGGTCTG TGCGACAAAG CCCCGACTCC GGCCTCCCAG TGGCAGGGTG AGAACATCTC CCGCTTCTGT GACGAAGAGT TCGACGCGCT GCACGCAGAG CTTTCGCAAA CCGCAGACAT GGCAAAACGG ATCGAGATCG GTCAGCAGCT CAACACCATC ATCTTTGAGC GTGGTGGGAT GATCCCGCTG GTCCACCGTG GCCGTCTGTC CGCACACTCC AACACCCTTG GTGGTGTCGA CCTGAACGTG TGGGACAGCG AGCTGTGGAA CGCAGCTGAC TGGTACCGCT CCGAATAA
|
Protein sequence | MTLKSLLLGA VATAAVAPAA FAERGSDGQV NIIYWQAPSI MNPFLSGGTK DVEAASLVIE PLARYNSSGE MVPWLVEEVP TVENGGVSED LTQITWKLKP GIKWSDGSDL TSADVKFTYE YCTHPEGGCA QVTKFEGVTS VETPDDSTVV VTFDKATPFP YGPFVGGESP IIQAAQFAEC LGAKAPECTE ANFNPIGTGP FVVDEFKPND VITLSANPNY RDPAKPAFAK VLFKGGGDAT AAGRAVMETG EFDYAWNLQL APDVIAQMEA GGKGQAVAGF GPLLERIMLN NTNPSADLSP EERSVIRPHP FLSDPAVYKA MSMAIDRPLL VEIGYGKAGR VSCSWVPAPE AFAISPEGCE TQDIAGANAM LDAAGIVDTD GDGIREKDGV PLKILYQTST NAVRQDFQAL IKQWWSEIGI ETELRNINSS VFFGGDPGSP DTFQKFYADV EMYANTFNGT DPQSYFGNGL CDKAPTPASQ WQGENISRFC DEEFDALHAE LSQTADMAKR IEIGQQLNTI IFERGGMIPL VHRGRLSAHS NTLGGVDLNV WDSELWNAAD WYRSE
|
| |