Gene Information |
Locus tag | TM1040_3683 |
Symbol | |
ID | 4075652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 740642 |
End bp | 742561 |
Gene Length | 1920 bp |
Protein Length | 639 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638005203 |
Product | extracellular solute-binding protein |
Protein accession | YP_611912 |
Protein GI | 99078654 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
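The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 740642
END_BP = 742561


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1920, matches "Gene Length"
print(protein_length(length_bp))  # 639, matches "Protein Length"
```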
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.378617 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.495229 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTAACG CCCGTCCCCT GCTTTCCTTC ACCAGTGCCC TTGCCTTTCT GCTGCTGCCG CTTGCGACGG TTGCTCAACC CACCGTGCAA GAAAGCGCCT TCTGGAGCCA TGAGGTCAGC GCAGGCGATC TGCCGCCCGC CGCTGAACGC ATCCCCGCAG AGCCGCTGGT GGTGAACCTC GCCGCCAAGG GGCGCAGTCC GGGCATTCCC GGCGGCACGC TCAATACGAT GGTCACGCGC TCCAAGGACA TCCGCCAGAT GGTTGTCTAC GGCTATGCGC GCCTTGTGGG ATACAACGAG AACTACGAGC TCGTGCCCGA TGTGCTACAA AGCTTTGAAA GCGAGGGCGA TCGCAAATTC ACCCTGCACC TCCGCGAGGG GCATAAATGG TCCTCTGGCG ATCCTTTCAC CTCGGCGGAT TTTGAATACT GGTGGACACA CATTGCCAAC AACCGCGAGC TCAATCCCAC CGGCCCGCCG GATTTTGTGC GCGTCGAGGG CGAGTTGCCG GAGGTCACTT TTCCGGATGA CACCACTGTG GTTTTTGAGT GGAGCACCCC CAACCCCAGC TTCCTTCAGG TGCTGGCGCA GGCGGTGCCA CCCTTCATCT ACCGTCCCTC CGCCTATCTG AAACAGTTCC ACAAGGACTT TGCCGACCCC GAAGCCTTGG AAGAGGAAAT CGACTACGCC CGCGTCAAGA GCTGGGCCGC GCTGCACAAC AAGCGCGACA ATATGGACAA GTTCGACAAT CCTGACCTGC CGACGCTACA GCCTTGGATC AACGCCACCG CAGGCAAGAA GATCCGCCAT CAATTTGTGC GCAATCCGTA CTATCATCGC ATCGATGAGA ACGGCGTGCA GCTCCCTTAT ATCGACACGG TCGAAATGGA GATTGTGTCG GGCGGGTTGG TCGCGGCAAA ATCCAACGCG GGCGAGGCCG ACCTGCAGGC ACGTGGGCTT GATTTCAGAG ACATTCCGAT CTTGCGCAAA GGCGAGGCAA ATGGTGACTA TCGCACCGAG CTATGGTCCT CGGGCACTGC CTCGCAGATT GCGATTTATC CGAACCTGAA CGCGGCGGAT GAAGTCTGGC GCGCGACCCT GCGCGATGTG CGTGTGCGCC GCGCCCTTTC GGTGGCGATC AACCGAGCAG CCATCAATAA ATCGCTCTAT TTCAAGCTGG CAAAGCCCGG CGCGATGACG GTTCTGGAGA AAAGCCCCTT CTTTGAGCAA GAGTTGCGGG ACGCGTGGGC CCAGTATGAT CCCGATCTCG CCAACACGCT GCTGGATGAG GCAGGCCTCA CGGAACGTGA CGGCTACGGC ATCCGCCGCC TGCCCGACGG GCGCCCGATG GAATTGGTGG TGGAAACCGC AGGCGAGCGG CAAGAGGTAG AAAATGCGCT GCAGATCATC ACAGATGACT GGCGCGATGT GGGTGTAAAG CTGGTGATGC GCCCGCTCGA TCGCGACATC CTGCGCAATC GTGTGTTCTC GGGCACCACC ATGGCCTCGG TCTGGTACGG CTGGGACAAT GGCCTCCCAC AGAGCTACAC CTCCCCGGCC TATCTTGCGC CCACGGATCA GGTGTTTTTG TCCTGGCCCA AATGGGGTCA GTATTATCAA ACCAGCGGCG CGGTGGGCGA AGCGCCAGAT ATGGCACCGG CGCAGCGTCT GATGGAGCTT CTGGACGACT GGAACAAGGC ACCAGATGCC AACAAACGGG CCGAAGCCTG GCATGAGATG CTTGAAATTC ACGCCGAAAA TGTCTTTGCA ATTGGTCTGG TGGCAGCCGC GCCGCAGCCC 
GTTGTGGTCT CAAACCGTCT GCGCAATGTG CCCAAGACGG CGATCTGGGC CTGGGATCCC GGCGCACATT TTGGCGTGCA CCGGATGGAT GAGTTCTACT TTGAGGATGG CGAAGGCTGA
|
Protein sequence | MSNARPLLSF TSALAFLLLP LATVAQPTVQ ESAFWSHEVS AGDLPPAAER IPAEPLVVNL AAKGRSPGIP GGTLNTMVTR SKDIRQMVVY GYARLVGYNE NYELVPDVLQ SFESEGDRKF TLHLREGHKW SSGDPFTSAD FEYWWTHIAN NRELNPTGPP DFVRVEGELP EVTFPDDTTV VFEWSTPNPS FLQVLAQAVP PFIYRPSAYL KQFHKDFADP EALEEEIDYA RVKSWAALHN KRDNMDKFDN PDLPTLQPWI NATAGKKIRH QFVRNPYYHR IDENGVQLPY IDTVEMEIVS GGLVAAKSNA GEADLQARGL DFRDIPILRK GEANGDYRTE LWSSGTASQI AIYPNLNAAD EVWRATLRDV RVRRALSVAI NRAAINKSLY FKLAKPGAMT VLEKSPFFEQ ELRDAWAQYD PDLANTLLDE AGLTERDGYG IRRLPDGRPM ELVVETAGER QEVENALQII TDDWRDVGVK LVMRPLDRDI LRNRVFSGTT MASVWYGWDN GLPQSYTSPA YLAPTDQVFL SWPKWGQYYQ TSGAVGEAPD MAPAQRLMEL LDDWNKAPDA NKRAEAWHEM LEIHAENVFA IGLVAAAPQP VVVSNRLRNV PKTAIWAWDP GAHFGVHRMD EFYFEDGEG
|
| |
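The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below strips that spacing and translates the first and last 30 bases of the gene sequence to show that they agree with the two ends of the protein sequence. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons, which this sketch does not model). The helper names are illustrative.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last three blocks of the gene sequence, copied from the record.
head = "ATGTCTAACG CCCGTCCCCT GCTTTCCTTC"
tail = "GAGTTCTACT TTGAGGATGG CGAAGGCTGA"

print(translate(head))  # MSNARPLLSF
print(translate(tail))  # EFYFEDGEG
```

Passing the complete gene sequence to `translate` in the same way would allow a comparison against the full protein sequence.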