Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0849 |
Symbol | |
ID | 4076024 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 901433 |
End bp | 902737 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638006147 |
Product | extracellular solute-binding protein |
Protein accession | YP_612844 |
Protein GI | 99080690 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0177782 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGTTT CAAAAATTGC ACTGAGCTGC GCACTGGCGA CCGCTCTGAC CGCCGGGGCC GCCTGGGCCG AGACCGAAAT CCAGTGGTGG CACGCCATGG GCGGCGCCAA TGGCGAGCGC ATCGACAAGA TGGCGGCAGA CTTCAACGCC AGCCAGTCCG AGTATAAAAT CGTGCCCACC TACAAGGGCA ACTACACTGA AACCATGACC GCCGCCGTGG CCGCGTTCCG CGCGGGTGAG CAGCCGCACC TTGTACAGGT GTTCGAAGTG GGCACCGCCA CCATGATGGC TGCCAAAGGG GCGATCTACC CCATCGAGCA GATGATGTCC GATGCGGGCG AAGCCTTTGA CAAATCCGAC TATCTGCCCG CGGTGATTTC TTATTACCAG ACCCCCGAGG GGGAACTGCT GTCGATGCCG TTCAACAGCT CGACACCGGT TCTGTGGTAC AATGCCGATG CCTTCAAATC CGCAGGCGTC GATGTCCCGG AAACCTGGGA TGACGTGAAA TCCGCTGCTC AGGCGCTGGT CGACAACGGC ATGGAGTGCG GCCTGTCCTT CGGTTGGCAG TCCTGGGTGA TGGTTGAGAA CTTCTCGGCT TGGCACAACA TCGAGATGGG CACCAAGGAA AACGGCTTTG CCGGGTTCGA CACCGAGTTC ACCTTCAACA ACGAGCAGGT TGCGGCCCGC CTCGAGGACA TCGCCTCCAT GAGCGAGGGC AACCTCTTCA AATATGGCGG TCGTCGCGGC GACAGCCTGC CGCTGTTCAC CAACGGTGAA TGCGGGATGT GGATGAATTC CTCGGCCTAT TACGGCTCCA TGGTCGAGCA GGCAGAGTTC GAATTCGGCC AGACCATGCT GCCGCTCGAC ACCTCGGTTG CGGACGCGCC TCAGAACTCC ATCATCGGCG GTGCGACCCT CTGGGCGCTG GCCGGTCACG AGGCCGAGGA ATACAAGGGT CTGGCGCAGT TCATGACCTA TCTTTCCTCG CCCGAAGTTC AGGCATGGTG GCACCAGGAA ACCGGCTATG TGCCGATCAC CACTGCCGCG TATGAGCTGA GCAAGGAGCA GGGTTTCTAT GACGAAAACC CCGGCACCGA CACCGCGATC AAGCAGCTGA GCCTGAACGC GCCGACGCCG AACTCCCGCG GGATCCGCTT TGGCAACTTC GTGCAGGTGC GTGACGTGAT CAACGAAGAG CTCGAAGCGC TCTGGGCTGG TGACAAGACC GCCTCCGAAG CCCTCGATGC CGCCGTTGAG CGTGGTAACG CGCTGCTGCG CAAATTCGAG CGCTCCGCGA AGTAA
|
Protein sequence | MGVSKIALSC ALATALTAGA AWAETEIQWW HAMGGANGER IDKMAADFNA SQSEYKIVPT YKGNYTETMT AAVAAFRAGE QPHLVQVFEV GTATMMAAKG AIYPIEQMMS DAGEAFDKSD YLPAVISYYQ TPEGELLSMP FNSSTPVLWY NADAFKSAGV DVPETWDDVK SAAQALVDNG MECGLSFGWQ SWVMVENFSA WHNIEMGTKE NGFAGFDTEF TFNNEQVAAR LEDIASMSEG NLFKYGGRRG DSLPLFTNGE CGMWMNSSAY YGSMVEQAEF EFGQTMLPLD TSVADAPQNS IIGGATLWAL AGHEAEEYKG LAQFMTYLSS PEVQAWWHQE TGYVPITTAA YELSKEQGFY DENPGTDTAI KQLSLNAPTP NSRGIRFGNF VQVRDVINEE LEALWAGDKT ASEALDAAVE RGNALLRKFE RSAK
|
| |