Gene Information |
Locus tag | TM1040_3483 |
Symbol | |
ID | 4075123 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 514420 |
End bp | 515673 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638004998 |
Product | extracellular solute-binding protein |
Protein accession | YP_611717 |
Protein GI | 99078459 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
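The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed values) and that the 1254 bp include the terminal stop codon, which the gene sequence in this record does end with (TAA). The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes an unspliced CDS whose length is a whole number of codons;
    the stop codon, if included in the length, encodes no residue.
    """
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_bp // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    length = gene_length_bp(514420, 515673)
    print(length, protein_length_aa(length))  # prints: 1254 417
```

Both results agree with the Gene Length (1254 bp) and Protein Length (417 aa) fields of this record.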
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTTGC TGAAGGGAAC GGCCGCAGGA CTTGCGATGG CCGTCGGGCT GGCCGCATCT GCTCAGGCAT CGGAGCTGAC CGGAACGCTC AAGATTTTCT CGGATATGTC GAACCCGGCA CCGCGCGCGG TGATGGAGAA GATGGCGTCC GATTTTGATG CACTGCATCC CAATCTGAAA GTCGAACTGA CCGTCATCGA CCGCGAAGCC TACAAGACCC AGATTCGCAA CTTCCTGACC GCCAATGCGC CGGATGTTGC CAACTGGTAC GCTGCCAACC GCATGCGCCC CTATGTGTCG GCCGGTCTCT TTGAGGATGT CTCTGACCTC TGGGCAGAGC CTGCGATTGC GGAAAACCTT GCGTCCACCA AGGGCGCGAT GACGCTTGAT GGCAAGCAGT GGGGCGTGCC CTATACCTAC TATCAGTGGG GCGTCTACTA CCGCGAGGAC ATCTACAACG AACTGGGTCT CGAAGAGCCA AGCGACTGGG CAACCTTCAA GTCCAACTGC CAGAAGATTC TCGACTCGGG CCGCAAGTGC TTCACCATTG GTTCCAAGTT CCTCTGGACC GCCGGCGGCT GGTTTGACTA TCTGAACATG CGTACCAACG GCTACGACTT CCACATGGCG CTGACCAATG GGGACGTGGA ATGGACCGAT GACCGAGTGA AGCAAACCTT TGCCAATTGG CGCGAGCTGA TCGACATGGG CGCCTTTATC GACAACCACC AGTCCTACAG CTGGCAGGAG GCGCTGCCCT TCATGGTGAA TGGTGAAGCG GCGGCCTACC TCATGGGGAA CTTTTCCGTG GCCCCGCTGC GCGAAGCGGG TCTGAGCGAC GAGCAACTTG ATTTCTACCA GTTCCCGGCG ATCAACCCGG ATGTCGAGCT GGCCGAAGAT GCGCCGACCG ATACGTTCCA CATCCCGTCC GGGGCCCAGA ACAAGGAAGC GGCGCGTGAG TTCCTGCGCT ATGTGGTCTC TGCGGACGTG CAGACCGCGA TCAATGCGGG CGACGCACTT GGGCAGCTGC CGGTCAATGC CTCTTCCTCG GTGGATGATG ACGAGATGCT GAACCAGGGC TTCGAGATGC TCTCCTCCAA CAGCCCCGGC GGTATCGCGC AGTTCTTTGA TCGCGACGCC CCGGCCGAGA TGGCCTCGGT GGCGATGGAA GGCTTCCAGG AGTTCATGGT GTTCCCCGAC AATCTCGACG ACATCCTGAA CCGTCTCGAG AAGGCCCGTC AGCGGATCTA CTAA
|
Protein sequence | MNLLKGTAAG LAMAVGLAAS AQASELTGTL KIFSDMSNPA PRAVMEKMAS DFDALHPNLK VELTVIDREA YKTQIRNFLT ANAPDVANWY AANRMRPYVS AGLFEDVSDL WAEPAIAENL ASTKGAMTLD GKQWGVPYTY YQWGVYYRED IYNELGLEEP SDWATFKSNC QKILDSGRKC FTIGSKFLWT AGGWFDYLNM RTNGYDFHMA LTNGDVEWTD DRVKQTFANW RELIDMGAFI DNHQSYSWQE ALPFMVNGEA AAYLMGNFSV APLREAGLSD EQLDFYQFPA INPDVELAED APTDTFHIPS GAQNKEAARE FLRYVVSADV QTAINAGDAL GQLPVNASSS VDDDEMLNQG FEMLSSNSPG GIAQFFDRDA PAEMASVAME GFQEFMVFPD NLDDILNRLE KARQRIY
|
| |
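As a rough way to check the sequence fields against each other, the following sketch strips the 10-character block spacing, computes GC content, and translates the gene sequence codon by codon. It is a minimal illustration, not the database's own pipeline. The helper names are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares; table 11 differs in its permitted start codons, and this sketch does not special-case alternative starts such as GTG or TTG (the gene here begins with ATG).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (e.g. the 10-base block spacing) and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 and last 12 bases of the gene sequence in this record.
    print(translate("ATGAACTTGC TGAAGGGAAC GGCCGCAGGA"))  # prints: MNLLKGTAAG
    print(translate("CGGATCTACTAA"))                      # prints: RIY
```

The two translated fragments match the start (MNLLKGTAAG) and end (RIY) of the protein sequence field. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the reported GC content of 60%.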