Gene Information |
Locus tag | TM1040_3399 |
Symbol | |
ID | 4075573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 418497 |
End bp | 419819 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 638004908 |
Product | extracellular solute-binding protein |
Protein accession | YP_611633 |
Protein GI | 99078375 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
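The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the Protein Length excludes the terminal stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 418497
END_BP = 419819
GENE_LENGTH_BP = 1323
PROTEIN_LENGTH_AA = 440


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 419819 - 418497 + 1 = 1323 bp, and 1323 / 3 = 441 codons, one of which is the stop codon, leaving 440 amino acids.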
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0250918 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.750741 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAT CCAAATTTAC CAAAGGCTTG TTGGCAAGTT GCGCGGTCCT TGCATCAGCA GAATCGGCGC TGTCGAGCGA TTGGGGCTCA TTCGAGGGCG TGACAATCGA AGCCAAGCTG ATCGGTGGTC AGCAGTATGA AGGGCTCTAT GGCCGCATTG CAGACTGGGA GGCTGCAACC GGTGCCAAGG TCGAAATTAT CTCGAAGAAG AGCCACTTTG AAATCGACCG TGAGATCAAA TCGGATATGG CCGCGGGCAC AACTGATTGG TGCATTGGCT CCAATCATTC GTCCTTTGCG CCTCAATACG AGGGCCTCTA TGTCGATCTG AACGACTATG TCGACGCAAG CGTAATTGAG GGGTTCGTGC CAGGCACCAT TACGGCCTCT ACTGTTGGCG GGGATCTGTT GATGCTGCCA CGGGCGCAGT TTGATGTTTC GGTGCTGTAT TACCTCAAGT CCAACTATGA GGATGCGCAG AAAGCCGAGG CATTCGAGGC CCAATTCGGC TATCCATTGG CCGTGCCGCA GACTTGGGAG CAGGTGAAGG ATCAGGCGAT ATTCTTTGCG GATCCGCCGA ATTTTTATGG CACGCAATAT GCGGGCAAGG ACGAAGCTAT CGTCGGTCGC TTCTATGAAA TGGTGGTCGC GGAAGGTGGC AATTTCCTTG ATGAGGACAA CCGACCGATT TTCAATTCGG ACGCAGGTCA GCGCGCGCTG CAGTGGTTTG TCGATCTCTA CGAGGCCAAA GCGGTGCCTG CGGGTACCAC GTCTTATGTC TGGGACGACC TTGGCCAAGG GTTTGCAAGT GGCACCGTAG CGCTGAACCT CGATTGGCCC GGCTGGGCTG GCTTCTTCAA TAATCCTGAC TCGTCCAAGG CGGCTGGAAA CGTGGGTGTT GCCGTGCAGC CGATGGGATC GGTGACCCGC ACCGGCTGGT CTGGCCATCA TGGGTTCTCG GTGACGGATG ACTGCGCCAA CAAAGAAGCT GCTGCCTCTC TTGTGGCCTT TCTGACGAGC GAAGAGAGCC AGCTGGCAGA ATCTGCGGGC GGCTCGTTGC CCACCCGCAC GGCGGTTTGG GAGGCCAACA TCGCCAAGGC GCGCGCCGGG GATGATCCGT TCCGGACCGA GGCGCTGGAA GCCTTTGCTG AAGGGGCGAA ATATGCCTTT GCAGTGCCGC CCATCCCGGA GTGGGGCGAG TCCACCAATC TGGTTTTCCC GGAACTTCAG GCCGCTATCG TTGGCGATAA AACCGTCGAG GAAGCGCTGG ATGATGCGGC TGAGGCGGTG GATGAGCTGA TGCGCGAGTC CGGCTACTAC TAA
|
Protein sequence | MKKSKFTKGL LASCAVLASA ESALSSDWGS FEGVTIEAKL IGGQQYEGLY GRIADWEAAT GAKVEIISKK SHFEIDREIK SDMAAGTTDW CIGSNHSSFA PQYEGLYVDL NDYVDASVIE GFVPGTITAS TVGGDLLMLP RAQFDVSVLY YLKSNYEDAQ KAEAFEAQFG YPLAVPQTWE QVKDQAIFFA DPPNFYGTQY AGKDEAIVGR FYEMVVAEGG NFLDEDNRPI FNSDAGQRAL QWFVDLYEAK AVPAGTTSYV WDDLGQGFAS GTVALNLDWP GWAGFFNNPD SSKAAGNVGV AVQPMGSVTR TGWSGHHGFS VTDDCANKEA AASLVAFLTS EESQLAESAG GSLPTRTAVW EANIAKARAG DDPFRTEALE AFAEGAKYAF AVPPIPEWGE STNLVFPELQ AAIVGDKTVE EALDDAAEAV DELMRESGYY
|
| |
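The sequences above are printed in space-separated blocks of ten letters, so they need to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC percentage for comparison with the GC content field; the helper names are hypothetical, and the example string is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(grouped: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence (whitespace ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two ten-letter blocks of the gene sequence, for illustration only.
example = "ATGAAGAAAT CCAAATTTAC"
print(len(clean_sequence(example)))
print(gc_percent(example))
```

Passing the full Gene sequence field to `clean_sequence` and `gc_percent` would give values to compare against the Gene Length and GC content fields of the record.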