Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2416 |
Symbol | |
ID | 4076742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 2556474 |
End bp | 2558207 |
Gene Length | 1734 bp |
Protein Length | 577 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638007738 |
Product | extracellular solute-binding protein |
Protein accession | YP_614410 |
Protein GI | 99082256 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000012875 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGAAGT ATCTTTTGAC CGCAGTTGCA GCCGGGGCTG TAATTGCGGC AACGGGTTCG TATGCCGATG AGGCAGCGGC CCAGAAATGG ATCGACGAGG AGTTTCAGCC GTCGGTTCTG AGCAAAGCAG AACAACTTGC CGAGATGCAG TGGTTCATCA ACGCCGCCGA GCCTTACAAG GGCATGGAGA TCAACGTACT GTCCGAGGGC ATCCCCACGC ACAGCTACGA ATCCGAGGTG CTGACCAAGG CGTTTGAGGA AATCACCGGC ATCAAGGTGA ACCACCAGAT CCTGGGCGAA GGCGAGGTCG TTCAGGCCGT GCAGACCCAG ATGCAGACCA AGCGGAACCT CTATGACGCA TACGTCAACG ACTCCGACCT GATCGGCACG CACTCGCGCC TGCAGCTCGC TTACAACCTG AGCGACATGA TGGAAGGCGA CTTCAAGGAT GTGACCAACC CCGGTCTCGA CCTTGACGAT TTCATGGGCA CCCAGTTCAC CACTGGCCCC GATGGCGACC TCTACCAGCT GCCCGACCAG CAGTTTGCGA ACCTCTACTG GTTCCGCAAA GATTGGTTCG ACCGCGAAGA TCTGAAGGCC GCCTTCAAAG AGAAATACGG CTACGAGCTG GGTGTTCCGG TCAACTGGTC CGCCTATGAA GACATTGCCG AGTTCTTCTC TGAAGATGTG AAAGAAATCG ACGGCACCAC CATCTACGGC CACATGGATT ACGGCAAACG CGCGCCTGAC CTCGGCTGGC GGATGACCGA TGCGTGGCTC TCCATGGCCG GTGCGGGCTC CAAGGGTGAG CCGAACGGTG TTCCGATCGA CGAATGGGGC ATCCGTATGG AAGAAGGCAC CTGTAACCCG GTGGGCGCAA GCGTCACCCG CGGCGGTGCT GCAAACGGTC CGGCAGCAGT CTATGCGATC CGCAAGTGGG ACGAATGGCT GCGCAAATAC GCACCTCCCG GTGCCGCGTC TTATGACTTC TACCAGTCTC TGCCCGCACT CGCTCAAGGC AACGTCGCGC AGCAGATCTT CTGGTACACC GCCTTTACCG CAGACATGGT GAAGCCGAAG TCCGAAGGCA ACAACACCGT CGACGACAGC GGCACCCCGC TGTGGCGCAT GGCACCGAGC CCGCATGGCC CCTACTGGGA AGAAGGCCAG AAGGTTGGCT ATCAGGACGT GGGCTCCTGG ACCTTCCTCA ACTCCACCCC GCTGGACCGC GCACAAGCCG CATGGCTCTA TGCTCAGTTC GTCGTCTCCA AGACCGTCGA CGTGAAGAAG TCCCACGTGG GTCTGACCTT CATTCGCGAC AGCTCCGTCA ACCACGAGAG CTTCACCGAG CGTGCGCCCA AACTGGGTGG TCTGGTGGAA TTCTACCGTT CGCCCGACCG GACTGCATGG TCCCCGACCG GCATCAACGT GCCTGACTAT CCCAAGCTGG CGCAGATCTG GTGGCAGCAG ATTGGTGACG TGAACTCCGG TGCCTTCACC CCGCAAGAAG CGATGGATCG TCTGGCGCAG GAAATGGACA TCACCATGGG TCGTATGCAG CGTGCAGACG AGCAGGCGAA TGTCTATGGC GGCTGCGGCC CGCGTCTGAA CGAAGAAAAA GACGCGGAGT GGTGGTACGC CAATGGCGGC GCCAAGCCGA AGCTGGAGAA CGAAAAGCCG CAAGGCCAGA CCGTCAACTA TGACGAGCTG GTGGCGCGCT GGGCCGCGAA CTGA
|
Protein sequence | MRKYLLTAVA AGAVIAATGS YADEAAAQKW IDEEFQPSVL SKAEQLAEMQ WFINAAEPYK GMEINVLSEG IPTHSYESEV LTKAFEEITG IKVNHQILGE GEVVQAVQTQ MQTKRNLYDA YVNDSDLIGT HSRLQLAYNL SDMMEGDFKD VTNPGLDLDD FMGTQFTTGP DGDLYQLPDQ QFANLYWFRK DWFDREDLKA AFKEKYGYEL GVPVNWSAYE DIAEFFSEDV KEIDGTTIYG HMDYGKRAPD LGWRMTDAWL SMAGAGSKGE PNGVPIDEWG IRMEEGTCNP VGASVTRGGA ANGPAAVYAI RKWDEWLRKY APPGAASYDF YQSLPALAQG NVAQQIFWYT AFTADMVKPK SEGNNTVDDS GTPLWRMAPS PHGPYWEEGQ KVGYQDVGSW TFLNSTPLDR AQAAWLYAQF VVSKTVDVKK SHVGLTFIRD SSVNHESFTE RAPKLGGLVE FYRSPDRTAW SPTGINVPDY PKLAQIWWQQ IGDVNSGAFT PQEAMDRLAQ EMDITMGRMQ RADEQANVYG GCGPRLNEEK DAEWWYANGG AKPKLENEKP QGQTVNYDEL VARWAAN
|
| |