Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3216 |
Symbol | |
ID | 4075320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 211368 |
End bp | 212645 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638004725 |
Product | extracellular solute-binding protein |
Protein accession | YP_611452 |
Protein GI | 99078194 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.949389 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGATGA CGAAATTCAA GGCGGCCGCT GCTGCCCTTG CCTTTGCTGC GGCCCCGGCC GCAGCGGATG TGGACCTGGA ATTCTATTTC CCGGTGGCCG TTGGCGGAGC TGCGGCCGAC ACCATCGAAG AGCTGACCGC GCAATATGTG GCCGAGAACG AGGGTGTGAA CATCGAAGCG ATCTACGCGG GTTCCTATCA GGACACGGTC GCCAAGGCTC TGACCGCTGC GCGGGGCGGA AACGCGCCGC AGCTGTCGGT GATCCTGTCG GTTGATATGT TCACCCTCCT GGACGAAGAC CTCATCCTGC CGTTCGATGA CTTTGCCACC AGCGCAGAGG ACAAGGCCTG GCTCGACTCC TTCTACCCCG CGTTTATGGA AAATTCCCAA ACTGGTGGAA AGACTTACGG TATTCCGTTC CAACGCTCGA CTCCTGTTCT CTATTGGAAC AAAGAGGCCT TTGAAGCCGC CGGTCTCGAT CCTGAAACGC CCCCGGCGAC CTGGGACGAG ATGGTCGAGA TGGGTAAGAA GCTTACGCTC AAAGACGACG CGGGCAATGT GACCCAATGG GGTGTGCGCA TCCCATCTTC GGGGTTTCCG TATTGGTTGT TCCAGGGGCT GACCACCGAG AATGACGCCA TCCTTGCCAA TGCCGATGGC AATGAGGTGA ATTTCGATGA TCCCAAGGTT GTGGAAGCGC TCGACTATCT GGTGGATCTG TCCAAAACGC ATGAAGTGAT GGCCCCCGGC ATCATCGAAT GGGGCGCAAC GCCCAAAGCG TTCTTTGAAG GCCAGACCGC GATGATGTGG ACCTCGACTG GCAACCTGAC GAATGTTCGC AACAATGCGC CTTTTGACTT TGGTGTCGCG ATGCTGCCCG CCAACAAACG TCGCGGAGCG CCGACCGGCG GTGGCAATTT CTACCTCTTC AAAGGCGCAT CGGACGCCCA GGCAAAGGCC GCATTTGATT TTGTCAAATG GATCTCCGCC CCGGAACAGT CGGCCAAATG GACCATCGCC ACGGGCTATG TCGCCCCGCG TCCCGAAACC TGGGAGAGCG AAGCGATGAA AGCCTACGCT GCTGAATTTC CCCCGGTTCT CGTTGCTCGT GATCAGCTTG AGCACGCGGT TGCGGAGCTT TCGACCTATG AAAATCAGCG TGTGACCCGC ATTTTCAACG ATGCGCTTGC CGCTGCGATC ACCGGTCAGA AAACCGCCGA AGAAGCCTTG AAAGAAGCGC AGGCGAAGGC AGACGCGATC CTTCAGGACT ACCGTTAA
|
Protein sequence | MLMTKFKAAA AALAFAAAPA AADVDLEFYF PVAVGGAAAD TIEELTAQYV AENEGVNIEA IYAGSYQDTV AKALTAARGG NAPQLSVILS VDMFTLLDED LILPFDDFAT SAEDKAWLDS FYPAFMENSQ TGGKTYGIPF QRSTPVLYWN KEAFEAAGLD PETPPATWDE MVEMGKKLTL KDDAGNVTQW GVRIPSSGFP YWLFQGLTTE NDAILANADG NEVNFDDPKV VEALDYLVDL SKTHEVMAPG IIEWGATPKA FFEGQTAMMW TSTGNLTNVR NNAPFDFGVA MLPANKRRGA PTGGGNFYLF KGASDAQAKA AFDFVKWISA PEQSAKWTIA TGYVAPRPET WESEAMKAYA AEFPPVLVAR DQLEHAVAEL STYENQRVTR IFNDALAAAI TGQKTAEEAL KEAQAKADAI LQDYR
|
| |