Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0206 |
Symbol | |
ID | 4078654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 223721 |
End bp | 224980 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638005500 |
Product | tryptophan synthase subunit beta |
Protein accession | YP_612201 |
Protein GI | 99080047 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0133] Tryptophan synthase beta chain |
TIGRFAM ID | [TIGR00263] tryptophan synthase, beta subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.475237 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGAAG ATCTCATCAA TAGCTTCATG AACGGGCCGG ACGAAAACGG TCGGTTTGGC ATCTTTGGCG GCCGCTTTGT CAGCGAGACA CTGATGCCGC TGATCCTGAG CCTCGAAGAG GAATACGAAA AGGCCAAGGT GGATCCGGAT TTCTGGGCGG AAATGGATGA TCTGTGGAAG AACTATGTGG GCCGCCCGAG CCCGCTCTAT TTCGCCGAGC GCCTGACCAA TCATCTGGGC GGCGCCAAGG TCTACATGAA GCGTGACGAG CTCAATCACA CCGGCGCGCA TAAAGTGAAC AATGTGCTGG GTCAGATCCT GCTCGCCCGC CGCATGGGCA AGACTAGGAT CATCGCCGAA ACCGGTGCTG GCCAGCATGG GGTTGCGACC GCCACTGTCT GTGCCAAGTT TGGTCTGAAA TGCGTGGTCT ACATGGGCGC TCATGACGTA CGTCGACAGG CGCCCAACGT GTTCCGGATG CGTCTTCTTG GCGCTGAGGT GATCCCAGTC ACCTCTGGCC GTGGCACGCT CAAGGATGCG ATGAACGACG CGCTGCGGGA CTGGGTCACC AATGTGCGCG ACACATTCTA CTGCATCGGC ACCGTTGCGG GCCCGCACCC CTATCCGGCT ATGGTGCGCG ATTTCCAGTC CGTGATCGGC AAGGAAGTGC GCTGGCAGCT TGCAGAGCAG GAAGGCGAGG GCCGGTTGCC GGACACGGTG ATTGCGGCTA TCGGAGGGGG CTCCAACGCG ATGGGCCTGT TCCACCCGTT CCTCGATGAC CCTTCGGTCA ATATCATTGG CGTTGAGGCC GGCGGCAAAG GTGTGGATGA GAAAATGGAG CATTGCGCCT CATTGACAGG CGGCCGGCCG GGCGTGCTGC ACGGCAACCG GACCTATCTG CTGCAGGACG ATGACGGCCA AATCCTCGAA GGCTTCTCGA TTTCGGCGGG CCTGGATTAC CCGGGGATCG GGCCGGAGCA TGCCTGGCTG CATGAGACCG GGCGCGCGCA ATACGTGTCC ATCACCGACA AGGAAGCCCT CGAGGCGTTC CAGCTGTCCT GCGCGATGGA GGGGATTATC CCGGCGCTCG AGCCGAGCCA CGCGCTCGCT CATGTCACTA AGATCGCACC AGAACTGCCC AAGGACCACA TCATCGTGAT GAACATGTGT GGGCGTGGCG ACAAGGACAT CTTTACCGTA GCCCGCCACC TCGGGTTTGA TATGTCAGAC ACCGAAGAGG GCCGCGACCT CGAAGAGTGA
|
Protein sequence | MAEDLINSFM NGPDENGRFG IFGGRFVSET LMPLILSLEE EYEKAKVDPD FWAEMDDLWK NYVGRPSPLY FAERLTNHLG GAKVYMKRDE LNHTGAHKVN NVLGQILLAR RMGKTRIIAE TGAGQHGVAT ATVCAKFGLK CVVYMGAHDV RRQAPNVFRM RLLGAEVIPV TSGRGTLKDA MNDALRDWVT NVRDTFYCIG TVAGPHPYPA MVRDFQSVIG KEVRWQLAEQ EGEGRLPDTV IAAIGGGSNA MGLFHPFLDD PSVNIIGVEA GGKGVDEKME HCASLTGGRP GVLHGNRTYL LQDDDGQILE GFSISAGLDY PGIGPEHAWL HETGRAQYVS ITDKEALEAF QLSCAMEGII PALEPSHALA HVTKIAPELP KDHIIVMNMC GRGDKDIFTV ARHLGFDMSD TEEGRDLEE
|
| |