Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1440 |
Symbol | |
ID | 7976889 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1511153 |
End bp | 1512313 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644798352 |
Product | tryptophan synthase subunit beta |
Protein accession | YP_002949525 |
Protein GI | 239826901 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0133] Tryptophan synthase beta chain |
TIGRFAM ID | [TIGR00263] tryptophan synthase, beta subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000000281264 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTAG TTCAGCAAAG AAGAGGTTAT TTCGGAGAAT TTGGCGGAAG CTTCGTGCCG CCAGAGTTGC AGGAAGCGCT TGATTATTTG GAGGAGCAAT TTCTCAAGTA CAAAGACGAT CCAGCGTTTA ACGATGAATT TAAGTTTTAT TTAAAAGAGT ATGTCGGCCG TGAAAATCCG CTTACATTTG CGGCTCGGCT CACTGAACGT TTAGGCGGGG CGAAAATCTA TCTAAAGCGC GAAGATTTGA ACCATACAGG TTCGCATAAA ATTAATAACG TCATCGGACA GATTTTGCTT GCAAAACGAA TGGGGGCGAA ACGCATAATT GCCGAAACCG GGGCGGGACA ACATGGTGTC GCCACTGCAA CGGCTTGCGC GATGTTTGGC ATTGATTGCA CCATCTACAT GGGCGAGGAA GATACAAGGC GTCAGGCATT AAACGTGTTT CGCATGGAGC TTTTAGGCGC AAAAGTCGTT TCGGTATCAA AAGGACAGAG AAGATTAAAG GATGCCGTCG ATGAAGCGTT GAATGACTTT GTGCAAAACT ATAAGGATAC GTTCTATTTG CTTGGTTCAG CGGTTGGGCC TCATCCGTAT CCAAGCATCG TTAAACATTT TCAGTCTGTT ATAAGCGAAG AAAGCAAACG GCAAATTTTA GAAAAAGAAG GACGTTTGCC TGATGTCGTC ATCGCTTGCG TTGGCGGCGG AAGCAACGCG ATTGGCGCGT TTGCCCATTA TCTTGATGAA CCAAGCGTGC GCCTGATTGG CGTCGAGCCG GAAAAAGCGG CGACGCTTAC CAAAGGTGTC CCGGCCGTGC TTCATGGGTT TAAATGCTTA GTATTGTTGG ATGAAGAAGG AAATCCTCAG CCGACTTATT CGATTGCCGC TGGTCTTGAC TATCCAGGAA TTGGTCCTGA GCATAGCCAT CTGAAAGTAT CCGGACGTGC CGAATATTAT ACGGTGACAA ATGAGGAAGT TCTTGAAGCA TTCCAGCTTT TGTCGAAAAC GGAAGGGATT ATTCCAGCGC TTGAGAGCGC CCATGCGGTT GCTTATGCAA TAAAATTGGC ACCAACATTG GATAAAGATC AGATTATAAT CGTTAATCTT TCAGGGCGTG GCGATAAAGA CGTTGAGCAA GTGTTTCATA TGTTAAAGTA A
|
Protein sequence | MSLVQQRRGY FGEFGGSFVP PELQEALDYL EEQFLKYKDD PAFNDEFKFY LKEYVGRENP LTFAARLTER LGGAKIYLKR EDLNHTGSHK INNVIGQILL AKRMGAKRII AETGAGQHGV ATATACAMFG IDCTIYMGEE DTRRQALNVF RMELLGAKVV SVSKGQRRLK DAVDEALNDF VQNYKDTFYL LGSAVGPHPY PSIVKHFQSV ISEESKRQIL EKEGRLPDVV IACVGGGSNA IGAFAHYLDE PSVRLIGVEP EKAATLTKGV PAVLHGFKCL VLLDEEGNPQ PTYSIAAGLD YPGIGPEHSH LKVSGRAEYY TVTNEEVLEA FQLLSKTEGI IPALESAHAV AYAIKLAPTL DKDQIIIVNL SGRGDKDVEQ VFHMLK
|
| |