Gene GWCH70_1440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1440 
Symbol 
ID7976889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1511153 
End bp1512313 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content46% 
IMG OID644798352 
Producttryptophan synthase subunit beta 
Protein accessionYP_002949525 
Protein GI239826901 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0133] Tryptophan synthase beta chain 
TIGRFAM ID[TIGR00263] tryptophan synthase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000281264 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTTAG TTCAGCAAAG AAGAGGTTAT TTCGGAGAAT TTGGCGGAAG CTTCGTGCCG 
CCAGAGTTGC AGGAAGCGCT TGATTATTTG GAGGAGCAAT TTCTCAAGTA CAAAGACGAT
CCAGCGTTTA ACGATGAATT TAAGTTTTAT TTAAAAGAGT ATGTCGGCCG TGAAAATCCG
CTTACATTTG CGGCTCGGCT CACTGAACGT TTAGGCGGGG CGAAAATCTA TCTAAAGCGC
GAAGATTTGA ACCATACAGG TTCGCATAAA ATTAATAACG TCATCGGACA GATTTTGCTT
GCAAAACGAA TGGGGGCGAA ACGCATAATT GCCGAAACCG GGGCGGGACA ACATGGTGTC
GCCACTGCAA CGGCTTGCGC GATGTTTGGC ATTGATTGCA CCATCTACAT GGGCGAGGAA
GATACAAGGC GTCAGGCATT AAACGTGTTT CGCATGGAGC TTTTAGGCGC AAAAGTCGTT
TCGGTATCAA AAGGACAGAG AAGATTAAAG GATGCCGTCG ATGAAGCGTT GAATGACTTT
GTGCAAAACT ATAAGGATAC GTTCTATTTG CTTGGTTCAG CGGTTGGGCC TCATCCGTAT
CCAAGCATCG TTAAACATTT TCAGTCTGTT ATAAGCGAAG AAAGCAAACG GCAAATTTTA
GAAAAAGAAG GACGTTTGCC TGATGTCGTC ATCGCTTGCG TTGGCGGCGG AAGCAACGCG
ATTGGCGCGT TTGCCCATTA TCTTGATGAA CCAAGCGTGC GCCTGATTGG CGTCGAGCCG
GAAAAAGCGG CGACGCTTAC CAAAGGTGTC CCGGCCGTGC TTCATGGGTT TAAATGCTTA
GTATTGTTGG ATGAAGAAGG AAATCCTCAG CCGACTTATT CGATTGCCGC TGGTCTTGAC
TATCCAGGAA TTGGTCCTGA GCATAGCCAT CTGAAAGTAT CCGGACGTGC CGAATATTAT
ACGGTGACAA ATGAGGAAGT TCTTGAAGCA TTCCAGCTTT TGTCGAAAAC GGAAGGGATT
ATTCCAGCGC TTGAGAGCGC CCATGCGGTT GCTTATGCAA TAAAATTGGC ACCAACATTG
GATAAAGATC AGATTATAAT CGTTAATCTT TCAGGGCGTG GCGATAAAGA CGTTGAGCAA
GTGTTTCATA TGTTAAAGTA A
 
Protein sequence
MSLVQQRRGY FGEFGGSFVP PELQEALDYL EEQFLKYKDD PAFNDEFKFY LKEYVGRENP 
LTFAARLTER LGGAKIYLKR EDLNHTGSHK INNVIGQILL AKRMGAKRII AETGAGQHGV
ATATACAMFG IDCTIYMGEE DTRRQALNVF RMELLGAKVV SVSKGQRRLK DAVDEALNDF
VQNYKDTFYL LGSAVGPHPY PSIVKHFQSV ISEESKRQIL EKEGRLPDVV IACVGGGSNA
IGAFAHYLDE PSVRLIGVEP EKAATLTKGV PAVLHGFKCL VLLDEEGNPQ PTYSIAAGLD
YPGIGPEHSH LKVSGRAEYY TVTNEEVLEA FQLLSKTEGI IPALESAHAV AYAIKLAPTL
DKDQIIIVNL SGRGDKDVEQ VFHMLK