Gene SeD_A1602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1602 
SymboltrpB 
ID6874933 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1544335 
End bp1545528 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content56% 
IMG OID642784748 
Producttryptophan synthase subunit beta 
Protein accessionYP_002215416 
Protein GI198242751 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0133] Tryptophan synthase beta chain 
TIGRFAM ID[TIGR00263] tryptophan synthase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value0.0886138 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACAC TTCTCAACCC CTACTTTGGT GAATTCGGCG GCATGTATGT GCCGCAGATC 
CTGATGCCTG CGCTGAACCA GCTTGAAGAG GCCTTCGTCA GCGCGCAAAA AGATCCTGAA
TTTCAGGCGC AATTCGCCGA TCTGCTAAAA AACTACGCGG GACGCCCCAC CGCGCTGACG
AAATGCCAGA ACATTACCGC CGGTACGCGT ACCACGTTGT ATTTAAAGCG CGAAGATTTA
CTGCACGGCG GCGCACACAA AACCAATCAG GTACTGGGTC AGGCGCTGCT GGCCAAACGG
ATGGGTAAAA GCGAGATTAT CGCTGAAACC GGCGCCGGTC AGCACGGCGT CGCCTCTGCG
CTCGCCAGCG CCCTGCTGGG TCTGAAATGC CGTATCTATA TGGGTGCCAA AGACGTTGAG
CGCCAGTCGC CGAACGTCTT CCGTATGCGT CTGATGGGCG CTGAGGTTAT CCCGGTTCAT
AGCGGCTCCG CTACGCTAAA AGATGCCTGT AACGAGGCGC TGCGCGACTG GTCCGGTAGT
TACGAAACCG CGCACTATAT GCTCGGCACG GCGGCAGGAC CGCATCCCTA TCCCACCATC
GTTCGCGAGT TCCAACGCAT GATTGGCGAA GAGACGAAAG CGCAAATCCT CGACAAAGAG
GGCCGTCTGC CAGATGCCGT TATCGCTTGC GTCGGCGGCG GCTCAAACGC TATCGGGATG
TTTGCGGATT TTATTAATGA TACCAGCGTC GGGCTAATAG GCGTTGAACC TGGCGGTCAC
GGTATTGAAA CCGGCGAGCA TGGCGCGCCG CTTAAACATG GTCGCGTTGG CATCTATTTC
GGGATGAAAG CGCCGATGAT GCAAACAGCG GACGGGCAAA TTGAAGAGTC CTATTCCATT
TCCGCCGGGC TCGATTTCCC ATCCGTTGGG CCGCAACATG CGTACCTGAA CAGCATCGGA
CGCGCGGATT ATGTCTCCAT TACCGATGAT GAGGCGCTGG AAGCCTTCAA AACGTTGTGC
CGCCATGAGG GAATTATCCC GGCGCTGGAA TCCTCCCACG CGTTGGCGCA CGCTCTGAAA
ATGATGCGCG AGCAGCCGGA AAAAGAGCAA CTGCTGGTGG TCAATCTCTC TGGCCGCGGA
GATAAAGACA TCTTTACCGT ACACGATATC CTGAAAGCGC GAGGGGAAAT CTGA
 
Protein sequence
MTTLLNPYFG EFGGMYVPQI LMPALNQLEE AFVSAQKDPE FQAQFADLLK NYAGRPTALT 
KCQNITAGTR TTLYLKREDL LHGGAHKTNQ VLGQALLAKR MGKSEIIAET GAGQHGVASA
LASALLGLKC RIYMGAKDVE RQSPNVFRMR LMGAEVIPVH SGSATLKDAC NEALRDWSGS
YETAHYMLGT AAGPHPYPTI VREFQRMIGE ETKAQILDKE GRLPDAVIAC VGGGSNAIGM
FADFINDTSV GLIGVEPGGH GIETGEHGAP LKHGRVGIYF GMKAPMMQTA DGQIEESYSI
SAGLDFPSVG PQHAYLNSIG RADYVSITDD EALEAFKTLC RHEGIIPALE SSHALAHALK
MMREQPEKEQ LLVVNLSGRG DKDIFTVHDI LKARGEI