Gene Dshi_1031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1031 
SymboltrpB 
ID5710999 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp1064459 
End bp1065691 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content66% 
IMG OID641266942 
Producttryptophan synthase subunit beta 
Protein accessionYP_001532374 
Protein GI159043580 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0133] Tryptophan synthase beta chain 
TIGRFAM ID[TIGR00263] tryptophan synthase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.466116 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.300404 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGACG ACCTGATCAA TTCCTTCATG ACCGGCCCGG ACGAGAATGG CCGGTTCGGC 
GATTTCGGGG GCCGGTTCGT CTCCGAGACC CTGATGCCGC TCATCCTGGA GCTCGAACGC
CAATACGAGT TCGCCAAGAC CGACCAGGCC TTCTGGGACG AGATGCACCA TCTCTGGACC
CATTACGTGG GCCGGCCCAG CCCGCTCTAT TTCGCCGAAC GCCTGACCGA GCGGCTGGGC
GGCGCGAAGG TCTACCTCAA GCGGGACGAG CTGAACCACA CCGGCGCGCA CAAGATCAAC
AACGTGTTGG GCCAGATCAT CCTCGCCCGC CGCATGGGCA AGACCCGCAT CATCGCCGAG
ACCGGCGCGG GCCAGCACGG CGTTGCCACG GCCACGGTCT GCGCCAAGTT CGGCCTGAAA
TGCGTGGTCT ACATGGGCGC CCATGATGTC GAACGCCAGG CGCCCAACGT CTTCCGCATG
AAGCTGCTGG GCGCCGAGGT CGTGCCCGTC ACCTCCGGTC GCGGCACGCT CAAGGACGCC
ATGAACGACG CGTTGCGCGA CTGGGTCACC AATGTGCGCG AGACCTTCTA CTGCATCGGC
ACGGTCGCGG GCCCGCACCC GTACCCGGCC ATGGTCCGCG ATTTCCAGGC GATCATCGGC
CAGGAGGCCC GCGAGCAGAT GATGGAGGCC GAGGGCCGGC TGCCCGATAC GCTGATCGCC
GCGATCGGCG GGGGCTCCAA CGCCATGGGC CTGTTCTACC CGTTCCTCGA TGACAAGGAG
GTCGCGATCA TCGGGGTGGA GGCCGGCGGC AAGGGCGTCA ACGAGAAGAT GGAGCATTGC
GCATCCCTGA CCGGTGGCCG CCCGGGCGTG CTCCATGGCA ACCGCACCTA CCTGCTGCAG
GACGATGACG GCCAGATCCT CGAAGGCTTC TCGATCTCCG CCGGGCTGGA CTATCCCGGC
ATCGGACCGG AGCATGCCTG GCTGCACGAT ATCGGACGGG CGAAATATGT CTCGATCACC
GATGCCGAGG CGCTCGACGC CTTCCAGCTC TGCTGCGAGA CCGAGGGCAT CATCCCCGCG
CTGGAGCCGT CGCACGCGCT GGCCCATGTG GCCAAGATCG CGCCGGACCT GCCCCGCGAC
CACATCATCT GCATGAACAT GTGCGGACGC GGCGACAAGG ACATCTTCAC CGTCGCCAAG
GCGCTGGGTC AGGACATGTC CGGCGCGGTC TGA
 
Protein sequence
MPDDLINSFM TGPDENGRFG DFGGRFVSET LMPLILELER QYEFAKTDQA FWDEMHHLWT 
HYVGRPSPLY FAERLTERLG GAKVYLKRDE LNHTGAHKIN NVLGQIILAR RMGKTRIIAE
TGAGQHGVAT ATVCAKFGLK CVVYMGAHDV ERQAPNVFRM KLLGAEVVPV TSGRGTLKDA
MNDALRDWVT NVRETFYCIG TVAGPHPYPA MVRDFQAIIG QEAREQMMEA EGRLPDTLIA
AIGGGSNAMG LFYPFLDDKE VAIIGVEAGG KGVNEKMEHC ASLTGGRPGV LHGNRTYLLQ
DDDGQILEGF SISAGLDYPG IGPEHAWLHD IGRAKYVSIT DAEALDAFQL CCETEGIIPA
LEPSHALAHV AKIAPDLPRD HIICMNMCGR GDKDIFTVAK ALGQDMSGAV