Gene P9211_01811 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_01811 
SymboltrpB 
ID5730718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp174061 
End bp175311 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content43% 
IMG OID641284525 
Producttryptophan synthase subunit beta 
Protein accessionYP_001550066 
Protein GI159902722 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0133] Tryptophan synthase beta chain 
TIGRFAM ID[TIGR00263] tryptophan synthase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0720607 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAAGCA CGCTACCCTC GCAACCAAAG GATATGGAAC TCGCAAACAG TTCCCGACCA 
TCGGTCCATG GACGATTTGG TCGATTTGGA GGTCAATATG TGCCTGAAAC TCTTATGCCA
GCTCTTGCTG AACTAGAGAA AAAAGCTGCC GAAGCATGGC AAGATTCTTC ATTCACAAAT
GAACTATCTC ATTTATTAAA AACCTACGTC GGTCGAGCAA CCCCTTTATA TGAAGCAAAG
AGATTAAGCC AGCACTACAT GAGCAGAGAA GGAGGGCCAA GAATTTGGCT AAAGCGAGAA
GATCTCAATC ACACCGGGGC TCACAAAATC AATAATGCTC TAGGACAAGC TCTTCTAGCA
ATAAGAATGG GCAAAAAAAG AATCATTGCA GAGACAGGTG CTGGCCAGCA TGGAGTTGCA
ACAGCAACGG TATGTGCACG ATTTGGACTG GAATGCGTCA TATATATGGG TCAAGAAGAT
ATGGAAAGAC AAGCTCTAAA TGTATTTAGA ATGAAACTAC TAGGAGCAAA AGTTCAATCG
GTCACAGCTG GTACAGCCAC TTTAAAAGAT GCAACAAGTG AGGCAATTCG CGATTGGGTT
ACTAATGTCG AATCAACTCA TTACATCCTT GGATCAGTAG CAGGCCCACA TCCTTATCCA
ATGTTGGTTA GAGATTTTCA TTCAGTCATT GGAGAAGAGA CTAAACAACA ATGCAAAGAG
GCTTTTGGCC GATCACCTGA TGTACTACTG GCATGCGTCG GAGGAGGTTC AAATGCGATG
GGATTATTCC ATTCATTCAT AGAAGATCTT TCAGTAAAAA TGATTGGTGT TGAAGCTGCT
GGAGATGGGG TAAATACCAA ACGCCATGCT GCAACAATCA CCCAAGGGAG TGTAGGAGTA
CTTCATGGGG CTATGAGCCT TCTTCTTCAA GACAGTGATG GACAAGTTCA AGAAGCCCAT
TCAATTAGTG CTGGGCTTGA TTACCCAGGC GTAGGACCTG AACATAGCTA TCTGAATGAA
ATAGGTCGGG CAGAATATGT AGCTGTTACA GATAAAGAAG CTTTAAATGC CCTTGAACTA
GTCAGCAAAT TAGAAGGAAT TATTCCTGCC TTAGAAACAG CCCATGCTTT TGCATGGCTA
GACACACTTT GCCCTTCTCT TGCCCCAGGT ACTGAAATAG TTATTAATTG CTCTGGTCGA
GGAGATAAAG ATGTCAATAC TGTTGCAAAA AAAATGGGCT TTGAAATTTA A
 
Protein sequence
MTSTLPSQPK DMELANSSRP SVHGRFGRFG GQYVPETLMP ALAELEKKAA EAWQDSSFTN 
ELSHLLKTYV GRATPLYEAK RLSQHYMSRE GGPRIWLKRE DLNHTGAHKI NNALGQALLA
IRMGKKRIIA ETGAGQHGVA TATVCARFGL ECVIYMGQED MERQALNVFR MKLLGAKVQS
VTAGTATLKD ATSEAIRDWV TNVESTHYIL GSVAGPHPYP MLVRDFHSVI GEETKQQCKE
AFGRSPDVLL ACVGGGSNAM GLFHSFIEDL SVKMIGVEAA GDGVNTKRHA ATITQGSVGV
LHGAMSLLLQ DSDGQVQEAH SISAGLDYPG VGPEHSYLNE IGRAEYVAVT DKEALNALEL
VSKLEGIIPA LETAHAFAWL DTLCPSLAPG TEIVINCSGR GDKDVNTVAK KMGFEI