Gene A9601_01821 details

Gene Information

Locus tag: A9601_01821
Symbol: trpB
ID: 4716866
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 169302
End bp: 170546
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 38%
IMG OID: 640077881
Product: tryptophan synthase subunit beta
Protein accession: YP_001008577
Protein GI: 123967719
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0133] Tryptophan synthase beta chain
TIGRFAM ID: [TIGR00263] tryptophan synthase, beta subunit
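
The reported lengths follow from the coordinates. A minimal Python sketch of the arithmetic, assuming that Start bp and End bp are inclusive 1-based positions and that the last codon of the CDS is a stop codon (the variable names are illustrative and not part of the record):

```python
# Coordinates copied from the record above.
start_bp = 169302
end_bp = 170546

# Assumption: both end points are included in the gene.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with a stop codon,
# which does not contribute a residue to the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, "bp")     # 1245 bp
print(remainder)             # 0
print(protein_length, "aa")  # 414 aa
```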


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGTAAGTA CATTTTCTCG CAAAGATCAA AACTATAAAA ATGACGATTT AAATCAACCC 
TCCATAGAGG GAAGATTTGG AAAATATGGT GGTCAATATG TTCCTGAAAC GCTAATGCCC
GCTCTTTTTG AGCTTGAAGA CGCTGCGTCT AATGCATGGA AAGATAAACT TTTTGTAGAA
GAATTGAATC ACCTACTCAA GACTTATGTA GGAAGAGAAA CACCACTTTA TGAAGCCAAA
AGACTTACTG AACATTACAA AAATAAACAA GCAACTCCTA GGATATGGCT TAAAAGAGAA
GATTTAAATC ATACTGGGGC TCACAAAATT AATAATGCTC TTGGACAAGC TTTATTGGCA
ATAAGAATGG GCAAAAAAAG AATAATTGCA GAAACTGGAG CAGGTCAGCA CGGAGTTGCT
ACTGCTACTG TTTGTGCGAG ATTTGGCTTG AAATGTATTA TCTACATGGG TGCTGAAGAT
ATAAAAAGGC AATCCCTTAA CGTTTTCAGA ATGAAACTTT TAGGAGCTGA AGTTAAAGTT
GTAAATTCTG GAACTGCAAC ACTTAAGGAT GCTACTAGTG AGGCCATTAG AGATTGGGTT
TCTAATGTCG AAACCACACA CTACATTTTA GGATCTGTTG CAGGCCCACA CCCTTTCCCA
AAGATTGTGC GCGATTTTCA TGCAGTTATA GGTGAAGAAA CTAAAAAACA ATGTTTGGAA
TCATTTGGAT CTTTACCCGA TATTTTGCTT GCTTGTGTAG GTGGAGGATC AAATGCAATG
GGGCTTTTCC ATCCTTTTGT TAAAGAAACT TCTGTGCGGC TTATTGGAGT TGAAGCCGCA
GGAAGCGGAG TTGATACTGA CAAACATGCT GCCACTATCA CTAAAGGGTC AGTTGGAATT
TTGCATGGAT CAATGAGTCT TCTCTTGCAA GATGATAATG GTCAAGTACA AGAAGCTCAC
TCAATAAGTG CAGGTTTAGA TTACCCTGGA GTAGGTCCTG AACATAGCCA TTTAAAAGAT
ATAGGCAGAG CAGAATATGG ATCAGTCACA GATCAAGAAG CTTTGGACGC TTTAAAACTT
GTTAGTGAAC TAGAAGGAAT TATACCTGCA CTTGAAACTG CCCATGCTTT TGCTTGGTTA
GATAAATTAT GCCCTACTCT TGAAAAAGAT ACTAATATAG TAATCAATTG CTCTGGTAGA
GGCGACAAAG ATGTTAATAC TGTTGCATCT TCATTAGATA TTTAA
 
Protein sequence
MVSTFSRKDQ NYKNDDLNQP SIEGRFGKYG GQYVPETLMP ALFELEDAAS NAWKDKLFVE 
ELNHLLKTYV GRETPLYEAK RLTEHYKNKQ ATPRIWLKRE DLNHTGAHKI NNALGQALLA
IRMGKKRIIA ETGAGQHGVA TATVCARFGL KCIIYMGAED IKRQSLNVFR MKLLGAEVKV
VNSGTATLKD ATSEAIRDWV SNVETTHYIL GSVAGPHPFP KIVRDFHAVI GEETKKQCLE
SFGSLPDILL ACVGGGSNAM GLFHPFVKET SVRLIGVEAA GSGVDTDKHA ATITKGSVGI
LHGSMSLLLQ DDNGQVQEAH SISAGLDYPG VGPEHSHLKD IGRAEYGSVT DQEALDALKL
VSELEGIIPA LETAHAFAWL DKLCPTLEKD TNIVINCSGR GDKDVNTVAS SLDI
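
The protein sequence can be related back to the gene sequence through translation table 11. The Python sketch below is a hand-written illustration, not the database's own pipeline, and its function names (`clean`, `translate_cds`, `gc_fraction`) are hypothetical. It assumes that table 11 uses the standard amino acid assignments and that an initiator codon such as GTG is read as methionine at the first position, which would explain why this gene starts with GTG while the protein starts with M.

```python
from itertools import product

# Codon table in TCAG order. Assumption: table 11 shares these assignments
# with the standard code and differs only in its set of start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiator codons accepted by table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the spaces and line breaks used in the record layout."""
    return "".join(seq.split()).upper()


def translate_cds(seq):
    """Translate a CDS, reading a valid start codon as M and stopping at '*'."""
    seq = clean(seq)
    protein = []
    for index in range(0, len(seq) - 2, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Return the fraction of G and C bases in a sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "GTGGTAAGTA CATTTTCTCG CAAAGATCAA"
print(translate_cds(first_blocks))  # MVSTFSRKDQ
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.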