Gene PHATRDRAFT_52286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_52286 
SymbolTRPB 
ID7202746 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp779519 
End bp780846 
Gene Length1328 bp 
Protein Length385 aa 
Translation table 
GC content50% 
IMG OID 
Producttryptophan synthase, beta chain 
Protein accessionXP_002182133 
Protein GI219123646 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGCTTG CTACGATTGA GGGTCAAACG CACGAGCTGC CGAGCAAGGA CGGTTACTTT 
GGCGAGTACG GGGGACAGCA ATTGCCTCCG CAACTCGTAG AAATTATGAA CGAGATATCC
GAATCGTACG AGAAACTCAT ACGAACCGAC GCCTTCCAAA TTGAATTGGA CTCCCTCAAC
AAGGACTTCA TCGGTCGACC AAGTCCCATA TTCTACGCAC GCCGTTTGAC GGAAAAAATT
GGTGGTGCCC GCATTTTTTT GAAACGTGAA GACCTGAACC ATACAGGAGC TCACAAAATC
AATCACTGTC TCGGGGAAGC GCTCCTGGCT AAGCACATGG GCAAAACCAA GGTATTGGCT
GAAACCGGTG CAGGACAACA CGGTGTGGCC CTCGCCACGG CCTGCGCCTT GATCGGAATT
GAATGCGAAA TTCACATGGG GCAGGTTGAC GTAGAAAAGT GAGTCCATTG CAATTCACAT
TTTTAGTTTC GTTTATCAAA AAAATTTAAA CTGATTTGGC TTTTTTTCAA TTCCTAACAA
AATTTGGCTT TGTGTTTGCT TTCTTCAGAG AAGCCCCCAA CGTGACCAAA ATGAGAATTT
TGGGTTGCAA GCTCATCACC GTCACTCGAG GCACCCGCAC TCTCAAGGAC GCCGTCGACA
GTGTCTTTGA GGAGTACCTC AAAGATCCGG TCAATTACTT TTACGCCATA GGATCCGTCG
TTGGACCGCA TCCTTTTCCC AAAATGGTAC GAGATTTTCA AAGCATCGTG GGGTGAGAAG
CGCGCGATCA ATTTTTAGAT CTGGAGGGGG GCGCATTGCC TGACGCTATT GTTGCCTGCG
GCGGCGGAGG TTGTAACGCG CGCGGGATTT TTACGGCCTT TTTAGAAGAC CCCGAAGTCC
AACTCATTGG GGTTGAACCT GCGGGCAGAG GTTTGGAGAC CTCCGATCAC GCCGCGACCA
TGACTCTCGG CGTCAAAGGA TCCATCCACG GAATGAATTG CTACAACTTG CAAGATGAGA
CCGGCGAGCC GCTACCAGTA TACAGTATCG CATCCGGGCT GGATTACCCT GGAGTTGGTC
CCCAGCACTG TCTCCTGAAA GACATTGGCC GAACGAAGTA TGTGGCTGTT ACAGATCAGG
AATGCTTAGA TGCATTTATG CAGCTATCTC GTGTCGAAGG CATCATTCCT GCTTTGGAAA
GCGCGCACGC TGTGGCGTAC GCGACGAAGT TGGCCGTGGA AATAGGACCG GGAAAAACTA
TCTTGGTCAA TTTGTCGGGA CGAGGTGATA AGGATGCTGA CTTTGTCGCC AATCGTCTTC
AGTTGTAG
 
Protein sequence
MGLATIEGQT HELPSKDGYF GEYGGQQLPP QLVEIMNEIS ESYEKLIRTD AFQIELDSLN 
KDFIGRPSPI FYARRLTEKI GGARIFLKRE DLNHTGAHKI NHCLGEALLA KHMGKTKVLA
ETGAGQHGVA LATACALIGI ECEIHMGQVD VEKEAPNVTK MRILGCKLIT VTRGTRTLKD
AVDSVFEEYL KDPVNYFYAI GSVIFKASWD LEGGALPDAI VACGGGGCNA RGIFTAFLED
PEVQLIGVEP AGRGLETSDH AATMTLGVKG SIHGMNCYNL QDETGEPLPV YSIASGLDYP
GVGPQHCLLK DIGRTKYVAV TDQECLDAFM QLSRVEGIIP ALESAHAVAY ATKLAVEIGP
GKTILVNLSG RGDKDADFVA NRLQL