Gene PHATRDRAFT_50333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50333 
Symbol 
ID7199072 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011696 
Strand
Start bp338149 
End bp340182 
Gene Length2034 bp 
Protein Length619 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185176 
Protein GI219130026 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GAAAAAGGAA GCACGGGAAC AAGTCATAAC AGCGTCGAAG CGTTCTTGTA CTGGATTGCA 
ACGTGAAAGC ATTGTTTCTT GACTACCCAC AAAGTGCTAG CTGGACGAAT ACAATACCAG
GAGTAACGCG CAGGTCCTAA GACGAGGATC CTTTTCGATA CTCAATCAAT TATTATGGAC
CCGTCTATCG GTAGTGTTCA GGATGACCCT CCGTTGTTGC ACCCTTTGGA GCATTCTCAA
AAGGAGCGGA TCGTCAACGA CATGGACAAT GCCGTCCGCC TTGACTCGAA TAACAGCGAT
AGAGCAGCGC AGACCTTTCA ACTTCTAGGA CGTCAATCCT CTGATGACAC CCGCCTGATA
CGGCCGTGCC CACAACGCCA GGAGTTTTCT TCCTGCTGGG TATCGGACAC GACCACCCCC
ATGACGAGTC GTCCTCACTT TACCCTATCA GATAGCCCGC GGCCATTGCG GCGCGTCAGA
TCTAGATCGC TGGGAGCGTC AATCGAACAC ACCGACATCG AGTGGAGTTT AATATCGCCA
ATGGATCGGT CGCTCATCCC GAATCAATTG AAATCGTCGA TAAAGCCGAC TATGGACCTA
GAATTGATCA ACGCTGCAAC ACCGGAAGGG AAGAAAGAGG AAACAAGGAA AATTCAGGGC
AAAACTACTC CCGTTTTCAT TTCCGCCATG TACGGAATGA TCAACGCCAC AATTGTATTG
CCCGTCCTTA TGAGTTTCGG TGCCATCATT TATCGTGATC AAGCCTTTTC GCCCTACATG
CCTGTTTTGG TTAAACTTAC AGTCGTATCA GGAGTGGTGC ATCAGCTGTG CTTTTCGACG
TTGTCTTCGC TACCTTTCGC GGTCGGGCAG GTGCAAGACG CCGGCTTAAT TTTCTTGTCG
AGCATGGCAT CCCACATGGT GGAGCACTGT CGCAGCCGTG GATACGACGA CGAAACACTA
TTGGCGACCA CGACCATAGG TCTCAGTTTA TGCACAGCCC TACTGGGTTG TGGATTGGTG
TTGATTGGAC AATTTCAACT GGCCCAGTAC GTACAACTCC TGCCAACTTC TGTTGTAGGT
GGCTACTTAG CCTTTATTGG TTGGTTTTGC GGGATGTCCG GTGTTGGGCT CATGGCTGCT
TCGACTGAAG TTTCGTTTGC CGTTCTTCTA GACAACTGGC AATTTGTCGT ACCGGGAATT
GCCGGAGGCG TTGTCATTTA TGTATCGGTG CGCTATCTTC GGCATATGGC TGTTTTGCCT
ACTTGTATTG CTGTACTTCT ACTGCTTTTT TACAGTACTT TGTGGGCCAC TACTACTTCG
ATCGATGAGG CGGCCAAATC AGGATGGATT CGGGAAACAG ACGCCCCTCC ACCATGGTAC
AAAACGTGGG AGTATCTGAA ACTGGACAAG GTGGCCTGGT CAGTGATTCC CGAACTAGTG
TTAACGGAAT TGAGTATGAT CTTTGTTGTG GCGCTGTCCA GTTCATTGGA TGTGGCCGCC
ATCGAACTGG AACTTAAAGA ACCGCTGGAC TATAATGGCG AGCTCAAGAT GGTGGGTTTG
TCAAATCTCG TTAGCGGTCT GACGGGAGGC TACACGGGCA GCTACATCTT CAGTCAAAGT
ATCTTTTCCT TACGGGCAGG CATTCGGTCG CGGATCGCCG GCTATGTCTT GGCTGCGTGT
CAAGTAGTAT ATCTGCTCGT CCCCTTTCCC ATTCTGGCGT ACGTACCGAA CTTTTTCTTT
GGGTCGCTCC TGTCAATGAT TTGTGTCGAC TTGATGTATG AATGGTTGTG GGATGTGCGG
AACAAAGTAA CGCCCGTCGA GTACATGGTT TGTTTGGCCA CCTTTGGTCT TATTCAGGTA
GCGGGTGTCG AGTACGGAAT TCTGCTCGGT GTCGTGGTCT TCTTATTATG TCAACGTCTT
GGTTTCGACG TCGGAAATCA ACGGCAAAAT GCAGAGCTCG ACGAAGCCGT CGACGCTCCT
TCTATCCCAA TCAACAGTAC CGACGGCAAC CCCCAACGCT ACGGTTCGCT GTAA
 
Protein sequence
MDPSIGSVQD DPPLLHPLEH SQKERIVNDM DNAVRLDSNN SDRAAQTFQL LGRQSSDDTR 
LIRPCPQRQE FSSCWVSDTT TPMTSRPHFT LSDSPRPLRR VRSRSLGASI EHTDIEWSLI
SPMDRSLIPN QLKSSIKPTM DLELINAATP EGKKEETRKI QGKTTPVFIS AMYGMINATI
VLPVLMSFGA IIYRDQAFSP YMPVLVKLTV VSGVVHQLCF STLSSLPFAV GQVQDAGLIF
LSSMASHMVE HCRSRGYDDE TLLATTTIGL SLCTALLGCG LVLIGQFQLA QYVQLLPTSV
VGGYLAFIGW FCGMSGVGLM AASTEVSFAV LLDNWQFVVP GIAGGVVIYV SVRYLRHMAV
LPTCIAVLLL LFYSTLWATT TSIDEAAKSG WIRETDAPPP WYKTWEYLKL DKVAWSVIPE
LVLTELSMIF VVALSSSLDV AAIELELKEP LDYNGELKMV GLSNLVSGLT GGYTGSYIFS
QSIFSLRAGI RSRIAGYVLA ACQVVYLLVP FPILAYVPNF FFGSLLSMIC VDLMYEWLWD
VRNKVTPVEY MVCLATFGLI QVAGVEYGIL LGVVVFLLCQ RLGFDVGNQR QNAELDEAVD
APSIPINSTD GNPQRYGSL