Gene PHATRDRAFT_50451 details


Gene Information

Locus tag: PHATRDRAFT_50451
Symbol:
ID: 7199261
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011698
Strand:
Start bp: 107586
End bp: 109923
Gene Length: 2338 bp
Protein Length: 688 aa
Translation table:
GC content: 55%
IMG OID:
Product: predicted protein
Protein accession: XP_002185380
Protein GI: 219130455
COG category:
COG ID:
TIGRFAM ID:
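
The reported gene length agrees with an inclusive coordinate range (End bp minus Start bp, plus one). A minimal Python sketch of that check follows, assuming 1-based inclusive coordinates; the function name `gene_length` is hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates.

    Assumption: both coordinates are 1-based and inclusive, which is
    what the numbers in this record are consistent with.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(107586, 109923))  # 2338, matching "Gene Length"
```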


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTTTACTGTT AATACCCGTT CCATCATCGC AGTCAAAACG TAGCAGAACA CGAGAGAGTT 
GATCCTACCG GAAAGTATCT TTAGCCAGTC AGTCAGTGAG TGAGTAACGT CTACCGGAGG
TGTTTTTGTC AGACAGATGA AACGAATGGT TCACAGAATG AGGCGAATGC CCGTTGCGGA
TTCAAGCCGT CGGCTTGGTC GGGTGGGGTA CTTATTGTTG CTTTTGTGCT GGGCATGCCC
GTCGCAACTC GGACAATCAT TCGTGCTGGA GCCCAGGAAG TCGAAGCGAC GCGTTTCGAT
TCTTTCGAGT CCACTATCCC AAACATTCGT GTCGACAACA TCAGCAAGTC GAACGTCGCG
GTCGCGCAGT CTTGATGTTC TGACGACGGA AGCTCCACCG ACATCCTGTA CTCGCCTCGC
TAATGCTGCG GATGACTCCG ACACTCGCCC CGGTGACGAA GGTTCGTCAT TGGCTGGCCA
AGTTGGCAGT GGTCCCAACT GGATCGAGCG TTCCTTTCCG GTGGACGTCT TGGGTACGTC
CAACGTCGAT CCCAAGAAGG TGGACGACTA CAATCTGGGA ATTTGTGGAC AATCCTACCA
AGTTGGACCA CTCGGGGCAC GCATGTACGA GGCCATCACT CGCAACGTCA ATAGCTCCGT
CCAACTGGCC ACACCCGAAA TCACCAAGGC GTATAAAGTC TACGCCATGG ACTTTACAGC
GAAGGAAGCC GTCCGGGCGG CGTTGAAACA GAACGGTCTC GAAATGGTCC TCACCGAAGA
AGAAGAGGAT GAAGGCATGT GGGCTGATAT TGATTCCATT CGCTTGCTTA CCACCAATGA
GGACGGTAGT GTGACAGTGA CTGGTCCGCT CTACGATTCT TGGCAAGATG CGGTGGACGT
CTGGAAGCCG GGACAGGCTT TCCACTTTGT CGCCCGCCAA GTCCCGGCCA AGATGCGGGA
ATTGGAATTG GACGAGCTTT TGCAAGCATT GGATCCGAAA GGAGAACTAC GCGCCCAAGC
TAAAGAAGCC GGTATGGCCT TGCCGGGTGA AGACATTGAT AGTCTAGCCG TCTTGGCCAG
TGACAATGTC CGACGAGTCG AAACGGCGCC TCGGGACGCC GTACCGGTAG CGGACGCCTA
CGCCGGCACG GACGAAAAAC GCGGTTACCG GGTTGTGTAC GCCAGTGATC TATTACAGGA
TTCCATGAAC GCCGATGGCA CGGAACAGCG GAAAACTCTC ATGCACGTGA TGGAAGCCTT
GGTCGCGCAT GGTTGTGTAA TCGTGGATTT GACGGACGGT GGACTGGCTC TGCAAAAAGC
CTTTGAGATG GCCCAAATGT GGGATACGGC ACAAACTTTC TTCGGACAGC CCGATAAAAC
ATCTTTACCG GGTATGGAAA CGGTCCAAGA AACGGGCTCA ACGCACGCCA AGGCGGGTTA
CGCTTCGTAC GATGACGGCA ATTTGCAGTT TCTCGAAACG CGGTACGACC GGGAAGGAAA
TCTACTTCCG GAAGGGGCGC GAGCACTACT GGGACCCCAG GGCTGCGCGG CTTTGCGAAA
GGCTTTTCAC ATTGTCACGA ATGTAGCCAA GGACGTAACT CGTATTGCGG TCTCGGCGTC
GTCTGTCGAA TACCAAGCAT TGGATGGCGT CGCGGCGTCC GCAGCCGCTA TCCAGCTTAC
CGACGAACTG TTGGACGACG GTGAACCATT GACGGCGAAT ATTCCGCACT CTGAAGGATC
CGTCAGTATG AGTCCACATC GATTGTGCCG CTATTCCAAC AATGGCAGCA ATAGCAAGGA
AAAATCGGAC GACCAATCGG ATACGGTCGA TGCGAAAACG AAAGAAGTGT TCGGTGCACA
CACCGATTCC ACCTTTGTTA CGGCCGTGCC GGTCGCCGCC GTCGCCGGCT TGGAAGTGTA
CGATGAAGAC GCCGAGCAAT GGTACCGCCC CGAATTAGCC GCACTCCGGC ACTGGCAAGG
CTTGCAAAAG GGCTTGGGTT TGGACGTGAA CGAGCTCGTG GAAACGATTG ATGGCCAACA
GTTACCTTGG TTCGCCCGCT ACGTAGTCCT AATGCCAGGC GAACTGCTAC AGATCGCCAC
TCGCAACGAA ATTTTGGCAG CCGTGCATCG CGTGGTGGCG ACCGAGGAGA CCTCCAGTCG
GTTGAGCGCT CCTATCTTGT TGCGGGGACG ACCGGGCACC ACCTGGGACG TATCGCGGTA
TCTCGGCGGT GCTTTAGATA ATCCGCTGCT AGAGGAGATT GACGGCATGA CGATAGAACA
AATACACGAC GCGATGCAAC CGAAATCATC GTCGTTCGAA TAAAATATCA AAGCTACG
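
The sequence is displayed in space-separated groups of ten bases, so it has to be joined before its length or GC content can be compared with the values in the Gene Information table. The following Python sketch shows one way to do that. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demo input is only the first displayed line of the gene sequence, not the full gene.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, used as a short demo input.
first_line = "CTTTACTGTT AATACCCGTT CCATCATCGC AGTCAAAACG TAGCAGAACA CGAGAGAGTT"
seq = clean_sequence(first_line)
print(len(seq))                   # 60 bases on a full display line
print(round(gc_percent(seq), 1))  # GC percentage of this one line only
```

Applying `clean_sequence` to the whole block and taking `len` of the result gives a value to compare with the listed gene length, and `gc_percent` does the same for the listed GC content.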
 
Protein sequence
MPVATRTIIR AGAQEVEATR FDSFESTIPN IRVDNISNLD VLTTEAPPTS CTRLANAADD 
SDTRPGDEGS SLAGQVGSGP NWIERSFPVD VLGTSNVDPK KVDDYNLGIC GQSYQVGPLG
ARMYEAITRN VNSSVQLATP EITKAYKVYA MDFTAKEAVR AALKQNGLEM VLTEEEEDEG
MWADIDSIRL LTTNEDGSVT VTGPLYDSWQ DAVDVWKPGQ AFHFVARQVP AKMRELELDE
LLQALDPKGE LRAQAKEAGM ALPGEDIDSL AVLASDNVRR VETAPRDAVP VADAYAGTDE
KRGYRVVYAS DLLQDSMNAD GTEQRKTLMH VMEALVAHGC VIVDLTDGGL ALQKAFEMAQ
MWDTAQTFFG QPDKTSLPGM ETVQETGSTH AKAGYASYDD GNLQFLETRY DREGNLLPEG
ARALLGPQGC AALRKAFHIV TNVAKDVTRI AVSASSVEYQ ALDGVAASAA AIQLTDELLD
DGEPLTANIP HSEGSVSMSP HRLCRYSNNG SNSKEKSDDQ SDTVDAKTKE VFGAHTDSTF
VTAVPVAAVA GLEVYDEDAE QWYRPELAAL RHWQGLQKGL GLDVNELVET IDGQQLPWFA
RYVVLMPGEL LQIATRNEIL AAVHRVVATE ETSSRLSAPI LLRGRPGTTW DVSRYLGGAL
DNPLLEEIDG MTIEQIHDAM QPKSSSFE