Gene PHATRDRAFT_50238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50238 
Symbol 
ID7199015 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011696 
Strand
Start bp70649 
End bp73068 
Gene Length2420 bp 
Protein Length625 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185118 
Protein GI219129906 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TAATGACGAT CACATTCAAA GGAACAGCAC AAGATTGAAC CACTGTTCAG ATAGCCACAA 
ATTGTTAAGC GAAAACCGAT TCAGACAACG AAGAAGAGCG TCCAGTAGAA GCGTCCCCCA
GATACAGGAA CTTGCTGATG AGACGGCCTT TAGCTTGGTC GACTCATTTT GTCGGTCTGC
TAGGTGGTTT TGTCGTTCTT TCAAATCAAA CGGACGCAAG CTGGGTAGAT CCGGACACGA
AGCAAGCTCA TTTGACGACG CAACCCCTCG CAAATGAAGA CAACCGAGCG TACGAGCTAG
TGAGTAGTCT TTACTGCGTT TTGGTTTTGG TTGTGCTGAC TGGATATGTT TCGAAGTAGC
TCATCCAATT CGATTTTCAC TTGTTCCACG CTACAGGTGT TTTCGGATGA GTTTGAGCAA
GCCGGGAGGA AATTTGCAGA TGGGAGTGAT CCTCGATGGA CAGCAATCAA CAAAAATGAC
TGTATGTGAA TATTCAAAAG CGGTCGACAA GTATTGCCAG TTTCTCTTTT ACTCACATTT
TTCCATTCGT TTTGTAGATA CGAACGAAGC CTTGCACTTT TACAGTCACG ACAACGCGCA
CACGTCCAAT GGGGTGTTGA ATATTACTAC CGTTGAAAAG GAAAACGTTT ATAAAGCATT
TAACGAGCAC ACAAAGCAGT TCTACGCCGA TAAGAAATAT GTGCAGAGCG GGATGCTACA
GAGTTGGAAC AAGTTTTGTT TCGTTGGCGG CATCGTCGAG TTTTCGGCAA AACTACCCGG
AGATCCACAC AAAGGAGGAT TGTGGCCCGC TCGTAAGTCC AGGAAGTGCA CTATCTGCTT
CTCTTCGTCG TTTGGCTCCA ACTCTGACCT TTTCTCGATC GTAATAAGTG TGGATGCTCG
GAAACTTGGC TCGGGCAACC TATGTAGGTT CTTCCGACTA TGTTTGGCCA TTCAGTTACA
ACCAATGCAA TCCCAAGACC CGACAGAGCC AAGAGATCAA CGCTTGTTCT TTAGTAAACC
ATTACGGATT AGCTCCGAAA AGCGGTCGAG GTTCTCCTGA GATTGATATC CTGGAAGCGA
TGCAAGGCGA ACCGGGAGAT TTACCAAATA CTTTCATCCA ACGACCATAC CAATCAACCT
CATTACAAGT GTCACCTGGC ATTGAAATAG ATCGGCCTAT TCTGGGAAAA CGACCACACG
AGGTTCGTTT GTATCCGTCG TTTTTCTTTT GTTTTTTGGT TCTCTTCCAA CAGTCTGTAT
TTCTTGTTTT TCTTTTAGGG TCACTGGTAC CCAGACTTAG AATATAGCAG TCAGAACAAA
TCAGACTTGA ATCCGTTCTT TTACGGAGTC ACGCTGGTGC ACAAGCCTAA CTCTTACACG
TACCAGTCAG ACGCCCTGTC GGCAAACTTG CAGCTAAAAG CCACGCACTA TAGCAAACAG
CATGTCTACC GAGTCGAATG GGAACCACCG GCAGAAGACG GCACAGGAGG CTACATAAAG
TGGTTTACCG ATGGGGAGCT TATCTACGGG ATTCATGGCA AAAGTCTTGA CATCATGAAG
ACGGAGATTC CTAGCGAACC GATGTATTTG TTAATGAATA CGGCAGTGTC AAGTCACTGG
GGCTTTCCTC AGCCATGCCC TGAAGGCTGT TCTTGCAAGT GCTTTGAATG CGGGAACCCA
GAGTGTGCGT GTGCAATACC GTCAGGGTAT TGCGACAATT TTCCTGCCTC CTTTGAGATC
GATTATGTTC GTGTGTATCA GGCTATCAAT GAATCTAAGC ACATTTTGGG ATGTTCACCA
GAGGCACGGC CAACTGCAAC GTTTATCGAA GGACATGCAA AGCGATATAT GACAGAAGGG
CAGCGACGGC CGCTAGAACC CGTCGTGACA GGCGGTGGGA GCTGTTCTAG CCACAAAGAC
TGCGGAGGAA TCGAGCGAGG CGTTTGCTCA GCTTCAGGTC TCTGTGAATG CTCGGAGTAC
TCAGCTGGTC CGCTGTGCTT AGCACATGCA GCCTTCTACG ATTTTGATAC CAGCAAACAA
CCAAAAATAT TTTCATGTAA GTTCGCGTGC TTACCTTGTT CAGCAGGCTG GTAAACGACA
ATTTAACCAC TCTTTTGTCC ATCCGTCCAC CTATAGATCG TCACATACAC TTCCCATCGA
GTCTCATGGT AGTGGTCAGT TTGCTGATTG GAGGCTTTCT GTTGTCGATG GCTTCGGCGG
TAAGGGAAAA GTCGAAAGAG CCGAAATACA GCAACGTGAA TGGGGGAACG AGTAATTTAT
CTTTCCAGAC GACAGGATCT GGAGCTGGTG TCGGCTCTTA CCAGAACCCA GATGGTGCAA
CTTTCACTGT ACCTGCAAAT CAAAAGGATG TGACCTATTG CGTCATCGAT GGACGACTAG
TCGATCAAGA CCATAACTAA
 
Protein sequence
MRRPLAWSTH FVGLLGGFVV LSNQTDASWV DPDTKQAHLT TQPLANEDNR AYELVFSDEF 
EQAGRKFADG SDPRWTAINK NDYTNEALHF YSHDNAHTSN GVLNITTVEK ENVYKAFNEH
TKQFYADKKY VQSGMLQSWN KFCFVGGIVE FSAKLPGDPH KGGLWPALWM LGNLARATYV
GSSDYVWPFS YNQCNPKTRQ SQEINACSLV NHYGLAPKSG RGSPEIDILE AMQGEPGDLP
NTFIQRPYQS TSLQVSPGIE IDRPILGKRP HEGHWYPDLE YSSQNKSDLN PFFYGVTLVH
KPNSYTYQSD ALSANLQLKA THYSKQHVYR VEWEPPAEDG TGGYIKWFTD GELIYGIHGK
SLDIMKTEIP SEPMYLLMNT AVSSHWGFPQ PCPEGCSCKC FECGNPECAC AIPSGYCDNF
PASFEIDYVR VYQAINESKH ILGCSPEARP TATFIEGHAK RYMTEGQRRP LEPVVTGGGS
CSSHKDCGGI ERGVCSASGL CECSEYSAGP LCLAHAAFYD FDTSKQPKIF SYRHIHFPSS
LMVVVSLLIG GFLLSMASAV REKSKEPKYS NVNGGTSNLS FQTTGSGAGV GSYQNPDGAT
FTVPANQKDV TYCVIDGRLV DQDHN