Gene PHATRDRAFT_50428 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50428 
Symbol 
ID7199227 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011697 
Strand
Start bp331067 
End bp333517 
Gene Length2451 bp 
Protein Length816 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185311 
Protein GI219130311 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATATT GCATATTCTC CTACGAATTG ACCGGAAACC GCGCGTTGTA TCTTGGCGAC 
GGTGATCGAC ACGAAACCGC CTTCAACCAA TACGAAGTCG TCGTTCCCTT CAACGCCTAT
CGAGACCCCG AGCTCGCTGC CAACACGGAC GGGCATTGCC GTTACTCTCT ACACATCTAC
CCCAGTCAGC AGTTTGCTCA GGGATACAAG TCCAGCCTTC CCATCGTCTT CACTTCCCTC
GTGGCAGCCA CCTTCTTCCT CATGGCTCTG ACCTTCTTGG TCTACGACCG CTTCGTCCAC
CGCCGTAACA TCAAAGTGGT TAATGCCGCT GCTCGATCTA ACGCTATCGT GTCATCGCTG
TTCCCTTCTA ATGTCCGTGA TCGCTTGTTT GAAGATGCCA AGGCAAGAAG CGACGTCAAC
CAGGCCGCCC ACTCTCGTCT CAAGACGTTC CTGCACAATG GTGACTCCTC TGATACCGCC
ATCACTGACG AGAATGCGCA TCACAGTGAC TTCTTCAAGA CCAAGCCCAT TGCTGAGCTG
TTTCCCCATA CTACCATCAT GTTTGCAGAT ATCAGCGGCT TCACGGCATG GAGCTCGTCT
CGAGAGCCGG CACAAGTCTT CCAGCTGCTG GAGACCCTGT ACCACTCGTT CGACGAGACG
GCCAAGAAGC GTCGTGTCTT CAAAGTCGAG ACGGTCGGCG ACTGCTACGT TGCCGTTGCT
GGACTGCCCG ATCCGCGAAA GGATCATGCC GTAGTCATGG CTCGATTTGC CAAGGACTGC
ATGCACCAGA TGCACTCGCT GACAAGAAAG CTGGAAGTAT CTCTAGGCCC GGACACGGCC
GACTTGTCCC TACGAATTGG GTTACACAGC GGACCCGTGA CTGCCGGTGT CCTGCGCGGG
GAGCGGTCTC GTTTTCAGCT CTTTGGAGAC ACCATGAACA CGGCAGCAAG GATGGAAAGC
AACGGAATTC GCGGCCGCAT TCAAATCTCG CAGGAGACGT CCGATCTGCT TGCAGATGCT
GGCAAGACCC AGTGGTTCGT TGCACGCGAG GATACGATTG TGGCCAAGGG AAAGGGCGAG
CTCAACACAT TCTGGCTTTC TGTGGGAGAT GTGGGTAAGG GGAGGTCGAC AACCGACACG
ACGCACAGTA GCGACGATGT TCTTGCGCCC AACAACTATA ACAGCTCTGT AGCGTTGGAT
AGTCTGATGA CGACCAGTGC CGAGTCAGAT CAGCAGGTCT ACAACCTTGT TTCGAATAAG
ACCTCGCGTC TCATCGACTG GAACGTCGAT GTACTATCGC GATTGATCAA ACAAATCGTG
GCGCGTCGCA AGGCCTCCAA GGTACCAAAG AAGGACTCGT CCAAGCAATA CTTCTGTCCC
GGCGACAACC GAGGAGCCGG GACCACGGTC CTGGACGAGG TGACGGAGAT TCTGGCTCTG
CCGGAGTTCG ACGCGGACGC TGCTCGTCGC CAGCAGGATC CTGAGAACAT CGAGCTAGAT
GACAGTATCA CGTCACAGCT GCAGCAGTAC GTGTCCAATG TGTCTGCCAT GTACCGGAAC
AACCCCTTCC ACAACTTTGA GCACGCCTCG CACGTGACCA TGTCAGTGGT AAAGCTGTTG
TCCCGTATTG TGGCTCCCGT GGATGTGGTG GTGTCGGACG GGAAGAACCA GAAGCGGTCC
TTCGCCTCCA AGCTACACGA TCACACGTAC GGCATCACGT CGGATCCTCT GACTCAGTTT
GCTTGCGTCT TTTCGGCACT CATTCATGAC GTCGACCACA GTGGAGTTCC CAACGCGCAG
CTAGTGAAGG AGAACAGTAA GATTGCTACA TTCTACCAGG GGAAGAGTGT TGCCGAGCAG
AACTCCGTGG ATCTAGCTTG GGACCTGTTG CTAGACGACA GCTTCAAAGA TTTGCGAGCC
GCCATCTTTG CCACGGACGT GGAGAAGGCC CGGTTCCGAC AGTTGGTAGT CAACTCCGTC
ATGGCAACGG ATATCATGGA CCCGGATCTC AAGGCTATTC GTAACGCACG CTGGGAGAAG
GCCTTCACAG CCTCCCCAAA TATGCAGGAA GACCTGAAAG ACTTGACAAA CCGCAAGGCG
ACGATTGTAA TCGAGCACCT GATCCAGGCC TCCGATGTGG CACACACCAT GCAGCACTGG
CACATATACC GGAAGTGGAA TGAGCGACTG TTCCTGGAGC TGTACCAGGC CTACATAGCC
AGTCGAGCAG AAAAGAGCCC GGAAACATTT TGGTACAAGG GTGAGCTGGG CTTTTTCGAC
TTTTACATCA TTCCACTGGC TATGAAGCTG AAGGAATGCG GTGTATTTGG AGTGTCGAGC
GACGAGTATC TGAACTACGC GATGCGCAAT CGCAAGGAAT GGGAAGACCG TGGTCAGGAA
GTGGTGCGCG AAATGATGGA AAAGATCAAA GGTAGGGTGA AGCGTTATTG A
 
Protein sequence
MPYCIFSYEL TGNRALYLGD GDRHETAFNQ YEVVVPFNAY RDPELAANTD GHCRYSLHIY 
PSQQFAQGYK SSLPIVFTSL VAATFFLMAL TFLVYDRFVH RRNIKVVNAA ARSNAIVSSL
FPSNVRDRLF EDAKARSDVN QAAHSRLKTF LHNGDSSDTA ITDENAHHSD FFKTKPIAEL
FPHTTIMFAD ISGFTAWSSS REPAQVFQLL ETLYHSFDET AKKRRVFKVE TVGDCYVAVA
GLPDPRKDHA VVMARFAKDC MHQMHSLTRK LEVSLGPDTA DLSLRIGLHS GPVTAGVLRG
ERSRFQLFGD TMNTAARMES NGIRGRIQIS QETSDLLADA GKTQWFVARE DTIVAKGKGE
LNTFWLSVGD VGKGRSTTDT THSSDDVLAP NNYNSSVALD SLMTTSAESD QQVYNLVSNK
TSRLIDWNVD VLSRLIKQIV ARRKASKVPK KDSSKQYFCP GDNRGAGTTV LDEVTEILAL
PEFDADAARR QQDPENIELD DSITSQLQQY VSNVSAMYRN NPFHNFEHAS HVTMSVVKLL
SRIVAPVDVV VSDGKNQKRS FASKLHDHTY GITSDPLTQF ACVFSALIHD VDHSGVPNAQ
LVKENSKIAT FYQGKSVAEQ NSVDLAWDLL LDDSFKDLRA AIFATDVEKA RFRQLVVNSV
MATDIMDPDL KAIRNARWEK AFTASPNMQE DLKDLTNRKA TIVIEHLIQA SDVAHTMQHW
HIYRKWNERL FLELYQAYIA SRAEKSPETF WYKGELGFFD FYIIPLAMKL KECGVFGVSS
DEYLNYAMRN RKEWEDRGQE VVREMMEKIK GRVKRY