Gene PHATRDRAFT_46563 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46563 
Symbol 
ID7201846 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp725233 
End bp730024 
Gene Length4792 bp 
Protein Length1245 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181062 
Protein GI219120656 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0629816 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GATGGAGTAA CTGCAAATTA ACGTAAGCAT CATTACGGTT TCATCCACAC ACGTTTCTCT 
CCAATACAAA GCATCCTAGG ACCTAGGACC AAGGCACTCC TTCCCTGGTA CGTCTACTCC
GCATGACGAC TCCCACGGAA AGCTATATAC TTAGAGAGCA ACCTGATTGT GCGTGCGGGC
ACGGAAAAAG GACCCAAGAT TTGCCTCACC AAAAAAAAAT GGACGGCACG AAAGGATGAC
CAACGTAAAT CTATGCCAAA AGACCCTACC AGTTTAAACT GAATTTGGAT ATAGTCCTGG
GAGCAATGGT GACTCGTTTA AATTTGTCTC ACATACTTCT GTTCGCTTGT ATTCTTCGAA
GGAGTATCAG AGCATTTGGG GTAGCGTTCT TCATCTCTAT CAGAAATGAC CGTAAAAAAA
AGGGTAAGGT CAAAAAGTGC TATGCTAAAA AAATGATGAT ATTCCTAAGA AAAACCATTC
AAATGTCAAA AGACCCATCG ATATGAGTCT GAAAGTGAAA CTCATTTTGG GCTTTCCCCA
AGTCATACTT TTCTTTGAGG CGTACAATCC CAAAGAGTTC CACCAGATGC TTTGGGAGAC
CTCTCACAGA AGATTGATGG AAATGACCGT GTTGGAGTAG ACCGATCGTG CCCAGACTAG
TACACCCTAC GAGATCCATT TTGCTGAAGA AATAAAAATG ACGGAGAATC GAAGAATGAC
CAACCGATGT CAACAATGTC AAAAAGACCT TGATACCAAG GATTTGGTAA TTGAGGTACT
GTTTGATGTT ACATTTTCAT GCTCTTTTCT GCTCATAATC TTTGTGATCA CAATTGCAAC
TCACAGTCAC CATTTTTGAA AAGCTCTCGA GAGAGAACAT CACAAGCGAA GGCTAACGAT
GTACTCTCAC CTTGTCCTCG GCTTAGCCAT TCTTTTCCTT GGCGCTAGCC AGTGCAACAG
CAACCTGATT CGTGGTAATT CTTCGATGTC TGCGAGCGAA GCCGGCGATT CTGAAGCGCT
GACAACAACC ATATCTCGGC GTCTTGTCGA AAATTTTGTT CCTTTGAGGT GCAATGCTGC
CATCGGTAAA GCCCCTTGTT CATCTTTCGT CACGATGTTT GGCAATGGCG CTGTTCGATC
AAACAGAGTC ATTATTCCAT GTGGCAAGTG CGTTACAATG GACTATCCGG GACCGCAACT
AACACTAAAT GATGGCATCG ACATACGAGG CAAACTAGTG TTTCCAGACG GATACCGACT
CACGGTTCGG ACGAACTTGG TGTCGGTCCA AGGCGAGCTT GAGATGAGAA GTTCTAAACC
AGTTGATGGA AACCCCGATG TACGGTTTGT AATGCAAGGT ACTGAGAGTG CGTCGTTCGC
ACCGATCAGC AGCAACGCCG CAAAATGTGG AGGAAATTGC GATGCAGGAG CTCGATCCAT
TGCAGTAGCT GGTGGGAAGG TCAATCGTAA GTTCGGATTG TTGCGAAAAT GGCATCCGCA
TTTCCTGCAC ACATATACGT GACTGTCGGC ACTTACAGAA ATCCCTAATT GTTGCAGTCA
ATGGCCTACC TACTAGCACA CCAACCTGGC TACACGTTCA CGACGTTATC GGAAGCTCGG
CTATTGTCGT TTCCAATTCG GTCCGTAATA AATGGGGTGC CGGTGCAACT ATCGTCATTA
CCTCCGAGCA CCAAGGTTAC TTTGGCGAGC AAGTTCGTAA AATCACCAGC ATTTCGGACG
TCGGCTCCAA TAGCGTGCGT CTCAATCTCG ACCAACCCAT CAACCGTCCG GTCACGCTTC
GCGACAGCCC AGACTTTGCC ACCGAGGTAG CCTTGCTGTC ACGCAACATT GTCTTTGAAG
GGGCCCCCGG AACAAAGGGT GGCCACTTTT GGATCATGCA CACTCCCCGA GTAAGGCAGC
GTATCGAAGG AGTGGAACTC GTTAACTTCG GACAAGAAGG CCTCTTGGGT AGATACCCGG
TCCACTTTCA TATGTGCGGG GACGTCTCAG GCTCGGTAGT AGTCAAGAAT ACCATTCGCA
ATTCCAACCA GCGCTGCGTT GTCGTCCACG GTACCAACAA CCTCCTTGTT CAAGAAAATG
TAGCCTATTT CACCAAAGGA CACTGTTACA TGTTGGAAGA TGGCATCGAG ACGGGCAATC
AGTTTGTGCG AAACATTGGC ATACGCACTG TCAAACCAGA GATCATCATT CCGAACATGG
GTAGTAACGG CAAGGAATCT GACATGACTG CCAGTACCTT CTGGATCACG AACGCTGACA
ACTCGTGGAT CGGAAATGTA GTAGCCGGAT CCGAAGCCCT GGGCTTTTGG TTTGAGCTGT
TGGTGCGTGG AAATCTGGCC AACGAGCACC AAGACTTTGA TCCTATGATG GTTCCGACCC
GCAAGTTCGA GGGCAACGTC GTTCATAGCG TGTATGGGGT AGGGATGACC TACTACTTGA
GCGGTTACAT CCCGGAGACA CTGCAGTACT TCAAAAACAA CAAGTTCTTC CGCAACCATC
ACCTTGCGCT CCGTATCCAC CGGGCGCGAA ACATCGTTCT GACTGGCAAC AAGTTCTCGG
ACAACAGATA TGCCATTCGA ATCGATCGTG ACGAAGAAAT TCATGTCACC GACACAACCA
TTGTTGGTTA TTCCGATCTA TTCAAGGACG TGGTCAGAAG GAATCGCTTT GCACAAGCAC
CATGCGCCCA AGGGATATCT TTCCAATCGA CTAATCCATG GAAAGACAAG ATGGATTCGG
AGCTCAACGG TGTCATTTTG GACAAAGTCA GATTTTCTGG ATTTTCCAAC GCGGTGTGTT
CGTCTTCAAC CGCAATCGAG CTGGATTCCC GACTCGACGG GTACAAATCG TTCGAGATGT
TCTCGCAGTA CTCCGGCGCC ACCGTAAGTG ACGCAAACTC GATCGACTTT TGTCGTGGAA
AGTCTGCCGG TGCACGTGAT GTGTACGTGT CCGATACCAC AGGTTCTCTT CTCGACGGCG
TTGCCTCTGC TCCTTCTACT CTGATGGTCA ACTCGCCGGA GCTGGCAAGC TTTGTCAATC
CCAGTGCATG TACCGAAAAC GCAGCTCGAT GCTACACCTA CTGTAGCAAC ACTTGCTTCC
GTACCGTTCA CTACTACGTC CCAGTGGGCC AAAGTCGAGA CTACAAGCTC AAAGTATGCG
ACCGCAAGGA CGCGTCCGAC TGTACCGTAC TGTCCGGTTA CGTACACTCT AATCACCCCT
GGCCTCGCCG ATTTGCCGTG CATGTCCCGT CAGGGCGGGA GTACGACACT TACTTCCTTG
ACAAGGGAGT GCCTGTGTAC CCAACGAATG TCGAGATTGT GTTCCAGGAA AAGCTTTGCC
CTACGGCACC GGATGATGAT GATATCGCTC TTCTCTACAA GGCTCGAGGT ACCACCTTTC
CCCCCACGCC TAGTCCGACG ACAGCTCTCG CCGCATGTGG AAACTTGATC GCAAACTCGG
ACTTTGAGCG TGGTTTCAAC GGATATTGGG ATGCACAAGG AGCCGGTACA TTGTCTACTA
CAGCCGGCTA TGAATCTGCC ACAGCTATGT ACTACGCCTC TGGTAATCGC AACCGGTACT
GGGTGGGACC ATCACACCAA TGGCGAGAAG GTCTGGACTT GAAATGTCTA AAGCAGGGTA
CCATATGGGA GTTTTCTGCT CGCTTGAAGC TTGTCGACTC AAAGACTGGA AGGGGAGCCT
CATGTGACCC AGGCTCTTCC TCGGGAGGCG AAATGTGCCC TCAGGTGCAG CTCATTGTGC
GCGACCAATC TTGGACTCAG CATTCTTTCA GGATCAGCGG CTTCGACGGT GGGGACACTT
GGGTAGCCAA TGGGTTTAAC GAGTTCAAAG GCTATTGGAC AATCCCAGCG AATGGTTCAG
GATGGCAAGG GGGTGTCGCA AACATGCGAG TGATACTTTC CGAATTCCCA TTTGGTATGG
ATTTGGTTGT TGACAATTTC AGCATGGTAC TGACGTTTGA CACGAACAAG AGTCTCGCAC
AGTCTCCAAC CTTGAGTCCG GTGCTGGCCG TGCCTCCTCC AACCCAAGCT CCTATCCGCA
TCCCAACTCA GGCTCCTGTC CGCATTCCAG CACAAGCTCC AATCCGCACT CCAAACGAAA
GTCCGCTGAT TGGTCCCGGA ACGGACCTGA ATACATGCGC TAAGATGATC GCGAATTCCA
ACTTTGATCT GGGGTATGAA GGATACTGGT CAGTTCCTAG CGGCGGAAGT CTGTCGAACA
AGGAAGGCTT CAGTTCCTCG ACTTCCATGT ACTACGATTC TGGGAACCGG CGAAGATACT
GGATAGGTCC TGAGTACAAG TGGCCAAATG GTGTTGACTA TGGGTGTTTG TTGCAAGGAA
CAACATGGAA GTTTACCGCT AAATTTCAAC TTATCGACGC TGCAACTGGT AAAGGAAGCT
CTTGCAGCGT TAATTCAAGT AAAGAAGGAG AAATGTGTCC CCGCGTTCGA CTGACTCTCC
GAGACGATGG GTGGACACTT CGAAACGTGA GGATCGATGG ATTCAGCGCG GCGGATACCT
GGGACGCCAA CGGCATGAAT TCGTTCACGG GCTACTGGAC TGTCCCAGAA AACGGCCCGG
ACTGGCAGGG TCGTGTCCGT AATATTCGTT TGGCAATTGC TGATTTTCCC TTTGACAAGA
ATCTAGTCGT TGATGATTTT CAGATGGCGT TTTTCAATAC GAACTAGGAA AACGCCAACA
AACTCAGAAA CCAACAAACG GTTCCAACAA AATATAAACT ACTCTATAGT CG
 
Protein sequence
MYSHLVLGLA ILFLGASQCN SNLIRGNSSM SASEAGDSEA LTTTISRRLV ENFVPLRCNA 
AIGKAPCSSF VTMFGNGAVR SNRVIIPCGK CVTMDYPGPQ LTLNDGIDIR GKLVFPDGYR
LTVRTNLVSV QGELEMRSSK PVDGNPDVRF VMQGTESASF APISSNAAKC GGNCDAGARS
IAVAGGKVNL NGLPTSTPTW LHVHDVIGSS AIVVSNSVRN KWGAGATIVI TSEHQGYFGE
QVRKITSISD VGSNSVRLNL DQPINRPVTL RDSPDFATEV ALLSRNIVFE GAPGTKGGHF
WIMHTPRVRQ RIEGVELVNF GQEGLLGRYP VHFHMCGDVS GSVVVKNTIR NSNQRCVVVH
GTNNLLVQEN VAYFTKGHCY MLEDGIETGN QFVRNIGIRT VKPEIIIPNM GSNGKESDMT
ASTFWITNAD NSWIGNVVAG SEALGFWFEL LVRGNLANEH QDFDPMMVPT RKFEGNVVHS
VYGVGMTYYL SGYIPETLQY FKNNKFFRNH HLALRIHRAR NIVLTGNKFS DNRYAIRIDR
DEEIHVTDTT IVGYSDLFKD VVRRNRFAQA PCAQGISFQS TNPWKDKMDS ELNGVILDKV
RFSGFSNAVC SSSTAIELDS RLDGYKSFEM FSQYSGATVS DANSIDFCRG KSAGARDVYV
SDTTGSLLDG VASAPSTLMV NSPELASFVN PSACTENAAR CYTYCSNTCF RTVHYYVPVG
QSRDYKLKVC DRKDASDCTV LSGYVHSNHP WPRRFAVHVP SGREYDTYFL DKGVPVYPTN
VEIVFQEKLC PTAPDDDDIA LLYKARGTTF PPTPSPTTAL AACGNLIANS DFERGFNGYW
DAQGAGTLST TAGYESATAM YYASGNRNRY WVGPSHQWRE GLDLKCLKQG TIWEFSARLK
LVDSKTGRGA SCDPGSSSGG EMCPQVQLIV RDQSWTQHSF RISGFDGGDT WVANGFNEFK
GYWTIPANGS GWQGGVANMR VILSEFPFGM DLVVDNFSMV LTFDTNKSLA QSPTLSPVLA
VPPPTQAPIR IPTQAPVRIP AQAPIRTPNE SPLIGPGTDL NTCAKMIANS NFDLGYEGYW
SVPSGGSLSN KEGFSSSTSM YYDSGNRRRY WIGPEYKWPN GVDYGCLLQG TTWKFTAKFQ
LIDAATGKGS SCSVNSSKEG EMCPRVRLTL RDDGWTLRNV RIDGFSAADT WDANGMNSFT
GYWTVPENGP DWQGRVRNIR LAIADFPFDK NLVVDDFQMA FFNTN