Gene PHATRDRAFT_32453 details


Gene Information

Locus tag: PHATRDRAFT_32453
Symbol:
ID: 7196983
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 2507457
End bp: 2510862
Gene Length: 3406 bp
Protein Length: 1120 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002176992
Protein GI: 219110481
COG category:
COG ID:
TIGRFAM ID:
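
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this explicitly) and that the protein is followed by a single stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2507457
end_bp = 2510862
protein_length_aa = 1120

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Bases needed to encode the protein plus one stop codon (assumed).
coding_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that the coding sequence does not account for.
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp)  # 3406
print(coding_bp)       # 3363
print(non_coding_bp)   # 43
```

Under these assumptions the computed span matches the listed Gene Length of 3406 bp. The 43 bp not accounted for by coding sequence would be consistent with the gene being marked as spliced, if the gene sequence covers the full genomic span.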


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTCCCG CCACCCGGCA AATGACGAGT GCAGCCGTCT ATGCCCACCT TTTGGACAAC 
GTACTTCTTC CCCAAGGGCA TCCTATCCGT CTCAGTTTTG AGCAGCAAGG ATATGAATCG
GCTGATGATC TCCTGTGTAT TTTTGAGAAT GAACTTGAGT CTCTTGGATA CACTCCTTCT
GTCCTTCCCG ACGGCCCGGA AAACCCGCCT ACCATTCCCC TTCTCATGGC GCACCGACAG
ATCATACGTC ATTTCTTGCG CTGGCAGGCA TCTTTGGAAC AACAAAAGGG GACACCTTTG
AAGAACTCCG AGCTTGTTGC ACTTAACAAT GAAGATTTTG TCATTTACCG TCGCTCAGCC
CTTGGTCAAG TCTCGACAGC AACTGCACCG GTTAATGCTT CCCCAACTGT TCAGAGCCCC
ATAGGAAAGA CACATTTGGC TGTTGAGGAC TTCAAGCGTG GGATCAAACG TGACAAAACT
CACTATCCCG TGCTTAAAGA TGATCGGTAC TGGGACAACT TTTATCGGTC GTTTGTTGTT
ACTGCCGTAA CACATAACGT TGACAAAGTT CTAGATCCGA CGTACATCCC TACCAATCCT
TTGGAGAAAT CCCTTTTTGA AGAACAGAAC AAGTTTGTAT ATTCTGCTCT AGAGCATACT
CTCCAGACGG ACATGGGCAA GAACATTGTA CGCGAGCATA GTTTCGACTT CAATGCCCAG
GAAGTTTTCC GTAAGGTTGT GAAACACTAC ACAGAGTCCG CTAGCGCGAA GATTAGTTCG
TCTACTACCC TGGGATACCT TACAACTGCA AAGTACGGAT CGTCATGGAC TGGCACAGCA
GAAGGTTTTA TTCTTCACTG GAAAAATCAC TTGCGCATCT ACAACGACAC TGTTCCTGCT
GGTGAACAGC TCCCTCAGCA ACTATGCCTT AGTCTTTTGG AGAATGCTGT TCATGATGTA
CCTGAGCTTC GACAGGTAAA AATCACTGCA ACTCTTGACT TAGCAAAGGG AGGTAATCCT
ATTAGCTATG ATGGTTATCT CAGTCTACTA CTCGCATCGG CATCACTCTA CGACAACGGC
AATAATCTAT CTAATTCTCG TAGTGGCAAG AACAAGCGCA ACATCTATGC TAATGAACTA
GAGTACAATC CGATGGATTT TGAGAGTAAA CCGGATGTAG ACTATGATAT AGATGTGTCA
CCGACCGCAA TCTACAAAGC CAATGCTCAT GCCCGTAACA GCAGTTCCCG GAGTCGTACT
CCGGCAGCTA ATCGCGAGCG ACCTTACATC CCTCGTGAAA TGTGGAACCT ACTCTTCGAC
GATGCCAAAG CCATCCTCCA AGGCTTAAAA GCCCCCGGGA AGCAGGCCCC ATTGAATAAT
AGTTCGCCAC ACCAATCGTT GCAGACCAAT ACGCACGATA CCATTGGCGC GGAACAAATC
ACAACGGACA CCTTCCATGA TTGCGCACCC GAAACTGAAT TGCTTGCCCA CCTGACTGAG
CGTGTTAGTC GCATGAGCGA CGGCGACATA CGTAACGTTC TTGCCGCATC TCGTGATGGT
CCCCCCTATG ATGAGCCCAA ACCACTGCAA TCTAACGTAC TTCAATATCA AGTGTCTCGT
CACAACGTCA TTGAAACTAC GGCAGCCCTC GTCGACCGTG GAGCCAATGG AGGTCTTGCC
GGCAGTGATG TCATGGTCTT GCACAAAACA GGTCGTTCTG CAACCATCAC AGGCATCAAC
GATCATACCT TGTCCGATTT GGACATTGTC ACCGCTGCTG GCTACACTGA ATCCCAAAAT
GGCCCCATCA TTCTCATTAT GAACCAATAC GCCCATTTGG GACAGGGTAA AACTATCCAC
TCCAGTGCAC AGCTTGAACA CTATCGCAAC CATGTCGAAG ACCGTTCCCG TACCGTAGGA
GTTAACCAGC GAATTGTAAC ATTGGACGAC TACATCATCC CATTGCACAT TCGACAAGGA
CTCGCGTATA TGGATATGCG GCGCCCTACC GACAAGGAAC TTGCGTCCCT TCCACACGTT
GTCCTAACCT CCGACGTCGA CTGGGATCCC TCCGTACTTG ACCACGAAAT TGATCTCGCG
ACCTCTTGGT ATGATGACAT ATACGATTTG CCTCAATCAC CTTACGTCGA ACCATGTTTT
GACCATACAG GCAAATACCT CCATCGTCAC ATTTCCTTTT GCAACCATCG CGATGACGCC
GTTGACCGTG TCTTATATTG CCAACAGCAC CTCGTCACGA AAAATGTGCA AGATTATGAG
GCCCTTCGTC CGTGTTTTGG ATGGGTCTCT GCTGAAACCG TTCGCAAGAC CATCATGGCG
ACCACGCAGC ATGCACGCGA AGTATATAAC GCTCCGTTAC GCAAACATTT TAAGTCTCGC
TTTCCCGCTC TAAATGTACA CCGTCGTAAT GAACCAGTTG CTACCGATAC CATTTGGTCC
GACACCCCTG CTGTCGATAA TGGTGCTAAA TTTGCACAAC TTTTCGTTGG TCGACGGTCC
CTTGTCACCG ACGCTTACCC CATGAAAACT GATAAAGAGT TTGTCAATAC CCTTGAGGAC
CATATCCGTT ACCGGGGTGC CATGGACAAA TTGATTAGCA ATCGTGCCCA GGTTGAAATC
AGCAAAAAGG TCACCGATAT TACACGCGCA TATAATATCG ACCAGTGGCA AAGTGAACCA
AACCATCAAC ACCAAAACTT TGCCGAACGT CGTATCGCCA CTATCGAGGC TAATACCAAC
AACATTCTCA ATCTTTCCGG TGCCCCTGAT TCCGCCTGGT TACTTTGCGT GACATATGTT
TGTTATGTTT TCAACCATTT GGCACATGAT TCACTAGATA ACCGCACTCC CCTTGAAGTC
CCCACCGGCT CCACGCCTGA TATCAGTGTT CTCCTTCAGT TTCATTTTTG GGAACCGGTC
TATTATAAGC TCGAAAATGC GACATTTCCT TCTGGTGGTA CTGAACAACA AGGACGTTTT
GTTGGCATCG CCGACTCCGT CGGCGACGCT CTCACTTATA AGATCCTTAC CCACACCACC
AATCGCATTC TTCATTGCTC TAGTGTCCGT TCTGCGACCA TTCCCGGACA AACCAACCTA
CGCCTTACGC CACAGGATGG GGAGAGTGGT CCTAAACCCA TCAACTTTAT CAAGTCGCGT
AGAACCGAAA ACAAAAATTC CTATGCCATT AAGGAGTTGC CTGGTTTCAC ACCTGATGAC
CTTATAGGTT GTACGTTCCT CACCGACACT CGGGATGATG GGGAGCGTTT GAAGGCACGA
ATCACGCGGA AAATATTGGA CCCAGACAAG CCCTCGGATG TAAAGGTCCT TGTCGAAATC
AATGATGGTG AATATGACGA GATTCTAGCA TACAACGAAA TTCTAG
 
Protein sequence
MVPATRQMTS AAVYAHLLDN VLLPQGHPIR LSFEQQGYES ADDLLCIFEN ELESLGYTPS 
VLPDGPENPP TIPLLMAHRQ IIRHFLRWQA SLEQQKGTPL KNSELVALNN EDFVIYRRSA
LGQVSTATAP VNASPTVQSP IGKTHLAVED FKRGIKRDKT HYPVLKDDRY WDNFYRSFVV
TAVTHNVDKV LDPTYIPTNP LEKSLFEEQN KFVYSALEHT LQTDMGKNIV REHSFDFNAQ
EVFRKVVKHY TESASAKISS STTLGYLTTA KYGSSWTGTA EGFILHWKNH LRIYNDTVPA
GEQLPQQLCL SLLENAVHDV PELRQVKITA TLDLAKGGNP ISYDGYLSLL LASASLYDNG
NNLSNSRSGK NKRNIYANEL EYNPMDFESK PDVDYDIDVS PTAIYKANAH ARNSSSRSRT
PAANRERPYI PREMWNLLFD DAKAILQGLK APGKQAPLNN SSPHQSLQTN THDTIGAEQI
TTDTFHDCAP ETELLAHLTE RVSRMSDGDI RNVLAASRDG PPYDEPKPLQ SNVLQYQVSR
HNVIETTAAL VDRGANGGLA GSDVMVLHKT GRSATITGIN DHTLSDLDIV TAAGYTESQN
GPIILIMNQY AHLGQGKTIH SSAQLEHYRN HVEDRSRTVG VNQRIVTLDD YIIPLHIRQG
LAYMDMRRPT DKELASLPHV VLTSDVDWDP SVLDHEIDLA TSWYDDIYDL PQSPYVEPCF
DHTGKYLHRH ISFCNHRDDA VDRVLYCQQH LVTKNVQDYE ALRPCFGWVS AETVRKTIMA
TTQHAREVYN APLRKHFKSR FPALNVHRRN EPVATDTIWS DTPAVDNGAK FAQLFVGRRS
LVTDAYPMKT DKEFVNTLED HIRYRGAMDK LISNRAQVEI SKKVTDITRA YNIDQWQSEP
NHQHQNFAER RIATIEANTN NILNLSGAPD SAWLLCVTYV CYVFNHLAHD SLDNRTPLEV
PTGSTPDISV LLQFHFWEPV YYKLENATFP SGGTEQQGRF VGIADSVGDA LTYKILTHTT
NRILHCSSVR SATIPGQTNL RLTPQDGESG PKPINFIKSR RTENKNSYAI KELPGFTPDD
LIGCTFLTDT RDDGERLKAR ITRKILDPDK PSDVKHTTKF