Gene PHATRDRAFT_231 details

Gene Information

Locus tag: PHATRDRAFT_231
Symbol:
ID: 7201612
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011678
Strand:
Start bp: 38474
End bp: 41667
Gene Length: 3194 bp
Protein Length: 970 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002180935
Protein GI: 219120392
COG category:
COG ID:
TIGRFAM ID:
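
The derived fields in this record can be cross-checked against the coordinates and the sequence. The sketch below is illustrative only and is not part of the database's own tooling. It assumes that Start bp and End bp are 1-based, inclusive coordinates, which is consistent with the reported gene length. The helper names `gene_length` and `gc_percent` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between sequence blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the record above.
print(gene_length(38474, 41667))  # 3194

# First line of the gene sequence from the Sequence section; pass the
# full sequence instead to recompute the record's GC content.
first_line = "GATCCGGAAG TGCGCGCCGA GCGGGAGCGA GTGCGGGAAG AACAACGACA GGAAACGGAA"
print(round(gc_percent(first_line), 1))
```

The protein is 970 aa, which accounts for 2910 coding bases. That is fewer than the 3194 bp gene length, as expected for a spliced gene.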


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GATCCGGAAG TGCGCGCCGA GCGGGAGCGA GTGCGGGAAG AACAACGACA GGAAACGGAA 
GAACGACGCG CCAAGAAACG GCAAGCGGAA CAGGCTTTGG CCAACAAACA CCGGAAACTC
GACAAGGAAG AAGCTCGCAA GGCACAAGCG CGCTTGGAAT ATCTGTTGCA GCAGAGTTCC
ATTTTTGCGA AACTGCAGGG TGGATCCGGT GCCATTCCGC AGGGACCGGA CGAGGCCAAA
GACCAGGAGA CCGCGGCCGC CGCGGCAAAG TCGAAAAAAA AGCGCACGTC CACCGCGTCG
CCGAGCTCTA ACAAGGCACA CCATCGACAC GGGGCCGAGT CCAACGACGA CTCGAATGAA
GAGGAAGAAG CCGACGAAAC AGAAGTCGGA CACGTCTTTT TGACCAAGCA ACCCACTTCG
ATCAAGTTTG GTACCCTGAA GCCCTACCAG CTCGAAGCTT TGAACTGGAT GATTCATCTT
TCCGAAAAAG GGCTGAACGG GATTCTCGCT GACGAAATGG GTTTGGGGAA GACCTTGCAA
TCCATTTCCG TCCTTGCGTA TCACTGGGAA TTTCTACGCA TACAGGGACC ACATCTGATC
TGTGTCCCCA AATCTACGCT TTCCAATTGG ATGAACGAAC TTAAACGTTG GTGCCCGTCG
CTTCGGGCCA TCAAGTTCCA CGGCTCGCGG GAAGAAAGGG AATATATGAT TGACAACATG
TTTCACAACG AAGCAGCCAC GCACGATGGT CGTCGACCGG ATCGACAAAT TATGGACGGA
TCGGGCGAAT TGATCGACGA CAATACCGAC ACACCGCGTC CCTGGGATGT CTGCGTAACT
ACCTACGAAG TCGCCAACGC GGAACGCAAA ACTTTGCAGA AGTTCACGTG GAAATACCTC
GTAATCGACG AAGCCCACCG ACTCAAGAAC GATGCTTCCA TGTTCAGCAA GACGGTTCGG
TCGTTCCGGA CGTCGAATCG CTTGCTCTTG ACGGGGTACG TGCTGGTCGT TTAATTTTTG
GCGTTGCTGG AGTTGTAGAA GCGTGGCGTT ACGGTTTTCT CACACCTCTG TATTTTCACA
CCAATTTTTA CCTACAGCAC TCCGTTACAG AACAACCTCC ACGAGCTTTG GGCTTTGCTC
AATTTCTTGT TGCCCGATAT CTTTTCGTCG GCGGACCAAT TTGACGAATG GTTTGATCTG
GAAATTGACG ATGAAGAAGC CAAAAAGAAC ATGATTTCGC AGTTGCACAA GATTTTACGT
CCGTTCATGT TGCGTCGTCT GAAAGCTGAT GTGGCCAAAG GGTTGCCACC GAAAACGGAA
ACCATTCTCA TGGTTGGAAT GTCCAAAATC CAGAAACAGC TCTACAAAAA GCTCCTCTTG
CGTGATTTGG ATAGTATCAC GGGCAAAGTC TCGGGCAAGA ACAGAACCGC CGTTCTCAAC
ATTGTCATGC AGTTGCGGAA GTGCTGCGGT CACCCGTATT TGTTCGAAGG AGTCGAAGAC
CGGACTTTGG ATCCACTTGG AGAGCATTTG GTCGAGAATT GTGGAAAGTT GAGCATGGTC
GACAAGCTAC TCAAGCGATT GAAGAGCCGT GGGAGCCGTG TTTTGATCTT TACTCAAATG
ACGCGCGTGC TCGATATTCT TGAGGATTTT ATGGTTATGC GTGGATACCA ATATTGTCGC
ATTGACGGCA ACACCAATTA CGATGACCGC GAGAGCTCCA TTGACGAATT TAATCGGGAA
GGCACCGATA AGTTTTGTTT CTTGTTGTCG ACCCGTGCGG GAGGTCTCGG AATTAACCTG
CAGACAGCCG ACACGTGCAT CTTGTATGAT TCGGACTGGA ACCCACAACA AGACTTGCAA
GCCCAGGATC GCTGTCATCG CCTTGGACAA AAGAAACCAG TCAATGTGTT TCGTTTGGTC
TCTGAGAATA CTGTTGAAGA AAAGATTGTG GAACGCGCTC AGCAAAAGCT CAAGCTGGAC
GCAATGGTTG TGCAGCAAGG GCGACTGAAA GATCAAGACA AGGTGACCAA GGACGAAATC
ATGGCCGCCG TTCGATTTGG TGCGGATACG GTCTTTCGAT CCGAAGAGTC TACAATCACC
GATGACGATA TTGACGTGAT TTTGGAGCGT GGGTACGTGG TAGCAGGTCG GGTTGAGAGT
ACCACAAGCG GACGCGAAGG ATGCCGGAGA ATCTTCTGAC ACAGTTTCAT CTTTTCTTTT
CTTCGTCTTT CAGAGCGGCC AAGACCAAGG AGCTTGCCGA AAAGATCCAG ACACGGGACA
AAGGCGATCT TCTGGACTTT CGCCTGGACG GGGGAATATC AGCGCAGACG TTCGAGGGCG
TGGATTACAG TGACAAAGAT CTACGCGATC ATCTCCGCAT GCTAGCTGCA GATTCAATGG
GAAAGCGAGA ACGCCGACCG CCTCCCACGA GCTATAATCC TATCATCATA TCGAAGAAGT
CAATGGTGGT AAATAATCGC CGGATCAAAC TACCTAAATG TCTACGTATT CCACAAATGG
AAGACCATCA TTTTTACAAT CGCGAACGCC TCTTGGACTT GGGAAGGCTT GAGTTCGAGA
CTTACGCTGC GCTTAGAGAG GCTGGTGAGC TTCCACCGAA AGAGTTTATG GAACGGAAGA
GGACGTTGTT ACCCGACGAG CTGGGACAAG AAAAGCTAGA GCTCTTGGCT GAAGGATTTG
GGGACTGGAG TAGAAGTCAA TATTACGCCT TTGTCAAGGC AGCTGCGAAG TATGGGCGTG
ATGACATCAG TGGCATCGCC AACGAGTTGG ACATGCCAGA AGTCGAAATC GCTGCTTACA
GTAAATCATT TTGGGCCTAT GGACCGACTG AACTCGAAAG CGAGTGGGAA CGCCTCGTTG
GTAATATCGA CCGAGGGGAA AAGAAGCTGG CGAAGCAAAA GAAACTCAAG TCTCTCCTCG
CAAAGTTCGT TAACACTTTT GAAAACCCTA GAGATGATAT GGTCTTTGCT AATAAAGGAA
CCACTCCCTT TGCTCTAGAG CAGGATCGAG CACTGCTATC TGCCGTCGAC AAACACGGAT
ACGGTAATTG GGATTCCGTC CGCGAGGAGA TTCGCACTGA TGGACGTCTC AAATTTCAGC
ATTCAACCCA AGGTATGACA GTACAGGCAA TTGGGAAGCG CTGCGATTAC CGAATGAGGC
AAATGGAAAA GGAA
 
Protein sequence
DPEVRAERER VREEQRQETE ERRAKKRQAE QALANKHRKL DKEEARKAQA RLEYLLQQSS 
IFAKLQGQRP GDRGRRGKAH HRHGAESNDD SNEEEEADET EVGHVFLTKQ PTSIKFGTLK
PYQLEALNWM IHLSEKGLNG ILADEMGLGK TLQSISVLAY HWEFLRIQGP HLICVPKSTL
SNWMNELKRW CPSLRAIKFH GSREEREYMI DNMFHNEAAT HDGRRPDRQI MDGSGELIDD
NTDTPRPWDV CVTTYEVANA ERKTLQKFTW KYLVIDEAHR LKNDASMFSK TVRSFRTSNR
LLLTGTPLQN NLHELWALLN FLLPDIFSSA DQFDEWFDLE IDDEEAKKNM ISQLHKILRP
FMLRRLKADV AKGLPPKTET ILMVGMSKIQ KQLYKKLLLR DLDSITGKVS GKNRTAVLNI
VMQLRKCCGH PYLFEGVEDR TLDPLGEHLV ENCGKLSMVD KLLKRLKSRG SRVLIFTQMT
RVLDILEDFM VMRGYQYCRI DGNTNYDDRE SSIDEFNREG TDKFCFLLST RAGGLGINLQ
TADTCILYDS DWNPQQDLQA QDRCHRLGQK KPVNVFRLVS ENTVEEKIVE RAQQKLKLDA
MVVQQGRLKD QDKVTKDEIM AAVRFGADTV FRSEESTITD DDIDVILERG AAKTKELAEK
IQTRDKGDLL DFRLDGGISA QTFEGVDYSD KDLRDHLRML AADSMGKRER RPPPTSYNPI
IISKKSMVVN NRRIKLPKCL RIPQMEDHHF YNRERLLDLG RLEFETYAAL REAGELPPKE
FMERKRTLLP DELGQEKLEL LAEGFGDWSR SQYYAFVKAA AKYGRDDISG IANELDMPEV
EIAAYSKSFW AYGPTELESE WERLVGNIDR GEKKLAKQKK LKSLLAKFVN TFENPRDDMV
FANKGTTPFA LEQDRALLSA VDKHGYGNWD SVREEIRTDG RLKFQHSTQG MTVQAIGKRC
DYRMRQMEKE