Gene PHATRDRAFT_1785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_1785 
Symbol 
ID7196749 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp2143053 
End bp2145266 
Gene Length2214 bp 
Protein Length433 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177456 
Protein GI219111409 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.287483 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GGTATTTTCG AGTGCGATTA TTGTCATGCT GACATTTCCC AATTGCCGCG AATACGTTGT 
GCGGTGTGTG TCGATTTCGA TTTGTGCCTG GACTGCTTTA CTTCGACTGA CCATGCAACG
GCAATAGCCC GTCTAAAGGC GGCACCGGAC CAAAACGGTA TGGCAGCCTT GTCCGCCATT
CAACACGAGG CTACACACGG CTATCGTGTT TGTGATAGCA CGCGTTATCC ACTTTTTTCC
ACTGCTCGCA CCCGAGTTAA CAGATTGAAT ACCAATGTCG AGAAGAATTC CGGTATGAGC
AATGAGGAAC AAAGAGTTTC CGATATTAAT GTAGCATCGA GCGGGATTGA TGAAATAGGG
GATGAAATGG ATGTGGATGA AGCAGCGCTT GGTTCAAACG AGGAAATTAT GGAACGGGAA
CAAGATGCCA ACTCGATGGA CGTGACTGAG CTAGAGAGAA CCGCTGTTAA CGACGGCGTC
CTGACTGAAG AATTAAACTC TACCGATAAA TATATCGTTT ACGATGATCC AAAATTTTTT
TGGACTGTGG AAGAAGACTT ACGTTTGCTG GAAGGTATCC AGACGAACGG TTTGGGAAAT
TGGGTTGAGA TCGCAGAGGC AGTCGCAGGT CAAGGATCCA TTGGTAAGAC CCCTCGTCGC
TGCATGGAGC GCTACTTTGA CGATTTTCTT GGACGCTACG GCCACATTCT CCCGTCACAC
ACTTTACAAG CTGAAGGTGA GGATGAAGTG GAAGAATCCG ATGCTACAAA GTATAGTGTA
GAAGAATTTG ATAAGGGAGA TACCGACGAC ACTCCGTCGC GTACTTCCAA ACGTCGGGCG
GTGATGATGC GTAGCCCAAG CTCAATGTCA ACCATGGCGT TCACGAGCCG CAAGAAGTAC
AAGGCCATTC CAACCGAGAC TCTCGAAGGA TATGGCGAAT TTTGGGCCAA TCCATATTTA
CCGCCAATTG AGGGCGTGAG GACTGGTCAA GAAGTAGGAC GTGACCATGC TTACAAGGCA
GAACAGCTTT TCGTAAAAAT GAGCATGGCG ATGGATAGCA TTGAACAAGT TAAAAATTTA
CACAAAGAAT GGACTGAGAC TCGTCTTTTG AAACCCGGTG GTCCTACCGT CCTTCCTCCT
CGACCGGACG ATGTTGTTGG AATGACAGGT GCTGAACTCT CTGGCTTCAT GCCTAGACGG
GAAGATTTCG ATGTGGAATG GGAAAATGAT GCCGAGCAGG CCGTGGCAGA TATGGAATTT
CTTCCTGGCG AGCCGATAGA GGACAAGCAA CTTAAACTAC AGGTACTGGC AATCTACAAC
TCTAAGCTTG ATGAACGTGA GAAGCGCAAG AAATTCGTCC TCAGTAGAAA GCTATATGAT
TATCGGAAAA CCCAAACAGA ACACGAGAAG CTCCCACAAG ACGAACGTGA CCTTGTGCAT
CGAATGCGTC TGTTCGAGAG ATTTCATACG CCCGAGGAAC ACAAAGAATT TCTTGCGGAT
CTGCTCAAGG CGAAGCGCCT TCGCAAGGAG ATTGCAAAAC TGCAAATGTA CCGAAGACTT
GGCATCCGCA CATTGCTCGA AGCGGAAAAA TACGAATTAG ACAAAGAGCG CCGGCAGTTC
CACAAGACAG CCCACACACA GAAGAACACC GATGTCAGCA CACCAGATGA GAATACTGCC
GCAACCTCGG CGGAGGTAAG TGGCCGTTCA GGAATGTCTC AGTCCGTAGG AACTGTTTCA
TCTTCATATT GGAAGCAATA CCGCACGGGT GATCGTCGTG AGAGGAAGAG TATCAATCGG
GGCGTGCCCT GGGCAGACAG CCAAGAGACG AGCAATAATT TGAAAAATAA TTCGGCTGAT
GGGTCTAGCA AAGTAGTAGA CACTAGAAGA GATGATGGGG ATATGGATGC TGTGCAGCCA
GTAGAAGATA CAATTGCGAT ACAGGCCAAA CTCGAAGTTT CTTCGAGGGC AACCAAGGAA
GACGACTTTG CACATTTGCC TGGTTACAAT CTGCTTTCCT CTCGTGAAGT GTTGTTGTGC
CAACGCACAA GGCTAACGCC AGAACAATAT TTGGAGGTAA AGAACGTGCT GATTCAAGAG
TCACTGCTTA AGGGGCTTCT GGATAGGGAG GGTCCCGGAT CTAGCAAAAG AGCGTTGGTA
CGGATCGACG TAGAGCGACG GGGCGACGTA ATAGACTTTT TAGTTCGGGC CGGC
 
Protein sequence
GIFECDYCHA DISQLPRIRC AVCVDFDLCL DCFTSTDHAT AIAQLNSTDK YIVYDDPKFF 
WTVEEDLRLL EGIQTNGLGN WVEIAEAVAG QGSIGKTPRR CMERYFDDFL GRYGHILPSH
TLQAEGEDEV EESDATKYSV EEFDKGDTDD TPSRTSKRRA VMMRSPSSMS TMATGAELSG
FMPRREDFDV EWENDAEQAV ADMEFLPGEP IEDKQLKLQV LAIYNSKLDE REKRKKFVLS
RKLYDYRKTQ TEHEKLPQDE RDLVHRMRLF ERFHTPEEHK EFLADLLKAK RLRKEIAKLQ
MYRRLGIRTL LEAEKYELDK ERRQRDDGDM DAVQPVEDTI AIQAKLEVSS RATKEDDFAH
LPGYNLLSSR EVLLCQRTRL TPEQYLEVKN VLIQESLLKG LLDREGPGSS KRALVRIDVE
RRGDVIDFLV RAG