Gene PHATRDRAFT_33013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_33013 
Symbol 
ID7197020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp1312570 
End bp1314285 
Gene Length1716 bp 
Protein Length571 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177804 
Protein GI219112105 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0011992 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCTTG GTCGAAAGCA GCTGCCACGC CGGCGCAAAG CAACGGGTCC ATTGCGACTC 
GATTGGCCTG CCGTCGGCGC CTCTCAAGAT CTCGTCGGCA CCAAAGATCT TGTGTCGTTT
TCACTGGAAG AAAGTGCATT CGTTCCAGTG CGTCGGAAGC AAAAGCGACC ACGCAAGAAT
GCAGTCTCCC ATACCGGAGG TGACACTAGT TCACATGACA ATGCTCTTTG GATCGATAGG
TATAATCCTC AATGTATAGC CGATGTCTGT GTGGCGCCAA AAAAAGTTCA GGAAGTTCGT
CAATGGATCC AGTCTGCCAT GAAAGATCAC GTTCACAAAC TCTTAATATT GGTAGGGAGC
CCTGGAATTG GTAAGTCGAC AATGATCCGT TGTTTGGCGA AGGAAAATAG ATGGTCAATC
TCTGAATGGA ACGAAACGTT CTCGAATCAA TACAGTGCTT TGAACTCGGC AATGCACTCT
GTAGATCAAC AGTCTTCTCT GAGCTCGTTT CAGGAGTTTC TCCGGCAAGC AGGGACCGGC
TACCATTCTC TGACCTTCGA ATCGTCGTCA AATTCATCAA CAAAGCAAGA CGGATCTCAA
ATTTCGGGGT CCATCATCTT GCTTGAAAGC CTTCCGACGC AACACGAATC AACCCAGATG
CGATTAAGAG AGCTGTTTAC TGAACACGTC CGCACCACAT CCGTGCCAAC GGTTCTCATC
TTCAGCGATG TCTTGGAAGG GAAACACAAA CGAGAGGATC TGGAGTCCTT GGTGGACCCC
AATTTGCTGT ATTCAGACCT TTGCCTCATT CTACAAATTC AGCCTTGTAC TAAGCAAAAC
ATGAAAAGGG TTTTGTCGCT TATTGTCCGT GCAGAAAGGC TTTCGGTACC TTCCAGTATA
TACGAAGACC TTCACGAGCG GAGCAACGGG GACTTGCGCT CGGCAATCAC AACTTTTCAG
TACGAAGCCA TGGGGCAGTC GATGACCGTA AAGAACACAG ATACAACCAA CCGAGACCGC
AGACTTTCGC CATTTCACGC CTTGGGTAAG CTCCTTTACG CGAAGCGCGT CACTGGTGCA
CACAAGGATC CATTAAGTTG GTGGAAATGG AAGGATGATC GTCCACCAAT CGACTTTAAT
CCGGAAAACG TACTGGAGCA TAGTGGGATT GAACAATTTG GGACTCTATC GTTTCTCGAA
CATCACAGTC CGGATTTCTT TTCCGACATA TCGGAGCTGA GCGATACACT CGCGACTTTT
TCAGATTCGG CGTTATTGAT GGACTGTTCC TCTATTTCTG GCTCCCAGAA TGCTGCCGCC
TCGTTGGCTG GCCGTGCCGT CGCCGCCTTC AATCGACATC CACGCGCAAA CAAATTCAGG
CAACTTTCTG CTCCCAAAAT TTTCGAAGTC AATCGCAACC GTCGGGAAAA TGAAGTGCAC
CTTCGTCACC TACACCATTC TTTATCAACG AATCGTAGCA ATGAACTTTC TTTGCATTCG
GCCCTCGGAG CAACTTCGCA CTTCGTCTCC GACAGCCTTT CGTTTCTCCG ACGCATCATC
CCCGAGTCAA TAGATCTGTC TCTGAATACT ATGCACTCAC GATTCCGTCT TATAGACAAA
TCCATCTCGA GCAGCAATGA TGCGAAAACA GGATTGTTAA AGGAGCAGCA GCAAGTGCTT
CTGGACGACG ATATTGGTGA TTTTGATTCA GAATAA
 
Protein sequence
MKLGRKQLPR RRKATGPLRL DWPAVGASQD LVGTKDLVSF SLEESAFVPV RRKQKRPRKN 
AVSHTGGDTS SHDNALWIDR YNPQCIADVC VAPKKVQEVR QWIQSAMKDH VHKLLILVGS
PGIGKSTMIR CLAKENRWSI SEWNETFSNQ YSALNSAMHS VDQQSSLSSF QEFLRQAGTG
YHSLTFESSS NSSTKQDGSQ ISGSIILLES LPTQHESTQM RLRELFTEHV RTTSVPTVLI
FSDVLEGKHK REDLESLVDP NLLYSDLCLI LQIQPCTKQN MKRVLSLIVR AERLSVPSSI
YEDLHERSNG DLRSAITTFQ YEAMGQSMTV KNTDTTNRDR RLSPFHALGK LLYAKRVTGA
HKDPLSWWKW KDDRPPIDFN PENVLEHSGI EQFGTLSFLE HHSPDFFSDI SELSDTLATF
SDSALLMDCS SISGSQNAAA SLAGRAVAAF NRHPRANKFR QLSAPKIFEV NRNRRENEVH
LRHLHHSLST NRSNELSLHS ALGATSHFVS DSLSFLRRII PESIDLSLNT MHSRFRLIDK
SISSSNDAKT GLLKEQQQVL LDDDIGDFDS E