Gene PHATRDRAFT_45319 details


Gene Information

Locus tag: PHATRDRAFT_45319
Symbol:
ID: 7199963
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011674
Strand:
Start bp: 829056
End bp: 832318
Gene Length: 3263 bp
Protein Length: 1013 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002179302
Protein GI: 219117015
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.678444
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AAAAGTACTA TAGGATACTT CGTCCCGTTC TACCATAGCC TCGAGAGGAA GTTGAGTAGT 
GAATACCCTA TACCCGGTTG CGTCTACCGT CCTTGTTGCG CTTTTTCTAT CGCACAGCTT
TGGCAATCTC TAGAGAATAT ATCTTCTGTT GCGCCATGTC CGACAACGAA GAGGAGCTCT
ACGACGAGTT TGGGAATTAC ATCGGTCCCG ATCTGGATTC GTCCGACGAC GACGACGAGA
ATGCGTTGGT GCCTCCCGGG ACTGTCGCTC CGGACGACGC TTCGGATGTG TCCGGTGACG
ACCAAGACGA GAATGCTCTC GTGATGCGCG ACGACGAGAA CGTGATGACA ACAACGGCAG
CAACCACGGC CGATCCAATG CACGCCATTG TCTTGCACGA AGACAAGGAA CACTACGCGT
CGGCGGAACA AGTGTACGGT GACGACGTGC GCGTCGCCGT CTTAGATGAA GACGCCATGG
AACTCGAAAC ACCCATCGTG GAACCAGTAC TCACGAAATC GCATCACGCG GATAGCGACG
ATCGGGACAA ACAAAACGTC TTTGCGCCCG AGGATTGGTT GTACACGGAA GACTATCTCG
GCGTGCAATT GTCTAACGAA ACTACCCGCA CTCGGCGGTT AGTGGCCATC GTCGGACACT
TCCATCACGG AAAAACCTCA CTGGTGGACT TGCTGCTGGA ATCTACCTAT CGAGTCAAGA
AAAATCACAA GAATGCTGTC GTCGACGAAA GTCGCCAAGC GAATACACAG GCAGGACCGC
GGTATCTCGA TACGCTGCTG GCCGAACAAG CCCGTCAAAT GAGTCTCGTC AGTACGCCCT
TGACGACACT TTTGCCGGAT ACACGCGGAA AGACCTTCGC CATTTCCATG CTGGACTGTC
CCGGTCATGT GCAATTCCAT GACGAGTCGG TGGCGGCCTT GAAAGCGTCC GATGGTGCCG
TGGTGGTAGT GGACGTTGTG GAGGGAATCA TGATGCATAC GGAAATGGTA GTCCGACAAG
CCATATCCGA AGGACTTTCT CTGACCTTGG TCTTGAGTAA AATGGATCGT TTGATTGTGG
AATTGAAGTT GCCCCCACGG GATGCCTACT ACAAGTTGTT GCACATTGTG GATAGTCTCA
ACGAATTAGT AGGCATGGTG TCGAGGGGAC GCTATCCAAA AATCTCACCC GAACGAGGTA
ACGTGGCCTT TTGTTCCGCA CAACACGGCT ACTTGTTTAC GTTACCCAGT TTTGCGCAAG
TGTACATGGA ACACTTTGAT CGTTTGGGAG ACAACATTGC CGTCGATGGC TTTGCTCAGC
GTCTCTGGGG AGATGCCTAC CTGGATCCGG AAACGCGGAC GTTTCACCGG TCGTCACGCG
ACTGCTTGAC TCCGAACGTG GAACGCACAT TCTGCGTGTA CGTACTGGAG CCACTCTACA
AGATATACAG CGCCTGTCTG GGGGAGCGCG AACCCGACGT CAATGCCCTT TTGCGCGGTG
TCGGCGTTCT CCTTCACAAG GACGAATTGC GAGCCAATTC GACGGTCCTT TTGAAAGCGG
CATTGTCGCG GTTTTTGCAA ACCGCCAACC ACGGGTTTGT TGATATGCTC ACGCAACACG
TACCCTGTCC GGCCGTGGCC GCTGCCGGAA AAATCGCTCG GTGCTACACT GGCCCCCTGC
TCGACGATGA TGCGGACACT GCGGATTCGA AACAAAGGCT GGTGCAGGCC ATGCGCAACT
GCGACCCCCA CGGACCACTG ATCATTCACG TCGTCAAACT GTACGCCTCC CGGGATGGGC
AATCGTTTCA AGCGCTCGGA AGAGTTTATT CAGGAACGGT TCGACCGGCA ACTCCCGTCA
AGGTCCTCGG CGAGGCGTAC GTCCCGAATG TAGACGACGA AGATGTCGGT ACGGCGACGG
TGGAGAACGT TGCGATACCT CGGGGACGCT TTCATACAAG CATAAGCCTG GTCAAGGCGG
GGAACTGGGT TTTGCTGGAA GGCGTGGACG CCACCATTGC CAAGACAGCG ACCATCGTAG
GTTTGGAGTG CCCCGAGAAT GTGCACATTT TTGCCCCGCT CAAATTTCCA CATACGGGAG
GAGAGTCTGT GATGAAGCTT GCGATCGAAC CGCTAAATCC GGCAGAGTTG CCCAAAATGG
TGGAAGGACT TCGGCGGGTC TCCAAAGCGT ATCCCATGGT TCAGACCAAA GTGGAGGAAA
GCGGCGAGCA TGTTTTGCTG GGTACTGGAG AGCTGTATTT AGATTGCGTC ATGTACGATC
TTCGTCAAGT GTATTCGGAC ATTGAAGTCA AAGTTGCGGA TCCGATTGTT TCGTTTCGAG
AAACGGTGAT TGAAACGAGC AGTATCAAAT GTTTTGCGGA GACTACCAAC AAAAGAAACA
AATTAACATT TCTTGCTGAG CCTTTGGATG ACGGTCTGGC GGAAAATCTA GAGGCGGGCA
AGGTCAAGAC GCAGTGGGAC CAGAAGAAAC TGGGTCGCTT TTTTCAAGTA AATTACAATT
GGGATTTGTT GTCGTCACGC TCAGTTTGGG CGTTTGGGGA CTCGCCGACA CACGGAACCA
ACATACTCAT GGACGACACT TTACCTAGTG AGGTCGATAC ATCTCTTTTG AAAACATGCA
AATCCAGTAT TGTCCAAGGC TTTCAATGGG CCACTCGGGA AGGGCCGCTT TGCGAGGAAC
CCGTACGAGG TACGAAGATC AAAATCCTGG ATTGTGTCCT CGCCGATAAG GCTATCCATC
GAGGTGGGGG TCAGGTCATT CCTACAGCTC GCAAAACTGT ACATTCTTCG TTGCTGACCG
CCACGCCTCG ATTGATGGAA CCAGTGTATC GTTTACAGAT ACAATGTCCC GGTGCAATTG
TTGATGCGAT TCAACCCCTG CTGACGCGTC GCAGAGGCCA CATGGTGCAA GATCGACCGG
TTTCGGGCTC GACGCATTGC ATCGTCAAAG CTTACATACC GGTACTAGAC AGTTTTGGAT
TCGAAACGGA TCTTCGTACC TTTACTCAAG GTCAGGCAAT GGTCTTCTCC GTTTTTGACC
ACTGGTCGGT GGTGCCCGGC GATCCACTCG ACCGGAGCAT TATTTTGCAT CCGTTGGAGC
CTAGTCCGGC GCAGCATTTG GCTCGAGAAC TGTTAATTAA GACCCGTCGA CGGAAAGGTC
TGTCCGAAGA TGTTCCTGTG AGCAAGTTTT TCGACGAAAG CATGAAGGCG CAATTAGAGC
AAGTGAATGC CGTGCTACAA TAA
 
Protein sequence
MSDNEEELYD EFGNYIGPDL DSSDDDDENA LVPPGTVAPD DASDVSGDDQ DENALVMRDD 
ENVMTTTAAT TADPMHAIVL HEDKEHYASA EQVYGDDVRV AVLDEDAMEL ETPIVEPVLT
KSHHADSDDR DKQNTISACN LAIVGHFHHG KTSLVDLLLE STYRVKKNHK NAVVDESRQA
NTQAGPRYLD TLLAEQARQM SLVSTPLTTL LPDTRGKTFA ISMLDCPGHV QFHDESVAAL
KASDGAVVVV DVVEGIMMHT EMVVRQAISE GLSLTLVLSK MDRLIVELKL PPRDAYYKLL
HIVDSLNELV GMVSRGRYPK ISPERGNVAF CSAQHGYLFT LPSFAQVYME HFDRLGDNIA
VDGFAQRLWG DAYLDPETRT FHRSSRDCLT PNVERTFCVY VLEPLYKIYS ACLGEREPDV
NALLRGVGVL LHKDELRANS TVLLKAALSR FLQTANHGFV DMLTQHVPCP AVAAAGKIAR
CYTGPLLDDD ADTADSKQRL VQAMRNCDPH GPLIIHVVKL YASRDGQSFQ ALGRVYSGTV
RPATPVKVLG EAYVPNVDDE DVGTATVENV AIPRGRFHTS ISLVKAGNWV LLEGVDATIA
KTATIVGLEC PENVHIFAPL KFPHTGGESV MKLAIEPLNP AELPKMVEGL RRVSKAYPMV
QTKVEESGEH VLLGTGELYL DCVMYDLRQV YSDIEVKVAD PIVSFRETVI ETSSIKCFAE
TTNKRNKLTF LAEPLDDGLA ENLEAGKVKT QWDQKKLGRF FQVNYNWDLL SSRSVWAFGD
SPTHGTNILM DDTLPSEVDT SLLKTCKSSI VQGFQWATRE GPLCEEPVRG TKIKILDCVL
ADKAIHRGGG QVIPTARKTV HSSLLTATPR LMEPVYRLQI QCPGAIVDAI QPLLTRRRGH
MVQDRPVSGS THCIVKAYIP VLDSFGFETD LRTFTQGQAM VFSVFDHWSV VPGDPLDRSI
ILHPLEPSPA QHLARELLIK TRRRKGLSED VPVSKFFDES MKAQLEQVNA VLQ
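
The sequences above are printed in space-separated groups of 10 characters, so they need to be joined before any downstream use. The following is a minimal sketch of how that cleanup and a few sanity checks against the reported fields might look. The function names (clean_sequence, span_length, gc_percent) are hypothetical and not part of any database tool, and the length check assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the reported gene length of 3263 bp.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in space-separated groups, across lines."""
    # str.split() with no arguments drops spaces and newlines alike.
    return "".join(block.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end_bp - start_bp + 1


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# Demo input: the first line of the gene sequence as printed in this record.
first_line = "AAAAGTACTA TAGGATACTT CGTCCCGTTC TACCATAGCC TCGAGAGGAA GTTGAGTAGT"
dna = clean_sequence(first_line)

print(len(dna))                      # 60 bases on one printed line
print(span_length(829056, 832318))   # 3263, matching the Gene Length field
print(round(gc_percent(dna), 1))     # GC percentage of this one line only
```

Passing the full gene sequence block to clean_sequence and then to gc_percent would allow a comparison with the GC content field, and the cleaned protein block can be checked against the Protein Length field in the same way.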