Gene PHATRDRAFT_44582 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44582 
Symbol 
ID7198088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp966859 
End bp968680 
Gene Length1822 bp 
Protein Length501 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178612 
Protein GI219115633 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.370022 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCAGACGTTT CTATACTTGT GGGTAGGGCC AACTGTACAT GCTGGCCTCT CCATATATCT 
CAACTTCTTG AGGCTACTTT CCAAAGCCGG CCCACCGTGG CTAGTGAAAC AATTCAGAAT
TTTTCGAACC GAAAGGACAG ATCTCATACT TTCGACCTTC AGACTTTTCA ATACGGTACC
CTGCCGTCGA CACCCACTTT TATTGTTCTG ATCCATCTGG CATCATGAAT TTGACGGAAT
GTGATGAAAG GGGGGCAGAC TGTAAGGATA GACGGGCCTC GGATGCGTCA AAATCTTCTC
TTGGAGCAGT ATCAATGGAC GATGAAAATC CGTTAATGAC TCTTGTAGAT GCTGCCGCTT
CAATACTAGG AACCAAAGTA GCCCGACGAG AAGTATCTGT TCCAGGCTCT CCTTTGGGGC
CGCAAATTAA TGTTTGTAGT GATGCAAAGG CAAGCGGTTT AACTGTAAAA GAAAGCCCCA
ACGTCATTCC CGGAACAAAG GAGGATTTGT TTGGCAGTTC GTCAATAAAG CTGACATTCG
CAGAACAGCT CTTTGATATT TTGCAGAACG AAGAAAACCA TGATGTTCTC CAATGGATGC
CTGATGGATG CTCTTTTATA ATCGTGAACC ATAAGAAGTT TATTCTCGAC AAAATGCCTA
AGTTGTTCAA CATTCGCAAC ATGTCATCGT TTGTGAGAAA ATTGGGACGG TGGGGGTTCA
GCCGCGTTCA TGAGAAAGCG ACGAGAAATT CCGATATTTT TAGACACCCC TTTTTTGTCA
GGGAAATGCG CGAGGAGTGT CGGAAGAAGG TTAAATGTAT CGGCCGGATT CCGTCCTCTT
CAAATTCAAA GCCTTCGGTT GGTCAGGTGA ACGGTGTTCC CTACAAGCAA CATTTATATT
CTGTTCTACA CGATAGGCAT TTAGATGATG TCTGCCCACG GTATGGAGAC AGATCCGATG
TGCCTCGTTC CACGTCTCAA CCTATGTCGA ATCTAAGCGA AGGGTCACGT TCGTCCTTGT
ACCGTGACGA TCTGTTGCCA TCACAAGGAG TCCATCTTAC CCAAAGTTCC CCAAATAGAG
TGACTTTTCT TGACGAGCAT CTGCATCGTG TACTTCCAGA GCTACCCTTT TCCAACAAAT
CTTTGTTGCC TTCCGGATTA CCGGCAAACC TTCCTTTGTT CAACAAATCC AAGAACTCTG
ATCTTCAGTA CCGAGGCAAG GAAGGTGTGT TCGTAAAGGC ATCAGCTACC TCTGAGACCT
CTGCAGCGGC CCTGTTCTCA CAGTATGAGA AACAGCTTCA AGAACACCAA ATCAAACGAT
CCTCCTTAGC GAGCCAGCTC TCGCAGCAAT CAGCTTACGA AGAAGCGAGG TGGTTGTCTG
AGCTTGATCA TCAGCTTGCT GAACAGCAAG TTGCTCTAGA GCAGCGGAGA GTGGCTTTGG
AACAACAAAG AATAATAAAG CAACGGCAGG TCCTAATGGA GCAAAGACAA GCAATGGAAA
AGCGCTTTGG TGGCCAGGTT TTCGATCCGA GATACTCTCA AGCCACGAAA GGTACAGGGT
CACTACAGGA GGAGTCTAGA GGGAACGACA ACCTGACATC ATCAATAGCT GACAATCTTC
AACGTGGCAG AGATGGGGTT TGTTTTACAC CGAACATGAG TAGAAAGGAA GCCATCCGCG
CTCTTCTTTG GGAAGAGCGC GAACTAGGGT TCACTGGAGG TCGAAGATGA AAGGCCTTTA
GTCAGACCCG AATTCATAGA GTTTGTTTAT ACTTGAATGT AATACAGCTA CCAGTAAAAG
GTAATTTAAC TAAGAGGGGC GC
 
Protein sequence
MNLTECDERG ADCKDRRASD ASKSSLGAVS MDDENPLMTL VDAAASILGT KVARREVSVP 
GSPLGPQINV CSDAKASGLT VKESPNVIPG TKEDLFGSSS IKLTFAEQLF DILQNEENHD
VLQWMPDGCS FIIVNHKKFI LDKMPKLFNI RNMSSFVRKL GRWGFSRVHE KATRNSDIFR
HPFFVREMRE ECRKKVKCIG RIPSSSNSKP SVGQVNGVPY KQHLYSVLHD RHLDDVCPRY
GDRSDVPRST SQPMSNLSEG SRSSLYRDDL LPSQGVHLTQ SSPNRVTFLD EHLHRVLPEL
PFSNKSLLPS GLPANLPLFN KSKNSDLQYR GKEGVFVKAS ATSETSAAAL FSQYEKQLQE
HQIKRSSLAS QLSQQSAYEE ARWLSELDHQ LAEQQVALEQ RRVALEQQRI IKQRQVLMEQ
RQAMEKRFGG QVFDPRYSQA TKGTGSLQEE SRGNDNLTSS IADNLQRGRD GVCFTPNMSR
KEAIRALLWE ERELGFTGGR R