Gene PHATRDRAFT_52461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_52461 
Symbol 
ID7195117 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011687 
Strand
Start bp318781 
End bp321769 
Gene Length2989 bp 
Protein Length979 aa 
Translation table 
GC content61% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183465 
Protein GI219126439 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000189103 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGACCT CGGCTCATTT CAAACTGAGC GACTTTCCTC ACAAAGTCCT CGACCCGATC 
GCCACCCTCA CCGTCCCACC CACCTACGCG ACCATCAAGC GTGCCCAACG CCAGCTCATG
ACTAACGCCG CCGCCATTCC CACACTCAAC GGAGGTGGCG CCCACGGCCA TATGGCCTTG
ACCCTGACCG CCCTTGCCTA CGCCGACATC AGCGACGTCC CGTTCGTCAT TCCCGTCGCC
CCTCCGGCCA ATCCGCCTCC CGGCGCCACG CAACCGCAAA TCACCGAAAA CAACCGCATT
CATCAACGCG ATGCTGACAT CTACAACCTT TATGTCGCCG TCAACAACGC GCTTCGGCAG
CAACTTCTCG ACGCGGTTCC CCGCATTTAT GTCCGCGCCC TCGCCCATCC CATGTTCGAG
TTTAGCAACG TCACGTGCCT CGACTTGCTC TCGCACCTCT GGACCAAATA CGGCACCATC
AAGCCCGCCG AGCTCCAGAA AAATTTCCAG TCCATGTACA CCCCTTGGAA CACAACCGAG
CCGCTCGAAT CCGTTTTTCT TCAGCTCGAC GAGGCCATCG CTTTCTCCAT TGACGGTAAC
GACCCCATCT CGGAAGCTGC GGCTGTTCGC GCAGGCTACG AAGTCATTGC GCACTCGGGC
CTGCTCCCCC TGGACTGCAA AGAATGGCGC AAATTGCCTA CTGCTGCTCA CACACTTGCC
CATTTCCAGC AGCACTTTTC CCTTGCCGAC GACGACCGGC GCCTCACGGC CACCACCGGT
TCCCTTGGAT ACGCCAATGT GCTTGCTGCT GCCCCCTCTC TTGCTCCTGC CACAACCTCC
GACACTCTCA GCCTTCCTTT CTCCGCGCTC TCTGTGTCCC AAACTTCTGT CTCTTCGCCG
GACATGACCT ATTGCTGGAC CCATGGTACC AGCAAAAACC GGCGCCATAC GAGCGCCACG
TGCAAGAACA AGGCCCCTGG CCATCGCGAC GACGCGACCG CCACCAACAC TCTCGGCGGC
TCCACCAAGG TTTGGACCGC TCCCAAGCCC CCTGAATAGG AAAGAGGGAC GGCTACGCCG
ATGGTTAACT CTAGTAATAC CGATTCTTTA AATCATATTA CTCGTCTTAA TTCATCTGTA
GTCCCCTCCC CGCCTAGTCC CCATACCTCG GCCATTGCTG ACACCGGTTG CACCGGCCAT
TACATCACCG TCAACTGCCC CCACACCCAC AAACGTCCTG CAAGCCCCAG CCTTGCCGTC
CGTGTCCCTA ACGGCGCCGT CCTCCGCTCA AGCCACATTG CCACCCTAGC CCTCCCTGGC
TTCTCCCCTT CTGCTTGCCA GGCCCACATC TTCCCCGGGC TTACCTCGCA CCCACTCATT
TCGATTGGAC AACTTTGTGA CGACGGCTGC ACTGCCACTT TCTCAGCCAC TCGCCTCGAG
ATCCACCGCG ACACTACACT ACTCCTCTCC GGCACTCGTG CACCCACTAC CGGCCTCTGG
CACCTTGATC TTACCCCTGC CAAGCCTCCT GCCACAGCCC ACGCTCTAGT TCCCAACACT
CCCCTCGCTG ACCGCATCGC TTTTGTTCAT GCCTCGCTCT TCTCCCCGGC TATCTCCACA
TGGTGCCAGG CCCTCGACTC CGGCCATCTT GCAACCTTTC CTGCACTTTC CTCCCGCCAG
GTCCGCAAGT ATCCACCTCA TTCCCCCGCC ATGGTCAAAG GCCACCTCGA CCAACAACGC
GCAAACCTTC GCTCCACCAA GCTTCCCCCT GTAGGTTCCC CCATCACGAC GGAACCCCCT
GCCGCCGCTG TGCCCGACCT TGACCCTCCC GACGCCCACC CCGTCACACG CACACACCAT
GTCTTTGTTG CCCACCAACG GGTTACCGGT CAGATCTACA CGGACCAACC GGGCCGCTTC
CTCACTCCCT CCAGTGCCGG CCACAACGAT ATGCTTGTTC TTTATGATTA CGATAGCAAT
GCTATCCACG TCGAACTCAT GAAGAACAAG TCCGGCCCCG AGATTCTAGC AGCCTATAAG
CGCGCTCATG CTCTTCTCAC CCAGCGCGGC CTTCGTCCCC AACTTCAGCG TCTTGACAAC
GAAGCCTCTG CAGCCCTCCA GTCCTTCATG TCCTCCGAGC ACGTGGACTT TCAGCTAGCA
CCCCCTCATC TACACCGTCG TAATGCCGCC GAACGGGCCA TACGCACCTT CAAGAACCAC
TTCATTGCTG GCCTCTGTAC CACAAACCCG GATTTTCCCC TTCATCTTTG GGACCGACTC
CTCCCACAGG CCCTCATTAC CCTCAATCTT CTTCGTCGCT CCCGCATCAA TCCCAAGTTG
TCCGCCCACG CACAACTTCA CGGTGCCTTT GACTACAACC GCACCCCGCT TGCTCCTCCA
GGCACCCGCG TCTTAGTCCA TGTCAAGCCC GCTGTTCGCG AAACCTGGGC CCCCCATGCT
GTCGAAGGTT GGTATCTCGG CCCCGCTCTC AACCATTATC GCTGCCATCG CGTATGGATC
ACGGAAACAC GTGCCGAACG TGTTGCCGAC ACCCTTTCCT GGTTCCCGAC CCGCATTCCC
ATGCCCGCCG CTTCGTCCAC CGACCGCGCC CTGGCCGCCG CCCGTGACCT GGTCCATGCC
CTCCAGAATC CTTCCCCGGC GTCTCCGTTC GCCCCCCTCG ATGCCACCCA GCACCAGGCA
CTCACAGATC TTGCCACCCT CTTTGCCACT GTGGCCGCCC CAGCCGACGA CGTCCCTGCA
CCCGCTCCCG TGCCTCCGGT CCGTCCCCCT GCCCCAACAA CTCCCCTTGC TCAGGTCCGC
TTTGCCGTTC CTCTTGTCAC GGCCAAACAT GCCCCGGCAC TTCCGAGGGT GCCCATTCCG
GCCCCAGCCC TTCCGAGGGT GCCCACCCTG GCCACCTATC ACTCTCGCAC CGGCAACCCC
GGCCGTCGCC GCCGCAAAGC ACGCACACAA CCGGCAACCC CAACCCTAG
 
Protein sequence
MSTSAHFKLS DFPHKVLDPI ATLTVPPTYA TIKRAQRQLM TNAAAIPTLN GGGAHGHMAL 
TLTALAYADI SDVPFVIPVA PPANPPPGAT QPQITENNRI HQRDADIYNL YVAVNNALRQ
QLLDAVPRIY VRALAHPMFE FSNVTCLDLL SHLWTKYGTI KPAELQKNFQ SMYTPWNTTE
PLESVFLQLD EAIAFSIDGN DPISEAAAVR AGYEVIAHSG LLPLDCKEWR KLPTAAHTLA
HFQQHFSLAD DDRRLTATTG SLGYANVLAA APSLAPATTS DTLSLPFSAL SVSQTSVSSP
DMTYCWTHGT SKNRRHTSAT CKNKAPGHRD DATATNTLGG STKERGTATP MVNSSNTDSL
NHITRLNSSV VPSPPSPHTS AIADTGCTGH YITVNCPHTH KRPASPSLAV RVPNGAVLRS
SHIATLALPG FSPSACQAHI FPGLTSHPLI SIGQLCDDGC TATFSATRLE IHRDTTLLLS
GTRAPTTGLW HLDLTPAKPP ATAHALVPNT PLADRIAFVH ASLFSPAIST WCQALDSGHL
ATFPALSSRQ VRKYPPHSPA MVKGHLDQQR ANLRSTKLPP VGSPITTEPP AAAVPDLDPP
DAHPVTRTHH VFVAHQRVTG QIYTDQPGRF LTPSSAGHND MLVLYDYDSN AIHVELMKNK
SGPEILAAYK RAHALLTQRG LRPQLQRLDN EASAALQSFM SSEHVDFQLA PPHLHRRNAA
ERAIRTFKNH FIAGLCTTNP DFPLHLWDRL LPQALITLNL LRRSRINPKL SAHAQLHGAF
DYNRTPLAPP GTRVLVHVKP AVRETWAPHA VEGWYLGPAL NHYRCHRVWI TETRAERVAD
TLSWFPTRIP MPAASSTDRA LAAARDLVHA LQNPSPASPF APLDATQHQA LTDLATLFAT
VAAPADDVPA PAPVPPVRPP APTTPLAQVR FAVPLVTAKH APALPRPFRG CPPWPPITLA
PATPAVAAAK HAHNRQPQP