Gene PHATRDRAFT_34728 details


Gene Information

Locus tag: PHATRDRAFT_34728
Symbol:
ID: 7200186
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011674
Strand:
Start bp: 247328
End bp: 250409
Gene Length: 3082 bp
Protein Length: 951 aa
Translation table:
GC content: 60%
IMG OID:
Product: predicted protein
Protein accession: XP_002179164
Protein GI: 219116739
COG category:
COG ID:
TIGRFAM ID:
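
The listed gene length can be checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state but which matches the listed 3082 bp. The function names are hypothetical and chosen for illustration only. The second function compares the gene span with the length needed to encode the protein; reading the surplus as intronic sequence is an assumption based on the "Is gene spliced: Yes" entry.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


span = gene_length(247328, 250409)
cds = coding_length(951)

print(span)        # 3082, as listed under Gene Length
print(cds)         # 2856
print(span - cds)  # 226 bases not accounted for by the coding sequence
```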


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.148187
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGACCT CCGCTCATTT CAAACTGAGC GACTTTCCTC ACAAAGTCCT CGATCCGATT 
GCCACTCTCA CCGTTCCCCC GACCTACGCG ACCATCAAGC ATGCTCAACG CCAGCTCATG
ACCAACGCCG CCGCCATTCC CACGCTCAAT GGTGGTGGCG CCCATGGCCA TATGGCCTTG
ACCCTCACCC CCCTTGCCTA CGCCGACATC AGCAACGTCC CGTTCGTCAT TCCCGTCGCC
CCTCCGGCCA ATCCGCCCCC CGGTGCCACG CAACCGCAAA TCACCGAAAA CAACCGTGTC
CATCAACGCG ACGCTGACAT CTATAACTTG TATGTTGCCG TCAACAATGC CCTCCGCCAG
CAGCTTCTCG ATGCGATTCC TCGCATCTAT GTGCGCGCCC TTGCGCACCC GATGTTCGAG
TTCAGCAATG TTACCTGCCT TGATTTACTC TCGCATCTCT GGACAAAGTA CGGCACCATC
AAGCCCGCCG AACTTCAGAA AAATTTCCAG TCCATGTACA CCCCCTGGAA CACCACTGAA
CCGATCGAGT CCGTGTTTCT CCAGCTGGAC GAGGCCATTG CGTTCTCCAC CGATGGCAAT
GACCCCATCT CGGAGGCTGC TGCAGTTCGA GCCGGCTACG AAGTCATTGC GCACTCTGGC
CTGCTCCCTC TTGACTGCAA AGAATGGCGC AAACTGCCTC TTGCTTCTCA CACCCTTGCA
AATTTTCAGC AGCACTTCTC CCTTGCCGAC GACGACCGGC GCCTTACGGC CACTACCGGT
TCACTTGGCT ATGCCAACGT TCTCGCTGCT ACTCCCTCTC TGGCTCCAGC CACGGTTTCC
GACACCCTCA GCCTGCCTTT CTCCGCGCTC TCTGTGTCCC AGCCTTCTGT CTCCTCCCCG
GACATGACCT ATTGCTGGCC CCATGGGACC AGCAAGAACA GGCGCCACAC CAGTGCCACT
TGCAAGAACA AGGCCCCTGG TCATCGCGAC GACGCGACGG CCACCAACAC CCTTGGCGGC
TCCACCAAGG TTTGGACTGC CCCCAAACCT CCCGAATAGG AAAGAGGGAC GGCTACGCCG
ACGATTAAAA CTAGTAATAC CGATTATTTA AATCATATTA CTAGTCTTAA CTCGTCTGTA
GCCCCCTCCC CGCCTAGTCC ACACACCTCA GCCATTGCCG ACACTGGCTG CACTGGCCAC
TACATCACGG TCAACTGCCC TCATACCCAC AGGCACCCAG CCAACCCCAG CCTAGCAGTC
CGTGTCCCGA ACGGCGCAGT CCTCCGATCG AGTCATGTTG CCACCCTGGC CCTTCCTGGT
TTCTCCCCTG CCGCCTGCCA AGCACATATT TTTCCTGGGC TTGCCTCCCA TCCGCTCCTC
TCTATTGGAC AACTGTGCGA CGACGGTTGC ACGGCAACCT TCTCGGCCAC TCGGCTCGAC
ATTCATCGTG ACGCTACACT GCTGCTCTCT GGTGCCCGCT CCCCCCACAC TGGTCTCTGG
CACCTCGATC TTGCCCCCGC TCCCTCTCCT GCGACGGCCC ACGCCCTTGT TCCCCACACA
CCCCTTGCCG ACCGCATTGC TTTTGTCCAT GCCTCGCTCT TCTCCCCGGC ACTTTCAACG
TGGTGTCAGG CACTCGATTC CGGCCATCTT ACCACTTTTC CCGACATTTC CTCCCGACAA
GTCCGCAAAT ATCCACCCAG CTCCTCCGCC ATGGTCAAGG GTCACCTCGA CCAACAACGC
GCAAACCTTC GCTCCACCAA GCTTCCCCCT GTTGGTTCCC CCACCACGAC TGCACCCCCT
GCCCGCTCTG TACCCGACCT TGATCCTCCC AATGCCCCAC CAGTCGCACG TACGCACCAC
GTCTTTGCTG CTCATCAGCG CGTCACCGGA CAAATCTACA CCGACCAACC AGGCCGTTTC
CTTACTCCTT CAAGTGCCGG CCATAACGAC ATGCTCGTAC TGTATGATTA CGACAGCAAC
GCCATCCACG TTGAACTCAT GAAGAACAAG TCTGGCCCCG AAATTCTCGC CGCCTATAAA
CGCGCTCATG CTCTTTTCAC CCAGCGAGGC CTCCGTCCAC AACTCCAGCG CCTCGACAAC
GAAGCCTCTG CAGCCCTCCA GTCCTTCATG ACCTCAGAGC ACGTCGACTT TCAGCTGGCA
CCCCCCCATC TACACCGTCG TAATGCCGCC GAACGGGCCA TCCGCACCTT CAAGAACCAT
TTCATTGCTG GCCTCTGCAC CACGAACCCG GATTTTCCCC TGCATCTTTG GGACCGCCTC
CTCCCCCAGG CCCTCATCAC CCTAAATCTT CTTCGTCGCT CCCGCATCAA CCCCAAGTTG
TCCGCCCACG CACAGCTTCA CGGTGCCTTT GACTACAACC GAACCCCGCT TGCTCCTCCC
GGCACTCGCG TCTTGGTCCA TGTCAAGCCG TCCGCTCGCG AAACATGGGC CCCCCATGCT
GTTGAAGGTT GGTATCTCGG CCCCGCTTTG AACCATTATC GCTGCCATCG CGTATGGATC
ACAGAAACAC GAGCCGAACG TGTTGCTGAC ACCCTTTCTT GGTTCCCGAC CCGCCTCTCC
ATGCCTTCCG CCTCCTCCAC CGACCGAGCC CTGGCCGCCG CCCGTGATCT TGTCCATGCG
CTCCAAAATC CCTCCCCCGC CTCCCCGTTT GCGCCCCTCA ACGCCCACCA GCACCAGGCC
CTCACACACC TTGCCGATCT CTTTGCCACG GTGGCCGCCC CGGCCGACGA CGCCCCCGCA
CCTGCTCCCG TGCCTCCGGT CCGTCCTCCT ACCCCAGCAC TTCCCCCAGC TCAGGTCCGT
TTTGCCGTCC CTCTCGTCAC GGCCGAACAT GCCCCAGCAC TTCCGAGGGT GCCCGTTCCT
ACCGCCGCAC TTCCGAGGGT GCCCCCCATG GCTACCTATC ACTCGCGCAC CGGTAACCCC
GGCCGTCGCC GCCGCAAAGC ACGCAAACAA CCGGCAACCC CAACCCTAGT TCCGGCGCAT
CCACACAACA CCCGCACCCG ACCCTTTCTT GTCCCGGCCT CCGCCAACGC AGTTGTCGAC
CCCGCAACCG GCGCCTCCTT AG
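
The gene sequence above is printed in space-separated groups of 10 bases, 60 per line, so it has to be joined before its length or GC content can be compared with the values in the Gene Information block. The following is a minimal sketch of that step. The helper names `clean_sequence` and `gc_content` are hypothetical, and the GC definition used here (G plus C as a percentage of all bases) is an assumption, since the record does not say how its 60% figure was computed.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = (
    "ATGTCGACCT CCGCTCATTT CAAACTGAGC GACTTTCCTC ACAAAGTCCT CGATCCGATT"
)

seq = clean_sequence(first_line)
print(len(seq))                    # 60 bases on a full line
print(round(gc_content(seq), 1))   # GC percentage of this line only
```

Applied to the full block, `len(clean_sequence(...))` should match the listed gene length if the sequence was copied without loss.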
 
Protein sequence
MSTSAHFKLS DFPHKVLDPI ATLTVPPTYA TIKHAQRQLM TNAAAIPTLN GGGAHGHMAL 
TLTPLAYADI SNVPFVIPVA PPANPPPGAT QPQITENNRV HQRDADIYNL YVAVNNALRQ
QLLDAIPRIY VRALAHPMFE FSNVTCLDLL SHLWTKYGTI KPAELQKNFQ SMYTPWNTTE
PIESVFLQLD EAIAFSTDGN DPISEAAAVR AGYEVIAHSG LLPLDCKEWR KLPLASHTLA
NFQQHFSLAD DDRRLTATTG SLGYANVLAA TPSLAPATVS DTLSLPFSAL SVSQPSVSSP
DMTYCWPHGT SKNRRHTSAT CKNKAPGHRD DATATNTLGG STKPPPRLVH TPQPLPTLAA
LATTSRHPAN PSLAVRVPNG AVLRSSHVAT LALPGFSPAA CQAHIFPGLA SHPLLSIGQL
CDDGCTATFS ATRLDIHRDA TLLLSGARSP HTGLWHLDLA PAPSPATAHA LVPHTPLADR
IAFVHASLFS PALSTWCQAL DSGHLTTFPD ISSRQVRKYP PSSSAMVKGH LDQQRANLRS
TKLPPVGSPT TTAPPARSVP DLDPPNAPPV ARTHHVFAAH QRVTGQIYTD QPGRFLTPSS
AGHNDMLVLY DYDSNAIHVE LMKNKSGPEI LAAYKRAHAL FTQRGLRPQL QRLDNEASAA
LQSFMTSEHV DFQLAPPHLH RRNAAERAIR TFKNHFIAGL CTTNPDFPLH LWDRLLPQAL
ITLNLLRRSR INPKLSAHAQ LHGAFDYNRT PLAPPGTRVL VHVKPSARET WAPHAVEGWY
LGPALNHYRC HRVWITETRA ERVADTLSWF PTRLSMPSAS STDRALAAAR DLVHALQNPS
PASPFAPLNA HQHQALTHLA DLFATVAAPA DDAPAPAPVP PVRPPTPALP PAQVRFAVPL
VTAEHAPALP RVPVPTAALP RFRRIHTTPA PDPFLSRPPP TQLSTPQPAP P