Gene PHATRDRAFT_45808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45808 
Symbol 
ID7200817 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp345729 
End bp348912 
Gene Length3184 bp 
Protein Length784 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180218 
Protein GI219118903 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.558065 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCATAC AGACCCTTAC TTTTGGAAAA TATATCCCTT TTGGCATAGG GGCAGACTTT 
GTCACGCTGG TAGCTCCTGA ACTCTCCAGT GCCACTCCCA TACCTACCTT TTTCCGGAGT
GAAGCTCACA GGTGCCCTCT CCAAGATATC GCGAGTCTGA GCCAAAAGAA ACTCCTCAAG
TCGCAACAAG GAGAAGATCA ACACTTGCTT TCCTTTTTCA ACGGTCTCTG TGGAGGTACC
TATCTAGAAA TGGGCGCTTT GGACGGTAGG CTGTACAGTA ATTCATTTGC TTTCCACAAA
GCTTTGGATT GGAAGGGTCT ACTTGTGGAG CTGACTCCAG AAAGCTATCT ACGCTTGGTG
GAGAATCGGC CCAGCGAACT AGCCGTCGTT AACGCGGCGG TATGCGATCA ACCAAAGAAG
ATACACTACT ATTCGAAGCA AAGGCAGCCA GCCGTTTCAG GAGTATGGGA ATTCGCTCCT
ACCGAGTTCC GGGAATTATG GTGGCCGGGT ATCACGCTCG CTGATACTCA AGAGATCGAT
TGTAGACCCC TGAGGGAAAT CATTGCAACA AACGTTGGAG AACCAGCCTT TTTCGACTTT
TTTAGTTTCG ACATCGAGGG CTCGGAATTC ATGGCGTTGC AAGGGCTAGA TTTTTCCAAA
GTGGGTTTCG GAATTATATT CATTGAGCGC CAGCCAAATA ATCCAATGAA AAACCTGGCA
ATTCGTACTA TTATGGAGAG AAATGGATAC ATTTATTTGT ACGAAAAATC GAACAGCGTA
TGGTTCGTGA ATGCTCATTT TGATGAAATC TACAAAGATG TCATTGGCAC CTAGAAAACT
GCAACAATAA TGGACTAGAA TTGTGCTGAA AGACCCGAAG AATCCTGCAT ACTTTCGTAG
AAGATCAAAG TTGTAATACA TAGAATTATA GACTTGTGCA ATAGACTAAC AGTCAGGGAT
ATGAACCCCA CATATAATGC TTGTTTCAGG GGTTTTGCTA GTCCCCAGAC ATCGCAGCCA
CAGACATATA GACATGGAGA GATGTGCCCC CTCAAAACCA CAACGGGAAA AATAGGGATC
CACAGCTTGT AAAATAAAAC TTCAACACTG CATGCAATAA CAATCAGAAA TGCAAATTCA
AGTTGTCCCA ACATTGGCCT TGTTGAGAAC GGGAAAGTTG GCAACGAGAA AGTTGGCAAA
CCCTTCTTGT TAATAGAACA AATTTTGAGG AAAACCGCTA TTTCTGCCTC TACGACTTGG
AGAAATCGCT ATTTCGGTCA GATCTGAAAA TTGGTAAGTA CTTTTTGGTG AGCGCCAAAC
GGAATTAAAA AAATAATCTA ATGCTCGGTG TATGCTATGT ATATGGTATG GCGGAGGTAT
CGTGCTGGCA GGGATGAGCT GCTTCCAACC CATGACCTAC CCTTTCAATC TATTTCACCC
AACGTCGCCG AGCACCTTAT CCTTACCCTA CATATCGACA CAAGCGTTCG TCAATGGATT
CTACCGACAC GGACCGGGAA ACCTTCCTCC GCGGACAGCG CCCTACGGGA ACCTCGGGAA
GTGTGGACGC CCCGCGTACC CGCGCTCCCG TCCTGTGACC TCTGGTCCGA ATCTCCCCAA
TCTACCCAAC CCCAAGCGTG TTTTTTTTCG CGACGCTCGC GTTTACGCCC TCGTAGGGCA
CATCCAAACC ACCGACGTGG TTGACGAATT GTGTCAATTG GTCGTTGTTG ACGGTGATGT
TGCCGCCGAA GGTGAAATCA TCACTTCTCG CGTCCAAACA GTGCCTCACC AAGTATCAGC
CCTGGTCGAT ATGCAACTCC ATCCCGGGCA CTATGCTCTC TCCTGCCGGT ACCGCTCTAA
ACAATATGCC GATGCTGGTA GTGGAGGCGG TACCAGTGTC GGTAGAAATA ACGGTATCGG
TGACTCGCAC GACGACACCT CCCTCTGGAT CCACTACCCA CCGTCCCACA GCACCATGGT
GAGCTCCCAA CCGACCGTAA CTGCAGCCTC TAGAGCCGGC CACGACGCTC TGTTGGATGC
TCTGGCACTA CGGGAGCGTA CTCGGATTCT CAATTTGTAC TACAGTCCGT ACGGGGCCAA
CCCGACGGAT GCCGATTTGG AAGAAGCGCT CCACCAATTG TGAGCGAGCT GAACTCGACA
AGGAGTCGCA GATGGAAGCT CACGAAGAAC GCTGGCGACA ATTCCTGTTG ATTGTGTGGG
AAGAAGAACG TATGATGCGC TTGCCGGTAG CGATGGGTCA AGTTAGTTGC AGTACTGACC
TTGATATCTT GGTACGCTCA ACGGCAATTT CCGTACTTGC GCAGTATGGC GACGGCCCAC
CGCTGGGTAC CACCACGACT CCACTACTCT TGGATCAAAT TGCGTCTAAA CTCGTCACCG
CCTTGGAATG GACGAAGTTT CCGAGGCACG CTTGTCCAGT CCAGAACAAG GCGTCCATTC
TGTGATTGCT CGAGGTGATT TGGCTTGGCA TGCTACTGAA GTGGAACAAC TACCAGCGCT
ACTGCTTGTT TCCTTGCAGG ACTGTTTATG GGACGTCGTG ATGCCCTACG AGTCCACTTT
ATTGATCAAA GGCCTAAAGA ATGCCACGGA AACAGATCTT ATTCTGTTTT TGCTACGGCC
ACCGGTCAAA TCCACCTTGC CCGGGCTAAT CAACTGGGAC AGCTTCTGCC TCGTCCATAG
ATGAACACCA ACGCTTGGCA ACTTCCAGCT TAGCGACCTG CTGTATCAAT TCGATTTGTC
GGCTAACGCT AGGCAGATGT TTGGTCTTGT CCCAAATTAT GACAACACGA CCAAATTATA
TGTCGGCTGC CTAATTGATT TATTTTCAGG CTGTGGCGAC AATGTTTGTC AGCACTCAGC
ATATCGCCCT GTCAACTACA GTCTACCATC ACCTTCTACG GATAGTCCTT CCTTGTTGTC
TCCACTGACG AAACGACCAA GCTTCGCTAT TGTCCGTTTT ATCGTGTCAG GGGAACATGG
TGCGTATCCA TCACTATCAA ACAGTTTTAG TACGCAAACT TCTCCCGGGT CGCAGCGAGA
CCGACAAGTA CTTCAATACC ATCCGCCACA TCATAGTGAT GCAAGGGGGT GTCACTTTCC
ATTGCCTGAG GTGACGGGTG GTTAAGAGTT CCTCTTCTTC CTGCAACCAA TCTGACTCTT
GCAT
 
Protein sequence
MAIQTLTFGK YIPFGIGADF VTLVAPELSS ATPIPTFFRS EAHRCPLQDI ASLSQKKLLK 
SQQGEDQHLL SFFNGLCGGT YLEMGALDGR LYSNSFAFHK ALDWKGLLVE LTPESYLRLV
ENRPSELAVV NAAVCDQPKK IHYYSKQRQP AVSGVWEFAP TEFRELWWPG ITLADTQEID
CRPLREIIAT NVGEPAFFDF FSFDIEGSEF MALQGLDFSK VGFGIIFIER QPNNPMKNLA
IRTIMERNGY IYLTNFEENR YFCLYDLEKS LFRSDLKIGI VLAGMSCFQP MTYPFNLFHP
TSPSTLSLPY ISTQAFVNGF YRHGPGNLPP RTAPYGNLGK CGRPAYPRSR PVTSGPNLPN
LPNPKRVFFR DARVYALVGH IQTTDVVDEL CQLVVVDGDV AAEGEIITSR VQTVPHQVSA
LVDMQLHPGH YALSCRYRSK QYADAGSGGG TSVGRNNGIG DSHDDTSLWI HYPPSHSTMV
SSQPTVTAAS RAGHDALLDA LALRERTRIL NLYYTELDKE SQMEAHEERW RQFLLIVWEE
ERMMRLPVAM GQVSCSTDLD ILVRSTAISV LAQYGDGPPL GTTTTPLLLD QIASKLVTAL
EWTKFPRHAC PDCLWDVVMP YESTLLIKGL KNATETDLIL FLLRPPLSDL LYQFDLSANA
RQMFGLVPNY DNTTKLYVGC LIDLFSGCGD NVCQHSAYRP VNYSLPSPST DSPSLLSPLT
KRPSFAIVRF IVSGEHGAYP SLSNSFSTQT SPGSQRDRQV LQYHPPHHSD ARGCHFPLPE
VTGG