Gene PHATRDRAFT_49642 details


Gene Information

Locus tag: PHATRDRAFT_49642
Symbol:
ID: 7198295
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011691
Strand:
Start bp: 284651
End bp: 287921
Gene Length: 3271 bp
Protein Length: 705 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002184337
Protein GI: 219128265
COG category:
COG ID:
TIGRFAM ID:
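
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding portion is three bases per residue plus one stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 284651
end_bp = 287921
gene_length_bp = 3271
protein_length_aa = 705

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the coding portion is 3 bases per residue plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that do not encode the listed protein
# (for a spliced gene this would include introns; the exact breakdown is not given here).
remainder_bp = gene_length_bp - coding_bp

print(f"span from coordinates: {span_bp} bp")
print(f"coding bases implied by protein length: {coding_bp} bp")
print(f"remaining bases: {remainder_bp} bp")
```

Under these assumptions the span computed from the coordinates matches the listed Gene Length, and the protein accounts for fewer bases than the gene span, which is consistent with the gene being flagged as spliced.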


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGATGG ATCGGCGAAG GTGAAACCTC TATGGATGGA ATCCAGAGTG GTATTAGTAC 
CAGACTCCCT GGACTATATT CTAGCTAGTA GGTAGCATAG CGGACCTGAC TGTGCGCTTT
CGACTCCACA GAGTGCCCAT GTTAGTAGTA GGCAACGAAC ACGGACCCCC CCCCCCTCCC
CTACCGACGG GTGGGGTTGG GCACAGTGGA TGGGTGGACC GACGAAAAAA AAACGTATGC
GCGGACCCCG GGAAGCACCC TCTTCATTAC ATGAGTAGGT GACTGTACTT CTATGGGCTA
ATGTTGGCCG TACCAACAAT TTACCTACCT ACCTACCTAC CTACTTGTTG GATTTTACTA
TACGACTCTT GTGTCATGGA CTGCGTGATG ACTTGCGAGT CACCACCGTC CAGGCGCCAA
CCGACAAGAC GGACAAACGC CACGAGAGAT ACCTGGATAG AATAGAATGG CAGCCGCCGT
TCATTCTTCA CAGACACGAA CCAACCACTT GAGGGACGCG TTTTGGGTGG AAACGAGTCC
AATAGTGAGA GAATCTACAC ACACCACGTG ATACGACACG CGGAACCATC CAGTGTTGGC
CCCCCCCTCA CGGAACATTT TCCAGTCGGC TTGGGAGAAT AAATATCGAA CGCGAGACAC
CAAACACCTC CTACCCAAAT ACACCCACCA CCACACAACA AGCCACACGC AACAAACACT
TTTCATTCAT ATCATTATCG ATATACTACT TACAGTTAAA ACACAACACA ACAAAAACAC
TATTGCCTAT ATTCCTCTGT GTGGTGGTAA CATTGGTATT GACATCATGC GGACGTTCGC
CTTACTGTCC ATTCGATCCG CACACGGCCG GATGTTGTGG AGAGGGACAG TCCTACTGAC
CGTGATGATA CTCCTGCACC GTCCCGAGCC GACGCACGCC TGGACAACGA CAACAACAAC
GACGACGACG AGGTGGCTGC TTTCGTCCCC CACGACGTCT TGGACGAGTA CCACTACACG
TCCGACCGCC GCCGAAACAA GAACGGCAAC GACCGGCACA ATCACGACCC GCCGGCACCT
GTCCAGCAGT ACCTTGCCGG ATGCCAAACT CCCGAACGAT ACCGTCTCTC CCGACCCTAC
TCGCCCGGAG ACCGACGAAA GCAGTCGTCG GCAACAATTA GTCTTCAACC AAGGACTCAA
CGATCTCGCG GAATCCTGCA GTAATCCTAA AGTACAGGTC GTGACCAAAG CACAAGCGTG
TCAAGACCGC TACACCGAAG CGGTACAGTC CCACACACAT CCGACCGACC TCATTTCCTT
CAATACCGTC CTCAAGGCCT GGACCAAGGC CGGAGCTTGT CTGGCGGAAC ACGCACAACA
CGGACACATG CTGGACGCGA ACGTGCCCGT CTACACGCCC CGCGATTGTG CGCAACGGGC
GCAGGATCTC CTCCAAGCCA GGGTCGCCCA ACAACAGGAC GTCGATACCA TGTCCTACAA
CAGCGTCATG GACGGTTGGG CCAAATCACG TGCCGTCGAA GCACCCTTGC GCGTGGAAGA
ATTGCTCGCG CAACTACAGC AAGGCAGTCG ACACGGTTTG TATCCCGATA CACTCTCCTA
CAACGCCTTG GTCGACGCCT ATGCCTACAG CAACAAACCG GAGCGCATGG ATCGACTCGA
ACAGATTTGG CAAGATATGC AGCGCATGGA TCAGCAACAA ACGGATTCGG ACGACGCCGC
GTCCGTTCCT CGCGTGCGAC CAACCGTACG GTCCATCAAT TCTATTCTGC ACGCCTACGC
ACGCCAAGTA CCCGAAGACG CCACCTACGC ACCCAAAGCC CTACAAATCC TGGTCGACAT
GAAACGGCAG CACGAAAAGG TCCCCGATCC CGCAGTCCAA CCCGACGTGA TGACCTACAC
CACCGTCATG GACGCCTTTG CCCGCGTCGG CAACGTCCAA GCTGCCGAAC AGGCCGACCA
ACTCTTTAGC GAACTCCAAT CGCTCTACGA ATCGACCAAG AACGATCGAT TCCGGCCGAG
TGTCTACACT TACGTCACTT TACTCATTGC CTGGTCAAGG TCGCACGCTC CCCAAGCCAC
TACCCGCGCC TCGGAAATTT TGGAAGCCCT CCTGGCCGAC CCACACGTGA CCCCCAATGC
CCGCGCCTTT ACGGCCGTCA TTGCCACCTG GGGACGCAGT AGGGATGTCC GCAAGGCCCC
CAAAGCAGTT CAAATTCTGC AACGCATGAA AGCCTTGGCG ACCACCAATC CTGAAGTGGC
GCCGAGCCTG TACAGTTACA ATAGTGCGAT GGATTGTTGT GCCCGGGTCC GCGGCGATTC
GGTTCAAAAC ACTGCGGCCC TGAAAATGGC CTTTGCTATT TTTCAGTCCC TCAACGCGGA
TACGGCCGTC CAAGCGAATC ACGTCACGTT TGGTATATTG CTGAAAAATG CCGGGGCTTT
GCTACCGGCC GGTGACGAAC GGAACAAAAT TGCGATTGCC GTGTTGAAAA AAGCCATGGC
CGCTGGTCAA GTCGATCCGT CGGTTTTAAT CAACTTCCAA AAGGCGGCCG ATGCGTCGGT
TGTATCGGTA ACGTTGGAGC CACTGGCGGC CGGGCAAGGG CATTTGGATT TCAACAAGAT
TCCGGCAGCC TGGAATAAAC ATGTGCAAAA GTAAAACAAA GATGGGCTTT GGCGAGGCGG
AAACGTTTGA CGAAAGTTGG GAATGAGTTC GTTCGTACGC ATTTCCAAGA GGGGTTGCGC
GTTCCTTTCA ATGCTAACGA TTGGCATTCT GGCACCTGCG CATTGCCCGC GGGCGCTGTA
GTACGGTATC ACCGATGCAT TCTGTCTGAT CTCGCCGCAA TTCGTTTCAC CCGAGAGGAA
TTCGTCGAAC CGTAGGCCAA CGCAGAATAT ATCAAACCAA TATCGGAACG CCGTCTTTTT
GATCAATACT ATTAAATTAA TACGCTGGGA TGTATCATGA ATAGGCAATG TAGTTTACTT
TCCTCTCCCC ACGAGAAGAG CGCCTTAGTT GCTATTCTGA TCCAATGTCG CTAGCGAACG
CATAGAACAC CGCTGACTGT AAACACAATT TGACATGTAG TAGCGCAATC GTGCATCCTT
ACTTGAAGTT CGGGAGATTG CGCATCCCGG ACGAGCCTGC TGGGGCACCT GATAAGTGTC
CGCCACCTTG TTGTTGTTGT TGTTGTTGCT GCTGCTGCTG CTGTCGCAAA ATCAACTGCA
TCAATTGGGC GTGTTCCTGT TGCTGTCGCC G
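
The gene sequence above is printed in space-separated groups of ten bases, so it has to be joined before any computation such as GC content. The following is a minimal sketch of one way to do that. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers, and the short example block is just the first three groups of the sequence, not the full gene.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three ten-base groups of the gene sequence, used only as a small example.
example_block = """ATGGGGATGG ATCGGCGAAG
GTGAAACCTC"""

seq = clean_sequence(example_block)
print(len(seq), f"{gc_fraction(seq):.1%}")
```

Pasting the full sequence block into `example_block` would give a value to compare against the GC content listed in the Gene Information section.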
 
Protein sequence
MGMDRRSIAD LTVRFRLHRV PMLVVGNEHG PPPPPLPTGG VGHSGWVDRR KKNVCADPGK 
HPLHYMIKTQ HNKNTIAYIP LCGGNIGIDI MRTFALLSIR SAHGRMLWRG TVLLTVMILL
HRPEPTHAWT TTTTTTTTRW LLSSPTTSWT STTTRPTAAE TRTATTGTIT TRRHLSSSTL
PDAKLPNDTV SPDPTRPETD ESSRRQQLVF NQGLNDLAES CSNPKVQVVT KAQACQDRYT
EAVQSHTHPT DLISFNTVLK AWTKAGACLA EHAQHGHMLD ANVPVYTPRD CAQRAQDLLQ
ARVAQQQDVD TMSYNSVMDG WAKSRAVEAP LRVEELLAQL QQGSRHGLYP DTLSYNALVD
AYAYSNKPER MDRLEQIWQD MQRMDQQQTD SDDAASVPRV RPTVRSINSI LHAYARQVPE
DATYAPKALQ ILVDMKRQHE KVPDPAVQPD VMTYTTVMDA FARVGNVQAA EQADQLFSEL
QSLYESTKND RFRPSVYTYV TLLIAWSRSH APQATTRASE ILEALLADPH VTPNARAFTA
VIATWGRSRD VRKAPKAVQI LQRMKALATT NPEVAPSLYS YNSAMDCCAR VRGDSVQNTA
ALKMAFAIFQ SLNADTAVQA NHVTFGILLK NAGALLPAGD ERNKIAIAVL KKAMAAGQVD
PSVLINFQKA ADASVVSVTL EPLAAGQGHL DFNKIPAAWN KHVQK