Gene PHATRDRAFT_31322 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_31322 
Symbol 
ID7199351 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011699 
Strand
Start bp106216 
End bp108388 
Gene Length2173 bp 
Protein Length557 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185522 
Protein GI219130753 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGCCGGATCC CAATCCAACC CATCTTGCAC ACCGGGACCC TCGTTCCAAG GGCCAAGCTT 
GATTCACTAC TTCCACCACC AACACAACCA CCACCGTATC ATCGTCCATC CATTCTTTGA
TGCGTAACAA TACCCAAGAT CTAGATCACA GAAACTAACG AACGAACGAA CGAACTAACG
AATTCCTAGC GAGCGATAAA CGGATCGATA CCAGTGGTGA CATGATTGGA CCCGCCATTG
TAAGTACAAC CCGTCCTGTC GCACCTTGCT GTACGAACGA ACGAAGGTTT ACTGTACTTG
GATTCTCACC CATTGCCTCA ATTGTGTGTC ATCGATCGGT TCCGCACTTT GTTAGGATCC
AGCGGATGTC GGCTTGACGG GCCTCTTCTG GCTTTTTTTG AGCTACGGCT ACGTCTTGTA
CTCGAGCAGT AATCTCATTT CGGAAGGATC CGAGCTGCTC CTTCTCATCC CCAGCATGGC
GGGGTTGGTC GGCGGAGTGG TCCTCCCCTT GTTAGGAGCC GTCCCCGACG GCGCCATCAT
TCTCTTTAGC GGACTCGGAA GTCTTGAAGA CGCCCAAGAA ACATTGTCTG TCGGAGTCGG
GGCCCTAGCC GGCTCCACCA TCATGCTACT GACCGTTCCC TTTGCTCTTT CCGTCTACGG
AGGTCGCGTA GATCTCGACG CCAACGGCGT ACCCGACTAC CTTGTCAAAC CCAAACTTTC
CACCAAAACG TCCTGGAAGG CCGAATTCAC AAAGACGGGC GTTACCTTGT CCGATGCCGT
GCATCACGGT GGTGTCTTGA TGGCCCTCAC TACCGTTCCC TACTTCCTCA TACAGGTGCC
CGCATCGATC TACGCCACGC CCGAGAATTC GGAAGACGTC GTTGCGGCAC AGGAGCACTG
GTGGGCGGCG GCTGGATTCA TTCTCTGCTT GCTGGGCCTG ACGGTTTACA TGAGACTGCA
GTTGCATATT TCCCAACAGG GACAGGACAA GGGCAAGCGC ATGGCCGTCA TGAAGAAACT
GCTCAAACAG GGACAGGTTT CACTCAGTGG AGCCATTGCC GCACAAGTCA ACGCCAAAGA
GTCCGCGCTG CAGGCGCAGG CTGCGTCCGA ATACCAGTCC ATCCACGATG TCAAAGATGG
TTACCCCAGT CCGGCCATTG CGGCCTTTCT GAAAGAAATT CTGGCCGACG CCTTTTATTC
CTACGATTCC GACACCAATG GACAACTCGA CAAAACCGAA GTCTTTGTCT TTTTCCGAGA
CTTTCACGAA AGCATATCCG AAGAAGAAAT GGATAAGCTC TTTGCCAAGT TCGATACGGA
CGGCTCCGGT ACCATTTCTT TGGACGAATT TATCGGCCTC GCCTACACGC TCATCAAGGC
GCAGGACCAG CAAACGGCGC CGCGTCACCT GGACGCGTCC AGTCGCGGTA CCCGTGCCGC
CCTGGTACAG GCTGCCTTTG GCGAAGACGA AGACGAGGAG GAAGAAACCG TACCGGAAGA
ATTCACCTCG CTCACGCCCG ATCAACAACA ACGCGCCATC AAATGGAAGG CCTTTCGGAT
GTTGGCGTTA GGAACCGGCC TGGTCGTGCT CTTTTCAGAT CCCATGGTGG ATGTCATGCA
AGAGATTGCG GTGCGGTCGG GCATATCGCC CTTTTACGTT TCCTTCGTGC TGGCACCATT
GGCGTCCAAC GCCAGCGAAG TGATCGCCTC GCAATACTAC GCCAGCAAGA AGACGCGCAA
AACGATTACC GTGAGTTTGA CGGCGTTGGA GGGTGCCGCT TGCATGAACA ATACGTTCTG
CTTGTGTATT TTTATGGGGC TGGTCTTTGT GCGCGGCTTG GCTTGGCACT ACACGGCCGA
GACGGTAGCC ATTGTGATTG TGGAATTCAT AATTGCATTT ATCGTGATTC GAGAAACTAC
CATGACGACG GGAATGGCCA TGTTCATCTT GGCGTTGTTT CCGCTGAGCA TTGTGCTCGT
CGCCGCTCTA GAAGCGTTTG GTTTGGATTG ATGGATCGGT TCACTATTTG AGTGGGTCGG
TGAGTCACAC GCGTGTTGAA AGTATTTGTT ACGTGTCGGC GCCGGGACCA CAACCTTGGC
TCCGTGCCGA ATGCACTTTT TGGAATGCAA TGTTCCGTTA ATGAAAACGC TAGTAGAGAG
AACAAATGTA TGG
 
Protein sequence
MIGPAIDPAD VGLTGLFWLF LSYGYVLYSS SNLISEGSEL LLLIPSMAGL VGGVVLPLLG 
AVPDGAIILF SGLGSLEDAQ ETLSVGVGAL AGSTIMLLTV PFALSVYGGR VDLDANGVPD
YLVKPKLSTK TSWKAEFTKT GVTLSDAVHH GGVLMALTTV PYFLIQVPAS IYATPENSED
VVAAQEHWWA AAGFILCLLG LTVYMRLQLH ISQQGQDKGK RMAVMKKLLK QGQVSLSGAI
AAQVNAKESA LQAQAASEYQ SIHDVKDGYP SPAIAAFLKE ILADAFYSYD SDTNGQLDKT
EVFVFFRDFH ESISEEEMDK LFAKFDTDGS GTISLDEFIG LAYTLIKAQD QQTAPRHLDA
SSRGTRAALV QAAFGEDEDE EEETVPEEFT SLTPDQQQRA IKWKAFRMLA LGTGLVVLFS
DPMVDVMQEI AVRSGISPFY VSFVLAPLAS NASEVIASQY YASKKTRKTI TVSLTALEGA
ACMNNTFCLC IFMGLVFVRG LAWHYTAETV AIVIVEFIIA FIVIRETTMT TGMAMFILAL
FPLSIVLVAA LEAFGLD