Gene PHATRDRAFT_32656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_32656 
Symbol 
ID7197447 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp473989 
End bp476166 
Gene Length2178 bp 
Protein Length644 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177942 
Protein GI219112381 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCCGG GCCTCCCAAC CGGTCCCCCC TTTCGTTTCA TCGTTCCCCA CGGGACGGGC 
CATGGCGAAG TAATCCCATA TCCACAACGA ATTCGTCCCT CGCATTTTTT CCTAGCTTTT
GCCCGACAGG CACTGGAAGG CAACCGTCTC TTGCCGTAGT ACGGTGGTGT GTACCAATCT
TCACACAACT ACACAAATAC AACACACCCG CACATCCACG CGCACTGCTA ACTGTAAACG
GGGTACGGAC ACTGCGTGAA GCATTCGTCG TCATTGTCCC AGCGGTACCG CTACCGCAAT
GATGGACATC CTTCCCGACG CACCCGACGT GATTGTTCCG GTTCCACCGA GACCTTTACA
ACAGCCCCAC CCGCATCGAC AAGGTTGCGA CACATTTACT GTTAATCCTG TCCCAAGAAC
GAATGCGTGT GGTGCCGTGA GTGACGTTCG GGAATCGACG GCGCGTCGGG GACGATCGTT
GGGACGTCTG CCGAATCTAC ACGTTTGGAT CTTGGTGCTG GCCGTAGCTC TCGTGGCGGC
TGTCCTAGTC CAGTGGAGCG AATCGTTGTC GTCTTCCACG AGTCTTTCCC TGGACACCCC
CGAGGTTTCC CCGTTCCGAT GGCTTCTAAA AGGGGACGAT TCCCGTTCCC CACACAGCCA
ACACAAACCT ATTTACGACG AGGCGCACCC CGCACTCTTT CCCTTGTCCC GCAGCGATCA
AATTGGCTTT TTCCTCGCCA CGCTCGGTCT CATGGTGGCG GCTGGTGGCG GCATTGGTGG
AGGTGGGATC CTCGTACCCG TGTACATTCT AGTCATGGGC TTTACCCCTA AACACGCCAT
TCCCTTATCC AACGTTACCG TATTGGGCGG AGCCGTCGCC AATACGATCC TCAATGCCCG
CAAACGACAT CCCCTCGCGG ATCGACCCCT CGTGGACTGG GATCTCATCC TCGTTATGGA
ACCCCTCACC ATTGCCGGAG CCCTGCTCGG TGCCTTTCTC AACAAAGTTC TGCCGGAGTT
GTTACTCACC GTACTACTCG TACTCCTCTT GTCCGTCACG GCTTACACTA GTTTGACCAA
GGCGTTGAAA CTTTACGCCC GGGAAAGTCG CGCCATGGCC GCCGCTCAAG GACTCGTACG
GGTCGACGGA ACCAAGGAAT CGGAACTTAC CGTCATGGCA CGATTGGAAG ATCAAGACGA
CCACGACGAA GCTGCCGAAG TACTTTTGGA AAATATGGAA CGAGACGATG ACGACGACGA
ATCTAGTTCC GACGACGATA TGAAGAGCGT GGAGTTGCCC GCGTCGTCCT TGCAAGCCGA
ACTGGACCAG CTGTTGGAAG AAGAGTGCAC GACGCCCATG GCCAATATAT CGATCCTGGT
AACCATGTTC ATCGTCGTTC TCACGATCAA CGTACTCAAA GGCGGTGGCG CCTTTCCCAG
TCCTCTCGGT ATTCGTTGCG GATCTCGAGC CTTCTGGATC GCCAACCTGG TCATGCTCGC
TTGGATTGGG ATCATTAGCG TGGGCATCCG AGCCTACCTG GTCCGACGAT TTGAACAAAA
GCGACGCCTC AGCTTTCCCT ACGTCGAAGG CGATATTCGA TGGGACGCAC GAGCTACCAT
TGTCTATCCG GTGGTGTGCT GCATGGCCGG ATTCTTTGCT GGAATGTTTG GGGTGGTACG
TAAACGTCCA ATCCTGTAGG GAGACCCTCG CCGCGACGTT AGCGTATCAT ACCACCAGCT
CGCCTAGCCG CAGAACAAAT CTAGTATCGA TGCTCACACG CGTACTCTCT CTCTCTCTCT
TCTCTTCCCA CACAGGGCGG CGGGATTGTC AAGGGACCAC TCATGCTGGC CATGGGCGTT
CATCCGGCCG TTTCGTCCGC GTCCTCTGCT TGCATGATTC TATTCACTTC TTTTACAGCC
ACGACGAGTT TTGTTGTTTT CGGACTCCTC GTCTGGGACT ACGCGTACGT CTGCATGGCT
ATCGGTTTTG TGGCCACTTT CGCCGGCCAA GTGGGGCTGT CCTATCTCAT GAGGCGTGCC
CAACGTAATT CGTACATTGC CTTTTCTATT GGAGCCGTGG TGTTGCTGTC GGCCTTTCTT
ATGACCATAC AATCCCTACT GAGCATGGCA GCGGGAGAAA AACACCACTC GGGAGGAATT
TGTGGCAAGG GAGACTAG
 
Protein sequence
MDPGLPTGPP FRFIVPHGTG HGEVIPYPQR IRPSHFFLAF ARQALEGNRL LPYIRRHCPS 
GTATAMMDIL PDAPDVIVPV PPRPLQQPHP HRQGCDTFTV NPVPRTNACG AVSDVRESTA
RRGRSLGRLP NLHVWILVLA VALVAAVLVQ WSESLSSSTS LSLDTPEVSP FRWLLKGDDS
RSPHSQHKPI YDEAHPALFP LSRSDQIGFF LATLGLMVAA GGGIGGGGIL VPVYILVMGF
TPKHAIPLSN VTVLGGAVAN TILNARKRHP LADRPLVDWD LILVMEPLTI AGALLGAFLN
KVLPELLLTV LLVLLLSVTA YTSLTKALKL YARESRAMAA AQGLVRVDGT KESELTVMAR
LEDQDDHDEA AEVLLENMER DDDDDESSSD DDMKSVELPA SSLQAELDQL LEEECTTPMA
NISILVTMFI VVLTINVLKG GGAFPSPLGI RCGSRAFWIA NLVMLAWIGI ISVGIRAYLV
RRFEQKRRLS FPYVEGDIRW DARATIVYPV VCCMAGFFAG MFGVGGGIVK GPLMLAMGVH
PAVSSASSAC MILFTSFTAT TSFVVFGLLV WDYAYVCMAI GFVATFAGQV GLSYLMRRAQ
RNSYIAFSIG AVVLLSAFLM TIQSLLSMAA GEKHHSGGIC GKGD