Gene PHATRDRAFT_20608 details


Gene Information

Locus tag: PHATRDRAFT_20608
Symbol:
ID: 7201190
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011677
Strand:
Start bp: 664512
End bp: 667792
Gene Length: 3281 bp
Protein Length: 830 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002180684
Protein GI: 219119866
COG category:
COG ID:
TIGRFAM ID:
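
The gene length listed above matches the start and end coordinates if they are read as 1-based and inclusive (an assumption; the record does not state its coordinate convention). A minimal Python sketch of that arithmetic, together with a GC-percentage helper that ignores the 10-base grouping spaces used in the Sequence section, is shown here. The function names gene_length and gc_percent are hypothetical and not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace used for grouping."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the record above.
print(gene_length(664512, 667792))  # prints 3281
```

Because the gene is spliced, the 3281 bp span includes introns, so it is not expected to equal three times the protein length.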


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTGATG TTCCATCTCG TCGGACTAGT CGAGAAACAA GACTACACGG CGGAGGCTAC 
AAGTGCCGAA ACCCCGCGAC AAGCTGGTAG TATTTGAGAT ACTCATCTCG CACAAGGCGA
AGCGAGTCCT CCTTCAGTGG TCGATTGTGT CATCCGTACA AGTTTTCGTC GGCTCGTCAA
CTCCATCTAA AGTTGCCTTT CCGTCGGCGT ACGCAACGAA ACCCATTATC CAATGGCATT
CAATATGAAG TTTCTCGCTT TGTTTCTCAC GTTCTTTCAA GGCGAAGAAT GTTGGGCTTT
CCATCCGATT GCTTCCTTTC GTTCAGGTCG TAGCGGTAGA AAAGGGGTAG ACGGCTCAGC
CTCGACATCG TCACGTTGGG CGTCCGACGG TTCCGGGACA GCCTTTTCGA TTGAATCGGA
CCACAGCGGA CCCGCCGGTT TCCCCGTACA CCGCGTCATA TTTCACTCGC TGCCCGGTCA
AGAGCAGGAT GTTCCTCCTA TTTGCATTGA AACGGGAAAG ATCGGACGAC AGGCGGCGGG
TGCTGTCACA TTGACTCGCG GTGATTCCGT TCTCTACGCG ACAACGTCGC ACGACAAGGA
CCCCAAGGAA GAAATCGACT TTTGTCCCCT CAGTGTGGAC TACCAGGAAC GATTCAGTTC
CGCCGGGCTC ACATCCGGTG GGTACAACAA ACGGGACGGA CGTCCCGCTG AACACGAAAT
TCTTACCTGT CGTCTCATTG ACCGACCGCT CCGGCCATTG ATTCAGTCGG GATGGCGGCA
CGAAACGCAG CTTCTCTCCT GGGTCTTGTC CTACGATGGG GAACGGTCCT GCGATCCTCT
TGCGATCATT GCTTCGGCCT CGTCGCTCTT CATTTCCGAT GTACCCCTGC ACAAACCCGT
GGCGGCGGTG CAAGTGGGCA TGAAGGATGA TGGCACGTTG GTGGTGAATC CTACAAATCT
ACAGATGGAG ACGAGCAAGC TCAACCTGAT GGTGGCCGGT ACCGAAGAAG CAGTTTTGAT
GATTGAAGGA GCCGCCGACT TTTTACCAGA ATCCACGATG ATTCAGGCCG TGAAAACGGG
CCATGAGGCG ATTCAGGTCC TATGTCAAGG GCTGACCGCG CTAGGAAAAG TCGCCGGAAA
GGAAAAGAAG CTGGATACTA TTAAAGCTAC TCCAGAGAAC TTGCAAATTC GAGTAGATGA
ACTTTTCAGC GATCGTATCG ATGATATGTG GTCGTCTGGT CTGGGCAAAG AAGCCCAAGG
GAAAATCATG ACTGATTTAT ACACGGCCGT AGTCGGCGAG CTTATCGAAG ACTATCCTGG
CGAAACGGTT GCCATCAAAG GCGCGTTTAA AGATCTGTTG TGTCGACGCA TGTTTTTCCG
TGCCCGAGAA GAAGGGCTTC GTTGTGACGG TCGTGGACCG ACCGACATTC GTCAGCTTAC
AATGGAGACT GGGTTGCTGC CGCGTGTGCA CGGGTCTGCC CTCTTCACAC GAGGTGAAAC
ACAGTGTGTC GCCACGACAA CACTGGGTGG TTCCGGCATG CGACAAAAGA TTGAAAAACT
GGACGGAACC GATGAGAAGC GCTTTTATCT GCAGTATACG TTTCCGCCTA GTTGCGTTGG
TGAGACGGGC CGAGTGGGAG CTCCGGGACG TCGAGAAGTC GGACACGGTA ACCTGGCGGA
AAGGGCCCTG ATTCCAACCA TACCAGCACT AGCTGATTTC CCTTATACAA TTCGGGTAGA
GTCTCTCATC ACTGAGTCGC ACGGTTCCAG TTCCATGGCC AGCGTCTGTG GCGGATCACT
GGCTTTAATG GATGCCGGTG TACCCATCAA AGCACCCGTG GCCGGTATTG CAATGGGAAT
GCTGCTAGGC GATAAAGGCG GCGTATCCGA CGAGAACGCT GTAATACTTT CGGATATTCT
TGGAACCGAG GATGCACTTG GTACTATGGA TTTTAAGGTT GCAGGTGATC GAGTTGGAAT
ATCCACTTTT CAACTGGATA TCAAGTGTGA AGGTTTGACT TTGGAAACCA TGGAAAGCGC
ACTGGAACAA GCACGCACGG GCCGCTTACA TTTGTTGTCG GAAATGGAAA AGGTAATTGC
CTCACCGCGA GAGGAGCTCC CCGCAACTGT CCCAAAAATG ATGTCGTTTT CAATTCCTGT
TGAGGCCATT GGTAAGATTA TTGGACCAGG TGGTAAACAG ATTCGTGCTA TCATCGAAGA
TTTTGAGCTT GTAAACATGG ACGTCGGTGA AGAGGGAGGG GTACAGTTAT CCTCGTTTGA
TACAGCTAAA ATGGGAGAAG CCCAGACCTT TATTACTACT TTGGTTAGTA GTGCCGGACG
AAATGGACGT GGGCCAAGAG AGGAACGTCC AAAGTATGAA GGACCGGAAC CCGTTGAAGG
TGAAACCTAC ACCGGAAAAA TTACTGGTAT TCATCCGTTT GGAGTTTTTC TTGAGATTTT
GCCCGGTGCC GAGGACGGTT CTTACCCGGG TCTTGAAGGA TTGGTTCATG TCTCGGAGCT
GGCCCACGAG CGTGTTCGAA ACTGCGAAGG TTTCATGAAG AGCATGAATG TTGAAGAACT
GACGGTCAAG TATCTCGGTA AAGACAAGGG TAAACTGCAG CTTAGTCGAA AGGCTCTACT
CGAAGAACAA GGTGGAGACG GAGGCCGAAG AAATGGTTCT CGAGGACCGA GTCGAGAAGC
AGCCGCGCCA ACTCCCGAAA TGACAAAAGA TGAGATCGAC GTGATTGCGC AAGCTATTGA
GGGTGTAACA GAGCTATAGA TTTCTTAGGC TTCGACTCAA GCTATTCCTC TTCGAAGTGA
CCGTCCAGTA CATCCAGTCC CATTTCGTTC AAGATGTTTC GAACTGGACT CGGCTTTCCG
TGTCCAGCTT CTAGCGATTT CAACAAGTTT GTCAGAATCT CAGCGTTGTG ACGGACACTC
TCATCTTGTT GGACATCTTT ACTTCCATTT TCGGAGATCG GTCGGATTCG CAACTCGTTG
TCCATAGCGC TCATGACATC GGTCATGGCT TTATCAACGC CATCGCCACC TTCCTCATCA
TCTAACTGGT AGTCGTCTTT CGAGAAAAAC AGATCCTTCT CGATTCTCGT AAAATCGAGG
TCGTCTGCAC ATTCTGCTTG CAAAGTTGAG TGCAAAATGT TCAAGAACAC AGTAGGATCA
ATGCGCAGTG GACGAGTCAA TCCAATTTGA TGCGAAACAC CTTCAATCGT GCTTTTGCCG
TGCATGAAAG ACTGGACTCC GCTCAGCATA TCGCTTAAAG C
 
Protein sequence
MAFNMKFLAL FLTFFQGEEC WAFHPIASFR SGRSGRKGVD GSASTSSRWA SDGSGTAFSI 
ESDHSGPAGF PVHRVIFHSL PGQEQDVPPI CIETGKIGRQ AAGAVTLTRG DSVLYATTSH
DKDPKEEIDF CPLSVDYQER FSSAGLTSGG YNKRDGRPAE HEILTCRLID RPLRPLIQSG
WRHETQLLSW VLSYDGERSC DPLAIIASAS SLFISDVPLH KPVAAVQVGM KDDGTLVVNP
TNLQMETSKL NLMVAGTEEA VLMIEGAADF LPESTMIQAV KTGHEAIQVL CQGLTALGKV
AGKEKKLDTI KATPENLQIR VDELFSDRID DMWSSGLGKE AQGKIMTDLY TAVVGELIED
YPGETVAIKG AFKDLLCRRM FFRAREEGLR CDGRGPTDIR QLTMETGLLP RVHGSALFTR
GETQCVATTT LGGSGMRQKI EKLDGTDEKR FYLQYTFPPS CVGETGRVGA PGRREVGHGN
LAERALIPTI PALADFPYTI RVESLITESH GSSSMASVCG GSLALMDAGV PIKAPVAGIA
MGMLLGDKGG VSDENAVILS DILGTEDALG TMDFKVAGDR VGISTFQLDI KCEGLTLETM
ESALEQARTG RLHLLSEMEK VIASPREELP ATVPKMMSFS IPVEAIGKII GPGGKQIRAI
IEDFELVNMD VGEEGGVQLS SFDTAKMGEA QTFITTLVRP EPVEGETYTG KITGIHPFGV
FLEILPGAED GSYPGLEGLV HVSELAHERV RNCEGFMKSM NVEELTVKYL GKDKGKLQLS
RKALLEEQGG DGGRRNGSRG PSREAAAPTP EMTKDEIDVI AQAIEGVTEL