Gene PHATRDRAFT_44478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44478 
Symbol 
ID7197709 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp689115 
End bp692602 
Gene Length3488 bp 
Protein Length549 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178281 
Protein GI219114971 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.578489 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTTCGGTCGG CTTCGAATCC AACTCATCGT ACAAACCGCG ATCCAATAAC CAAAGCACAA 
CTACACAAAA ATATTGCACC ATGTTTGGAG GGGGAGGTCG TAAGAAAAAG GATGAAAGTC
TTTCACAGGC GATGAAGCCG GAAAACCCGC CGTCAGGTGG TGGAAGCGTA ACTGGATTCG
ACCCTGAGGG TCTCGAACGT GCGGCGAAGG CCGCTAGGGA TCTAGATAAC AGTCGGAACG
CATCCGCAGC AATCGAGCTG ATCAAGACAC AGGAGGCTAC GAAGCAGCAT GAAGCAGCGG
CGAAGAGAGC CGAAATGGAT GCATATGCGC AGCAACTGCG TGCTCAGAGT ATCGAGAAAG
AAGCCGACGA GGCACGCAAA ACCCTCGATG CACAAACTCA ACACGAACAG CGTCGAGCCG
AGTACCAAGA TCAACTGGAA CGCAAGCGCC AAGTTGATAT GCTTAATGCG CAAAAGTACA
TGCAAGAGTA CGTGGCGGCT CAATATCGTT GTAGGAAACA CAGCTTTTCC GTATTTGTTT
GCTTATTGTA TTTATTTCGA TCGCTTGTGA ATCTTTTAGG GAGCAGCTCA AAAAACAAGA
AGAGATGGTT GCACGCCAAG AGGAGATGCG TCGTAAAACA GCGCAGTATG AGGCTGAGCT
CCGGACGAAG ACCGAAATTG CTAAAGCAAA AGCCGAAGCT GAGGGACGGA TCGCGCAGGA
GCGTCAGAAC CATGATCTCA TTTTGGATAA AGTTCGTCTC GAAGCCTCCG AAAGCCGCGA
TACAGTTTTG AAAGCAATCC AAGATGGCGG CAAGCTTCTT GGAGAAGGAC TTTCTAGCTA
TCTTAATGAT ACCGAGAAAC TCCGAAACAC TGCGTTGACG ATAACTGGCA TCGCCGTCGG
GGTGTATGCT GCACGGACAA GTATTGGTAT CACTGGTCGT TTCGTTGAAG CACGTTTGGG
AAAGCCGAGT TTGGTTCGTG AGACTTCACG AATGACTGTC TCGCAATTTT TTACCAGCCC
TGTAGCATCT AGTCGGCGGA TATTGGGGAT AGGCGTACAC GAGCAAGATG CCTTGAAAGG
TATCGTTTTA GAAGATTCCC TCGATACTCA GCTTCGCAAA GTGGCGGTAT CGACGGCTCA
CACCAAAAAG AATCGTGCCC CTTTCCGTCA CCTGCTACTT CATGGTGAGT AGAAATTGAT
GCCGGACGTT TTTTTACCAA CCTTTTTTTC AAAAGTTGCT TACAATTGTC ACGTTCTTTC
GCAGGCCCCC CTGGGACGGG GAAGACCATG TTCGCACGAC AACTCGCGCA GCATTCTGGA
CTGGACTATG CTGTTTTAAC AGGTGGTGAT ATTGCTCCAC TAGGACGGGA AGCCGTCACT
GAACTTCACA AATTGTTCGA TTGGGCCAAA ACAAGCCGAC GCGGTCTACT ACTTTTCGTC
GATGAAGCCG ATGCTTTCTT ACAGTCCCGT GAAAACTCTC GTATTTCGGA GGATCAGCGC
AACGCATTGA ACGCGTTTTT GTTTCGAACC GGTACAGAAA GTGATCAGTT TATGATGGTG
TATGCGAGCA ACCAGCCTGC TCAGTTCGAC GAAGCTGTCA TGGATCGTAT CGATGAAATG
GTAGAATTTG ACTTGCCAGG ACCACACGAG CGGCGAAAGA TGATCGCCGT TTATATCGAT
AAGTACTTAT TGAACCCACC AAATCGCTGG ACGAGAAAGG TAGAAACTAT CGACATTGGA
GACGCAGAGA TTGAAGAAGT TGTCCGCGAA ACGGAAGGTT TCTCTGGTCG CGCAATATCC
AAGCTCGCTA TTGCCTGGCA GGCCGCCGCT TACGGAACGG ACGGTGCCAT CCTTGATCGC
GAAACATTTT TCAAGACGGT GGAACTGCAT AAAAAAAGCA TGATGACAAA GGAAATCTGG
CTCAAAACCG CAACGAAACG CGCCCAAATG CTTACTTCGG ATCGTTAACT CTAATTTAAG
CCACGATCGT CTTCTTTTCG TCTATTCACA TCTTAAATCT TGTTTGTTGG TTAACTTTTA
TTCCTTATTG GAGCTGATTC AACTCGCAAT CCTTCGTTGA AGTTCACACC CTGCGTTGCT
AGCGGGCAGA CTCTGGCTTT CTCCCAGTTT GCCCACATGT TTCCGATGCG ACGACGAGCT
CCTTCCGTGT TCACATAAAC GCAGGCGAGA GCATCATGGC CACCAGCGCC TGGTACGAGC
GCACCAATAA CTCCAGGCAA GGCAAGGGTT GCTTCGATCA GCTGTGTTTG TTCTTCCGGC
TCAACAGGAA CCTTGGCAGC TACACCCAAT GCTTTCAACT CTTTCCGAGC CGCCTGTAGA
GCGTTGCGTA ATGAAATGAG ACTCTCCTCA ATCTTATCGT GCGCGCACCA CTGCTCCATC
GAACAATTTG CTAGCCTTTC TATTTCAGCA TGATCGATAG GAATTGCGCC TATGTGCTCT
AGCCGGTCCA CCACCTGTCG ATTTATTTCC GCTAACTTAT GCCAGTGAGG AGCATTTCCG
TTTCCAAAAC CTTTCCTCCA CTTTAGCACT CTTCGAGCCA TTGAAGGGCT TTCAGATCCC
CCTGAGACAT CCGCCAGCAT TATTTGCAAG ATCGCGGGTA GTCGAATTGG AGCGGCAACA
CCTCCGGTCC AAGTTTTCTG TACGATAGCT CTCAGAATGG CTTGTACATG CTTTATTTCA
GATTTCGTTT TGTCTAATTC TCTTAGCAAA TCAGCAAGAA GATATTCAGG AAACCGCCGG
TAGACATGGG AGCCGTGACA AGCCGCTGAA ACATCAAATC CACTGCCAAC TTTTCCTTGC
GCGTGACAAT GCGAAATTTG TGCTAAATTG TAAATGACGG AAGGCTGATT GCATGCGTAG
CATAGCGATC CCACAAGACT GGTCACTAAG CAAGCACTGC TACCGAGTCC AGTTTTGAGC
ACATTACCAT CTGGCCCGGA AGTAGTAGCG GGTAAAAACG GGGGAAGTAG TTCTACCGAT
TTCAGAGAGC GATCCAGACC CCGCTCTTGC AGATGCGGAA TCAAGCTGTA AAAGTCGTTA
TCTGCTTGAA TATCCAGAGT AATACGACAA AGGCAATCTT CCTTTGCAGT CAACAGGTAA
AGTAGTGATA CTCGTAGACT CTTTTCAATG AAAAGATTCA CGCTATTGTT CGAGGCATCT
GCAGACAAAG TCAATGTCAC AGAATTGTAA AGATACTTCC ATGTTTGCCC AAACTGTGGA
CTGTTCACAT CAATCTTAAC ATGCGCTGAC GCAGTTTGGT CAAATTCAAA GGTGGCAGTC
GTATAGAACC GCTTATCGAC GGCTAGGACG AGACCAGTAT TGGGGGACTC CAGCACCAAA
TAGCCACCGG CCAGAAGAAT CTTACCCGGC GCAGAAACTG TCACCTTCTT TATAGAGGTC
AGCATGGACA CCACACTATC ATTAAGCTTG ACCAATCTCG ACCAAATATC GATTTGTTCA
CAGTCACT
 
Protein sequence
MFGGGGRKKK DESLSQAMKP ENPPSGGGSV TGFDPEGLER AAKAARDLDN SRNASAAIEL 
IKTQEATKQH EAAAKRAEMD AYAQQLRAQS IEKEADEARK TLDAQTQHEQ RRAEYQDQLE
RKRQVDMLNA QKYMQEEQLK KQEEMVARQE EMRRKTAQYE AELRTKTEIA KAKAEAEGRI
AQERQNHDLI LDKVRLEASE SRDTVLKAIQ DGGKLLGEGL SSYLNDTEKL RNTALTITGI
AVGVYAARTS IGITGRFVEA RLGKPSLVRE TSRMTVSQFF TSPVASSRRI LGIGVHEQDA
LKGIVLEDSL DTQLRKVAVS TAHTKKNRAP FRHLLLHGPP GTGKTMFARQ LAQHSGLDYA
VLTGGDIAPL GREAVTELHK LFDWAKTSRR GLLLFVDEAD AFLQSRENSR ISEDQRNALN
AFLFRTGTES DQFMMVYASN QPAQFDEAVM DRIDEMVEFD LPGPHERRKM IAVYIDKYLL
NPPNRWTRKV ETIDIGDAEI EEVVRETEGF SGRAISKLAI AWQAAAYGTD GAILDRETFF
KTISCGIKL