Gene PHATRDRAFT_44347 details

Gene Information

Locus tag: PHATRDRAFT_44347
Symbol:
ID: 7197826
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011672
Strand:
Start bp: 290217
End bp: 292744
Gene Length: 2528 bp
Protein Length: 646 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002178201
Protein GI: 219114811
COG category:
COG ID:
TIGRFAM ID:
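
The coordinates and lengths listed in this record can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene span runs from the start codon through the stop codon with no untranslated regions. The function names are illustrative and not part of the database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate span."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_aa + 1)


gene_bp = span_length(290217, 292744)   # Start bp and End bp from the record
cds_bp = coding_length(646)             # Protein Length from the record
intron_bp = gene_bp - cds_bp            # assumption: the remainder is intronic

print(gene_bp, cds_bp, intron_bp)
```

Under these assumptions the span is 2528 bp, matching the listed gene length. Of that, 1941 bp would be coding, leaving 587 bp of intron, which is consistent with the gene being flagged as spliced.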


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00207317
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGACAACCA GAGGTAGACG CGAATCTAAG GATTCACCGT GAGTATAGTA CATTCTGACA 
AATTGATAGC TTCCTTATTG ACATTGCAGT CCTATCGAGG GTAACAGATG GCTAATTCTC
TGCTCACATG AGAGGCTGTT CTTCAACAGT AAAAAAGGTT GCCAAACTTC AACATTGAAA
ATAGCCTCTA GAGAGTTGGA TGAACATAAG CTCTTCATTT TACGAGAGGT TCATTTGATA
TGCACCCCCG AGGGAACCTT ATGGAAGCGG GTATGAACTG ATGCGGCAGG CCATATCTCA
ATGAAAACAT CGGCCTTTCT AGGCCTCAAG CAAACGAAAT CTTTCATGCT CAATTAATAG
CTAAGACAAA CACGGCCTTT GAAATTCAGA TTGGCCTGAC AGCAAAATAG CCTAGATTAA
GTTCTTGATG CGTGCACACT ATGTAGCCAT CCACTATTTG ATAGATTTCG CTGACAATCA
ATCTTTTTTA GGCCTCCCCA AGGTCTCACG AAATCAAAGT TAAAAACTAA TCCCAGCTCC
TCAACCCCAT TTCTTGCATC ACTCATACTT TTGAAGCACG ATCTCCAAGT TACACTCTCT
TATCAAGGTA AGTATGCCCT TTCCGAAGAT AGAAAGAGGT TTCGTGCAAC AAGTGCTCAC
TGACCACATT TTGTTTTACA CACGCTATCT CCTATTAGAT GAAATTGACG AAAGCTTTTA
ACATACTGGC TGTCTTGGCA GGTGTCAACA CCATTGCCGC TGATTGCACG CATGGTCGTC
TCTTTGTCTC CGACATGGAC TCGGCTAATG TTTATACCTA CGAGATTGAT GGAACTAGTA
ATCCCGTCTT GCTCAACACC CTGCCGACGG TGACAGGAAT GGGCCCGCAG TTTTTGTACA
CGTCGTCTAC AGAAGGAGCT GTTACCGTTG TCTACCGTGG ATTGGAAGAA ACTGCGTATC
AGGATGGAGC TATCAGCTTC ATCCGGGTTG GAGTCACTCC AAGCAGCCAC GAAGTCGCAG
GTTTCTCTGT CGAGAAGGAA GACCCTTCTC TGGTTGACGA CTTTTTTGTG AGCTGTGCCA
GACCTATACA TCACGTGGCT CATGACCAAA AGATTGCCAT ATTCTGTGAC GGATCATTTG
AAGATAGCGT CAACTCAACA GTCTGGGTTG TTGACGAACG CTTTTTTGGG CAAGGCAACA
AGACATTGGT CTTTTCCAAG ACGCTGGAAG GTTCTCATCA TGGTGTTGCC GTCCCAGTGG
ATGAGGACCA TATTCTTGTC TCCGTGCCCA CACCGGAACG CGTGGCAAAT GATCCTAATG
CAAGTGCACT TCCTGACGGG TTCCATGTCT ACGACTATGA TATGAATTTG TTGCATGGCC
TGAATGAGGA AGAAGATCCT AGTCGTTCTT GCGCGGGTTT CCACGGAAGC GGTGTAATTG
ACAACACCTT TGTGTTTGCT TGCGATCAGG ATCATGGCGG GATCCTTGTT GTTGACTATG
GTCAAGCAGG TGTTACTTAC ACTTCTCGAG CTCTCTCCTA TCCGGATGGC TTTGATGCCC
ATCGGACTGG AACCTTAACA GAGCATCGTG ATAGCAACGC CATTGTCGGT AATTTTGCTG
ATAGAGCCAC TGGAGATTCT AAGCTTGTTT CATTCGTACC GAAGCAGCAA TCTGATGAAA
TCACTGAAGG ACAACTTCTG CCTCTGGAAT CGGGTCAATG CAGTTTCAGC TTTGAGCAAT
CGGGTGGTAA CCTAATCCTT GCCTGGATGC CGACAGGAAA TCTTCAAGTT TATGCTATTG
AGCCTGAATG GATGCTACTT GCCGACATCC AGGTAATCGA TGACATGTCT TCTTGCGACG
GAACTTCAAT GGCACCGGGT CAAGGACATG CCTACATCAT GCAAGGAACT TCATTGATTG
ACGTTGATCT TCATGACTTG ACGTCGCCGG AAATATCTAG CATTGATCTT GGCTTTATGC
CAGCATCAGC AGTGGTGGCT GGAGTTCCGG CTGGTTATGC CTGCGAGGCT CCCAGCTTTC
CTGAGACCTC TTCCACATCT GCTGTTGATG GATGGATCTC GATTGAGCAA GTCCTTGCTC
CTGCCGGTTC TACTGAATCA ACAGAATTTC TTCGGTCTTT CCGCAATGAT ATTGCTAGAA
GTCTTGGAGT TGGCCTCAAT CGCGTTTTCG TCGAAGAAAC AGTCAAGGAA TCGGATAGTT
CCACCGTTGT CCATGTCAAG ATGAGCGACC CCACGGAACA TGATGCGAAC CCCGCCACAG
GAAAGCAACT CCTCGACCAG CTCATTGCCA GCGGTATGAG TTCGGCAACT AGTGTGTCTA
GCCAAGCCCC TTCTCAGGCC ACGGGAGGCA ACGGCGGAGG AGATTCTTGG CCAACTGGGG
CTACCGTTGG CATTGTGCTT GTTGCAATTG TTGCCATTGC ATCGATTGTA GCGGCCGTCT
TGTTCAAGAA GCGCGAGAAA AAGGCCCTCT TTGACTTGGA AAAGGCCAAT AAAGGGCAAT
CGGCCTAG
 
Protein sequence
MTTRGRRESK DSPPPQGLTK SKLKTNPSSS TPFLASLILL KHDLQVTLSY QGVNTIAADC 
THGRLFVSDM DSANVYTYEI DGTSNPVLLN TLPTVTGMGP QFLYTSSTEG AVTVVYRGLE
ETAYQDGAIS FIRVGVTPSS HEVAGFSVEK EDPSLVDDFF VSCARPIHHV AHDQKIAIFC
DGSFEDSVNS TVWVVDERFF GQGNKTLVFS KTLEGSHHGV AVPVDEDHIL VSVPTPERVA
NDPNASALPD GFHVYDYDMN LLHGLNEEED PSRSCAGFHG SGVIDNTFVF ACDQDHGGIL
VVDYGQAGVT YTSRALSYPD GFDAHRTGTL TEHRDSNAIV GNFADRATGD SKLVSFVPKQ
QSDEITEGQL LPLESGQCSF SFEQSGGNLI LAWMPTGNLQ VYAIEPEWML LADIQVIDDM
SSCDGTSMAP GQGHAYIMQG TSLIDVDLHD LTSPEISSID LGFMPASAVV AGVPAGYACE
APSFPETSST SAVDGWISIE QVLAPAGSTE STEFLRSFRN DIARSLGVGL NRVFVEETVK
ESDSSTVVHV KMSDPTEHDA NPATGKQLLD QLIASGMSSA TSVSSQAPSQ ATGGNGGGDS
WPTGATVGIV LVAIVAIASI VAAVLFKKRE KKALFDLEKA NKGQSA
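
The sequences above are printed in blocks of 10 characters, 60 per line, so the whitespace has to be removed before any analysis. The following is a minimal Python sketch, assuming the sequence text has been copied into a string. The helper names are hypothetical, and the input shown is only the first line of the gene sequence, used as a toy example.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence only, not the full gene.
first_line = "ATGACAACCA GAGGTAGACG CGAATCTAAG GATTCACCGT GAGTATAGTA CATTCTGACA"
dna = clean_sequence(first_line)
print(len(dna), round(gc_percent(dna), 1))
```

Applying the same helpers to the full gene sequence gives a length and GC percentage that can be compared with the Gene Length and GC content listed in the record.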