Gene PHATRDRAFT_50589 details

Gene Information

Locus tag: PHATRDRAFT_50589
Symbol:
ID: 7199409
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011699
Strand:
Start bp: 244161
End bp: 246311
Gene Length: 2151 bp
Protein Length: 668 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002185546
Protein GI: 219130803
COG category:
COG ID:
TIGRFAM ID:
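
The length and composition fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: the function names `span_length`, `coding_length`, and `gc_fraction` are hypothetical, and the coordinates are assumed to be 1-based and inclusive (which is consistent with 246311 - 244161 + 1 = 2151).

```python
def span_length(start_bp, end_bp):
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa):
    """Nucleotides needed to encode a protein plus one stop codon."""
    return protein_aa * 3 + 3


def gc_fraction(seq):
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        return 0.0
    return sum(base in "GC" for base in bases) / len(bases)


# Values taken from the record above.
gene_len = span_length(244161, 246311)   # matches the listed 2151 bp
cds_len = coding_length(668)             # 2007 nt for 668 aa plus a stop codon
```

Under these assumptions the listed gene span (2151 bp) is longer than the 2007 nt needed for a 668 aa protein plus a stop codon, so the displayed gene sequence presumably includes some bases outside the translated region.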


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00629858
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAAGGAT CCGTGACCAA CACCTACCAC CCAATTCCGC AGAACTGGCA TTCGCAAGGG 
TACTTACATT TACTATAAGG TCGATCCACA CAGCACATGA ACAGGTCCAC AGCCGGTGGT
CCGTCCGCAA TCGGGGAGCG CCAGATGGAA ACCATGGGGA GTCCCGTCGA CGACGTGGAC
GCCATAGATA TGGACGAGGA GGAGTCGATG CCAGATAATT TCGAAGCTCG TTTTCCGCTT
TCGAGGCATA TTGCCAGCTG GAGATTCTGG TGGGCCTTTG CCAAGTACCC GTTGGCTCTA
CTCCTCATCA ACACGGTTGT GTTGCTTTTC CTGTCCCTGT ACAATCCAGA AAATGACTAC
CAAGACACCT ATAACGATGA TTGGGAGACG AGCTCGTTAA TGCTCATGAT GACGAGGTTT
CGTGTCAAAT GCGTAGGGGT GGTAGTGATG GCATTCTTGG ATACCATTTT CCTGTACTTG
GCCGCCGCGT TTCTTCGGAA GGATATGCAC TTGCTAGTGA GGGAGCATCA AGCACAGGAC
TCGCAAGCAC AAAGCGCCGA GACGCAGAGT CCGATATGGA GAACAGCGGC ACCGCTCCCG
CAATTGGATT TGAAGGTACT GGACCACGTG CACGCTCGAG TCGCGCAAAA GGTAGACGCG
TATCTTCATC GCGATCCAGC ATGGCAATCA ACGTTTCCCT TTTTGGTCAG CTTGTTGTAC
CTGCTTATGG CACTCGCGGC GACGGCTTCT CTGACTTCGG TTTTGCTGCT GTTATTTGTA
AATAGCGCGC AAGGAATGTC TCTGTGTACA GCAAGGGACA CCAATGTCAT CAAATTCAAT
CAGACAGATG GTGTCCCGGA AGATTTGCTA GACTGGGCGA ACGAAGGATC CTCCGTTGAA
GCAACATACG TCCGTCTATC CGATGGGACT CTTTACTTCC GAGGGTCTGC AGCATATATA
CAAGAACCTA CAATGCTTTC TGTTTACCAT GGTATGAGGA TCAACGGGTT TCATTCTATA
GAATCTGACA GGGATTCAAA CCTAAGGCTG CTTGCGTCCA AGGCGGATGG AAGTGTTTCC
ATGTATAGCC ATGTGAGCAG CCCTACGTTT TTTGCAAGTG TCTTGGAAGA CAACAGTGGA
ACATCAACTT CCTTCTGTTT CTTGTACAAG GAGGTGGTCC AGGAAGGCTA CGATATAGAT
ATCTTCCCCT CTGTCCCCTA CGCGAGTAGT GTCTTCTCTG TTGCTTGCAT TGACTCTAAA
GAAGATGCCA ATCAAGAAGT TCGGAACACT ACTTTTTCCA GCAGCAACCA ACAGGAACTG
AGATCATGTT TGGGCAGAGC TTATGATGGT GAGTACTGGG TACGCCAGGA TGGAGTAGAT
AGGCATTTTT GGTCCATCCA AGAAGTGCAA ATAGTACGAG TGAATCCACA AACCATGGTG
GCTACAACAA TTGCCAATAA AACCCATTCC CGTGACTTTC GACCGGTGTG GTCCAGAGAA
GGAAAATGCA ACCTTCGGAT CCAAGAAATT GGATATAGTG CTGCTGCAGT ATTTCTGTTT
ATTTTTGCTC TTGCAATGGA ACAAACTTTG AAAATTCCAT CAAGTGCGGG TTGCTTGGCT
ATGTCTATCG TTGCAACTCT GACTTGGATA GATGAAACGT TGGCAGGCTT GCTAAGTCTC
GTGCTGATTA TTGCGACAGC TTTTTACTTG ATCATGGGGT CACCCAGCTT AGTTGTCCGA
GAGCAGATGC TGTGGGGAAT GTACTGCGTT ATCATCCTGC AACTGGTTTT CGAATTTTAC
CCCGTGGTGG TTTTTGGCCT GGGTCTGGGC ATTGCTCGGG ACCATCCCGT ACTTCAGCTG
GGTGGCTGGA TTGGAGGACC ATTTGCTGTC TTTTTTCTTT TGTTCTACTC AATCACTGAT
TCCATTGATT GGTTGGAATT GGTGGCGTTT ATTCCATTGA GTGTCCTTAT TGCTTGCGGG
ATGGTGACGG CAGGGAATCA ACTCACAAGA TACCGCCCAT TTCTGCTGTT CTACCTGAGG
CGTTTGTGGC GATCGCTTTG CCTGAAAATA CGGCCGCAAA TTCGACAACA AAGCAGAAGC
TGAGAAATGG ACTTTTATAC ACAGATTGAA AGGCATGTTG ACTAGCTGCT G
 
Protein sequence
MNRSTAGGPS AIGERQMETM GSPVDDVDAI DMDEEESMPD NFEARFPLSR HIASWRFWWA 
FAKYPLALLL INTVVLLFLS LYNPENDYQD TYNDDWETSS LMLMMTRFRV KCVGVVVMAF
LDTIFLYLAA AFLRKDMHLL VREHQAQDSQ AQSAETQSPI WRTAAPLPQL DLKVLDHVHA
RVAQKVDAYL HRDPAWQSTF PFLVSLLYLL MALAATASLT SVLLLLFVNS AQGMSLCTAR
DTNVIKFNQT DGVPEDLLDW ANEGSSVEAT YVRLSDGTLY FRGSAAYIQE PTMLSVYHGM
RINGFHSIES DRDSNLRLLA SKADGSVSMY SHVSSPTFFA SVLEDNSGTS TSFCFLYKEV
VQEGYDIDIF PSVPYASSVF SVACIDSKED ANQEVRNTTF SSSNQQELRS CLGRAYDGEY
WVRQDGVDRH FWSIQEVQIV RVNPQTMVAT TIANKTHSRD FRPVWSREGK CNLRIQEIGY
SAAAVFLFIF ALAMEQTLKI PSSAGCLAMS IVATLTWIDE TLAGLLSLVL IIATAFYLIM
GSPSLVVREQ MLWGMYCVII LQLVFEFYPV VVFGLGLGIA RDHPVLQLGG WIGGPFAVFF
LLFYSITDSI DWLELVAFIP LSVLIACGMV TAGNQLTRYR PFLLFYLRRL WRSLCLKIRP
QIRQQSRS
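
The relationship between the gene sequence and the protein sequence can be checked by translating in frame. The sketch below is illustrative only: the record leaves the "Translation table" field empty, so the standard genetic code is an assumption here, and the names `CODON_TABLE`, `clean`, and `translate` are hypothetical. The excerpt used in the example is the stretch of the gene sequence beginning at the `ATG` in its second row.

```python
from itertools import product

# Standard genetic code in TCAG order (assumed; the record does not
# state which translation table applies).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used for 10-base grouping and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate from the first base, stopping at the first stop codon.

    Codons not in the table (for example, containing N) become 'X'.
    """
    bases = clean(seq)
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE.get(bases[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpt copied from the gene sequence above, starting at its first ATG.
excerpt = "ATGA ACAGGTCCAC AGCC"
start_of_protein = translate(excerpt)
```

With this excerpt, `start_of_protein` is `MNRSTA`, which agrees with the first six residues of the protein sequence listed above.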