Gene PHATRDRAFT_47502 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47502 
Symbol 
ID7202279 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp819479 
End bp821171 
Gene Length1693 bp 
Protein Length534 aa 
Translation table 
GC content59% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181811 
Protein GI219122976 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.839927 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGGTCGTTCC GTGAAAAGAC CCATGGGAAT GTCGGAATGG GGGATCTCTC CTTGTCATCC 
ACATTCACAA CAACAACGAC GACGACGTGG TGGCATTCCA TCCGTACGGC ATTCGGACAC
CGTCCGTTCC CTGCCCAACC CCGGTGGGTG ACGGAAGCGG CGGCGAACGC TACCCGGGCC
GCCGCTCCCT GGGGCCAGCT CTGGCAACCA GACACGGCCC CTCGCGCCGT CCGAGCCGGT
CGGGTGCGCA CCGTCGAGGA CCTCGACCAC AGTAGTAGAT ATGACAACAA GGACGACCGT
ACTAGTGACC TCGACGGTAG CACAAATGAC GACAACAACG ACGATACCAA CAACGACGAC
AACAGTCTGC GACTCTGTGT CCGCAATCAT TTGTCCATAC CCCTCGTCTG GTGTTGGATG
GATGCCCAGG GTCGTCCCCA TCACTTTCGG AAACTCTACC CAATGGCACA CGCCGACACC
GACACCGACA CTGTTACCCG GGATGATCAC GTCGAACAGA CCTACACGGG ACACGCCTTT
GTCCTCGCCA CGGAACCATC CCCCGCCCAT TCCTCCGACA CCATTCCGTC CCTCGATCCC
TCGACCATAC TCGGGGCCTA TAGACCCCAT CGACGACGAG ACCACACCGT TCACATATTG
GAAGTCAAGT ACGTACCGAC AACAGACAAC AATGTCAGCA ACAACAACAA CGTCGTCGAG
GGGATGGAAC TGTTGTCGAC CCGGAGCACG CTCCCACGTC CGACACGGAT CCCACCGAAT
CCACGCCTTT GTACCCACCG ATATTGACAA CGACGACGAC GACAACAACC AGGACGACGA
CAACCCACTC GACGTACCCC GGTCTGTACA ATTACACTTG CACATCGGCC GTCTCGATCC
CACACCACTG GACACGGTCC GTACAAACAA ATACGTCCAG AGTACCCTCG CGGGCTGGCC
CGTCCGTTTC CAGTCCAATT GGGACGGCGG TGACCCGACC CTCCGGGCCC GTCTCGAACA
GGATTTAACA CACGCGGTAC ACTGCCTACC GCCCCACGCC GTGCGAACCC TGCGACGCAC
CACCCCACTC TGGATCAATC GGGACTTTCG TTACGGACCC GCCGCCTCTC CCGTACGCGC
CCGGGGACTC TGTTTCCACC CCTACGCCGA CTGGTTGGCC CACAACCAAT GCCATCCCGC
CAAGCAACAC GGTGTCGAGC TCTACGACGC CGACGAATAC CGAAAGGATG CGGCCCTATG
GGGAACCGGT GGTGTCCTCT TGCACGAATT CTGTCACGCC TACCACTGTC TGTGCGTCCC
ACAGGGCTAC GACAATGCCG AAATTCAGGA ATGCTACCGA CTGGCCCTGG AAGAGGGACT
GTACGAGAGC GTGCCCGTGC ACGGCCCGCA GGGACCGCAC GCACGGGCCT ACGCCTGTAC
CAATGCCATG GAATACTGGG CCGAATTGTC CACGGCCTTT TTGGGTGGCG TCGACACCGA
TACGGAATAC AACAAGTGGT TTCCGTTCCA CCGGAAGCAA CTCCGACAGC ACGATCCGAG
AGCCTACCAA TTGTTGCAAC GATTATGGAA GGTTCCATGC GATAACGATA ACGATAACGA
TACCGACGAC GACCGTATCG ACAACGAGTC AAAGACTGGA GATGTGCTGT CGAGCGGCAT
CCCCACGTAC TAG
 
Protein sequence
MGDLSLSSTF TTTTTTTWWH SIRTAFGHRP FPAQPRWVTE AAANATRAAA PWGQLWQPDT 
APRAVRAGRV RTVEDLDHSS RYDNKDDRTS DLDGSTNDDN NDDTNNDDNS LRLCVRNHLS
IPLVWCWMDA QGRPHHFRKL YPMAHADTDT DTVTRDDHVE QTYTGHAFVL ATEPSPAHSS
DTIPSLDPST ILGAYRPHRR RDHTVHILEV KGWNCCRPGA RSHVRHGSHR IHAFVPTDID
NDDDDNNQDD DNPLDVPRSV QLHLHIGRLD PTPLDTVRTN KYVQSTLAGW PVRFQSNWDG
GDPTLRARLE QDLTHAVHCL PPHAVRTLRR TTPLWINRDF RYGPAASPVR ARGLCFHPYA
DWLAHNQCHP AKQHGVELYD ADEYRKDAAL WGTGGVLLHE FCHAYHCLCV PQGYDNAEIQ
ECYRLALEEG LYESVPVHGP QGPHARAYAC TNAMEYWAEL STAFLGGVDT DTEYNKWFPF
HRKQLRQHDP RAYQLLQRLW KVPCDNDNDN DTDDDRIDNE SKTGDVLSSG IPTY