Gene PHATRDRAFT_38097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38097 
Symbol 
ID7202952 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp66200 
End bp67650 
Gene Length1451 bp 
Protein Length429 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182314 
Protein GI219124026 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACATCC CCGGAAGGGA CGAGCGCGGA GGCGACGCCG CGCCGGAGAG GGACTACGGC 
GAGTTGGGCT ACGCGTCCGC GGTTCGCTTA CCCGTCCGCC ATACGGATGC GTTTGGACGA
AATCCGTACA GTAATCCCCG GGAACATTCA GTCGTAGCCA CTCGTCCGGA TCCTTCCTCG
ACGACCAAAA AGAAAAAGGC ATGGAAGAAG CCGCCGGTGA GTATCAGGCA AAACTTTGTT
AGCTGGTAGT AGTGAGTCTG TTGATCCTCA GATGGTCCTC CGTTACTATT AATCTAGTGA
TCGCAGATTG GCCTTTGGTC GGTGCCAAAC CCCATAATCC CCTCTCACGC TGCCTTTGCT
CCTTTCTTGG GTGTCAGGGA ATGCCCAAAC GGCCGTTGTC GGCCTACAAT CTCTTTTTCC
GGCGCGAGCG ACAGGAAATA CTGGGGGAAG ACCTTTCCAA GGAGTTCGAG ATTACCGACC
AAAGTAAGCG AAAGCATCGT AAAACGCACG GAAAAATTGG CTTTACCGAT ATGGCGCGAC
AAATCAGTCA AAAGTGGAAA GATTTGGAGG AAGAATTGCG GAGACCCTTT ATCGAACAAG
CCAAGAAGGA GAAGGAAAAA TACATGGTGG CAAAGGATGC TTGGGTCCAG GAGCAGAAGG
TCGCGGTCAA GGCCCGTACC GAAGCTTTGG CAAAGGAAGA AGCGGCGGCG GCCGCCGCCT
CCCGTTGGAC TACCGTAAAT CCCCCGCCCA TGTTGGACAG GAACGCTTGG GATGTATCCT
CGCACACTAC GATCCCCCCG ATAGATCCCC ATCACGGTCG TTTTACAGAG ACCGGAGCGC
GCCAACTCCA TTCCATGAGA TCCGCGGCAA TTCCCATGAA TGCTTCCTTT GAGGCCATGG
GAGGGTCTCG TGGATTTCCG GAAGAAATGC AACGACGCCC CCCGCCTTCA ATGAATCCAG
CGGAAGATGC CTACGGCATG CTCAGCGATC AGGAACGAAT ACGTGAAATG AGGATGCATC
TGGAACGAGC CGCAGCTCTG CAGGAAGAAA TACGTAGAAA CACAGGGGCC AGCAATGCAA
TGATGGATGA ACTGCGGTTA CCAATCCCAC CTCCGCGAGA TGGGCCGGGT CCTCCTGGTG
GTTTGCCAAC GTATGCGCAG GAGTCGCGCC GTTTCAGTTC TCGGGGAGGA TCTTTTGAAG
GATTGGGAGG GAACGATTGG TACGAATACG AGCAGCAACA ACGACAGCAA CAGCAACGTC
GTGCGCAAGA GCGAGCGGAA ATGATGGCTC TACAGCGCCA ACAGCAGCAA CCACGACACC
GACCGGACTT CATGGCTCTG GAACAACGTC GGAGAGCCCT GGAACAATCG ATGGAAATTG
AGCGGAGATT CCAAATGGAA GAAGAAATGC GCCGTCGCGG ACAACGGAGG GGGCCGGGAG
GGAACATGTA A
 
Protein sequence
MNIPGRDERG GDAAPERDYG ELGYASAVRL PVRHTDAFGR NPYSNPREHS VVATRPDPSS 
TTKKKKAWKK PPGMPKRPLS AYNLFFRRER QEILGEDLSK EFEITDQSKR KHRKTHGKIG
FTDMARQISQ KWKDLEEELR RPFIEQAKKE KEKYMVAKDA WVQEQKVAVK ARTEALAKEE
AAAAAASRWT TVNPPPMLDR NAWDVSSHTT IPPIDPHHGR FTETGARQLH SMRSAAIPMN
ASFEAMGGSR GFPEEMQRRP PPSMNPAEDA YGMLSDQERI REMRMHLERA AALQEEIRRN
TGASNAMMDE LRLPIPPPRD GPGPPGGLPT YAQESRRFSS RGGSFEGLGG NDWYEYEQQQ
RQQQQRRAQE RAEMMALQRQ QQQPRHRPDF MALEQRRRAL EQSMEIERRF QMEEEMRRRG
QRRGPGGNM