Gene PHATRDRAFT_37272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37272 
Symbol 
ID7202043 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp583442 
End bp584723 
Gene Length1282 bp 
Protein Length351 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181403 
Protein GI219122125 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAATC GAGTCGATAA TGGCACTACC ACTAAACCAC CCCCAAAGGC ACCCCGCTCC 
GCCTTTATGT GTTTCACCGA CAACAAAAAA GAAGCCCTTA TGGAACAGCA TCAAGTGAAG
GAAAATGCAG ATGGTAGGTA ACAAACAGGA AATTTGCGTC TATACCAGAA TCAAGAAGCT
TCGTTGGCCG GAAAATCCCT TCTTTTCTAA CAAAGAACTC ATATGTAGTT TTGAAGCTCG
TGGCGACTGC TTGGAAGAAA CTCAGCGGCC GCGAACGAGC GTATTGGGAC GAAGAAGCTA
GAAGCGACAA GCTGAGGTAA GGAATTCGCG ACCTGACTGT AGCAGATGTC GAAATCGCTT
TCTGATTTTG TTGAGATTGT TCCTTGTATT TGCGTTTACA GGTTTGTCCG GGAAAAGGCG
GAATACAAGG GTGTTTGGAC TATTCCCAAA CGTCGGGCCA AAAAGCATCC CCTGGCGCCC
AAGAGGCCCA TGTCAGCCTT CCTCAAATAC TCGCAAACCC GTCGGGCTAA GGTCAAGGAA
GAGAATCCCG ATATGAGGCA AGTCACCGCC GGAGAGGGCA GGATCCGTCT ATCAAGATTG
CTGGAATTAA CCTCACAACC TTCCTCTTTA CAGCAACACG GACGTGTCCC GACTCCTGGG
AGAAATGTGG CGTAATGCAA GTAAAACAGA ACGAGCCCCG TACGTAGAAG TTGAAGAAGA
GGAACGGGCA CAGTACAAAG AAGAGGTAAA GCGGTGGCGC CAGAGTCAGG CACGGATGGA
TGCCGATACA AGAACCAGTC ACGATGCAGT CTTGACCTGC AGCAACATCG GTGACTTTCC
TGCTCCGATG ACTCCAGTGC CGTCCTATTT TGAAGATCCT CAAGCTTATC ATAATTTTGA
ACCGCTTCGA ATTCAATCGG TCGATGATGC TATAAACAAG GCGGATCAGC GGATGTCCAG
CAGCCGTCAT CATTCGCCTA CGTTAGCTGT CACTCAGTCT AGTTCTACGG GAGGAGACAG
ACCCTTGTCG AGGAATGAAA CGTGGCGAGA TTCTTCGGAG CAGTCTCCGA TCCACCGACA
AGACCAGCAC ATTTATGGGC AGTCGTTTCG TCCAGCTCTC CCTGTGCAGA AATCGGGGGC
ACGCACTCCG TTCCGCCCAA GCAATAGAGA AGAAACATTG ATGACGAAGC GCGACTTCAA
GATACCGAGT CAAGGGGGAT TTCGGGCGTT TGGGAACAAT TATCAACAAC CGTTCCGCCC
CTTGTATGAT CATGGTGAGT AG
 
Protein sequence
MENRVDNGTT TKPPPKAPRS AFMCFTDNKK EALMEQHQVK ENADVLKLVA TAWKKLSGRE 
RAYWDEEARS DKLRFVREKA EYKGVWTIPK RRAKKHPLAP KRPMSAFLKY SQTRRAKVKE
ENPDMRQVTA GEGRIRLSRL LELTSQPSSL QQHGQRAPYV EVEEEERAQY KEEVKRWRQS
QARMDADTRT SHDAVLTCSN IGDFPAPMTP VPSYFEDPQA YHNFEPLRIQ SVDDAINKAD
QRMSSSRHHS PTLAVTQSSS TGGDRPLSRN ETWRDSSEQS PIHRQDQHIY GQSFRPALPV
QKSGARTPFR PSNREETLMT KRDFKIPSQG GFRAFGNNYQ QPFRPLYDHG E