Gene PHATRDRAFT_35958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_35958 
Symbol 
ID7201427 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011677 
Strand
Start bp193868 
End bp195240 
Gene Length1373 bp 
Protein Length423 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180590 
Protein GI219119671 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGCAGA TCCTGTCCTT ACCATTCAAG CTGTGCAGGC ATACCGCGAC CTTCTATCGC 
GGCTTTGTGC ATTACTGGAT TGGACAGGGC CGAAATTCTC CTTACCAAAC GCCCGAGCAG
TGCACCTTTG CGCCCCTGCG CGAAACACCG ACGGACTCGC CGACGCAAAA GCTCTTTAAG
CAGCATGCTC GAGTACATCT TTACAGCCTC GCGTCCAACT TTTACCTCTA TCACAAACCG
CACTATCGAA AAGGCTCGTA TCGGGATGAT TTGATCGACA ATCTACGCAA CGTGGCCATT
CCAGGCACGG GCATCCCCTT GTCACTCATG GCCTCGACCC GTCTTACAGC CCTGGGTTTC
TTGTTCTCGG CTTATCCTAC GGTCAGTCTG GTTGCCGCTG TGCATCAGTG GATAAAAACT
CGTGGGAAGA CTAGCATTTC AGAGGAATAC GCCACCCGCT TGTTGGCCCC GAATGATTGG
TTCTCCTACT GGCGCTTGAA TTGCAACATT GTTGGTCTCC ATTCCGTTCT CAACGATATG
CCGGTCGATT ACGAAATGGA AAACAAATGG ACCTTTCTAG AAAATGGTAA AAAGCGGGGT
GTTCCTATTT CACCCTACCT AACCACACCC GGTATTGTCG TCAAGCATCG CAACGAAGAA
GGCGGGCTCG GCATTCATTT CTATAGAAAT GCCGTCGACG GTGGGGACTG GATTATTCAG
GAGCGCATCC AAAATTCCGA CTGGGTGCAG TCGATGCTTC CCGCCAAGGC GCCGTTGTCC
ACTTTTCGTG TCATCACTTG CAGTGCAGCG TATAATGTAT CCGAAGCACC TAACGTACGT
CGGGATTTTG GAATGAACAA ATTTCGATGG CGCTGAGCTC GCCCGATATT TTCACTTTCA
CGCGTTCTCA ATTTTGTTTA TTCTTGCGTT TGCAGCGCGC TGACGTGAAA GCTCTTTCCT
GCGTATTCCG TGCAGGCCGA GCTGGTGCCG CCACCGATCA CGATTCCATC TTGTTTGACG
TCGATGTCAA AACTGGAACC ATAAAGGGAG GGACGACCAA CGCGCACTGG TACCGACTCG
GTTTGCACGA AGCCCTACCG GGACGTTGTC CTTGGCGATC ACACCATGAT TACAGCCTTC
ACCCGGATGG TGACATTCCC GTGACGGGCA ACCAAGTTCC TGATATTGCC CAAATGCTCC
AGTTGGTTGA GCAATCCCAT TTCGACATGT GCCCACGGGT ACCCATGGCT GGCTGGGATG
TCGTCTTTTC GGCTGATCCC GAGGTACCAA TTTGCCTGCT CGAGGTTAAC TTGAGTTGTA
ATTTTTTTCG GGGCTCGTTC GATCAAAAGG TATTGTCTCG GTTTTATGTT TGA
 
Protein sequence
MGQILSLPFK LCRHTATFYR GFVHYWIGQG RNSPYQTPEQ CTFAPLRETP TDSPTQKLFK 
QHARVHLYSL ASNFYLYHKP HYRKGSYRDD LIDNLRNVAI PGTGIPLSLM ASTRLTALGF
LFSAYPTVSL VAAVHQWIKT RGKTSISEEY ATRLLAPNDW FSYWRLNCNI VGLHSVLNDM
PVDYEMENKW TFLENGKKRG VPISPYLTTP GIVVKHRNEE GGLGIHFYRN AVDGGDWIIQ
ERIQNSDWVQ SMLPAKAPLS TFRVITCSAA YNVSEAPNRA DVKALSCVFR AGRAGAATDH
DSILFDVDVK TGTIKGGTTN AHWYRLGLHE ALPGRCPWRS HHDYSLHPDG DIPVTGNQVP
DIAQMLQLVE QSHFDMCPRV PMAGWDVVFS ADPEVPICLL EVNLSCNFFR GSFDQKVLSR
FYV