Gene PHATRDRAFT_35009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_35009 
Symbol 
ID7199970 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp862187 
End bp863446 
Gene Length1260 bp 
Protein Length385 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179308 
Protein GI219117027 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0547502 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATCGTC GCTTCCGTTT ACTGGGTGAA TTGGCCGTCA TGGCTATCTT GGTCAGTGTA 
TGGCGCGGTC ACAAACATTT GTTTCTTCAG TACGACGAAA AGTCTTTCGA GAGCGCCTGG
TTTCCCCCAC TGGTCCCAGC GGAGCCGATT CCCAACGCGA CCGAGTCTCC TTTGGAGGAA
TTCCAGACGC TTGCCACCAA TTGTCGGGGC AAGGAAAGAC TGCTTCGTAT CGCCGTGGCG
ACGGGTATGA ATCGATCCGC AGCGGAGGCG CGGTGCGACG ATTTGCCCTC GTGGAAGGAC
GTGGTGGACT TGTACGGATC GCAACCCGTC ATTCTTGGAC TCGAACACTG TGCCGCCTAC
CGAGCCGCTG TGCGTGACCA CGTCAATCGG ACCCGTGATG CGTATCCCAA ATTTGGAGGG
CTGCAGATTG ACGGAATGTT TAACGTGGGA ACCAACGCTT TGGCACAAAC CGTTCTACAC
AATCTGGATC GAGGACAGTG GGTACAACCG GATTTGTCTT TGGATGATCC GGAATATTGG
GAAAAGATTG ATACCTACAT GCACGGCGTA GGATGGGGGA AACACACGTT GGTCAAGTAC
CGACCACACA ACGCCATATT ATCCCTCCCC GTCGTGATAG TGCGAGATCC GTACCGGTGG
ATGAAGTCCA TGGTACGAGC CAACACACGC ACTTGCGGTG TTTCTTACTG GTGTGCGTGA
AACCCTCCTT CTCAACCTTA CCCGCGTCTT CTGTTGTGTA CCTGTGTCAT TCAGTGCAAA
GCAATGTATC GGGCACACTG GATTCGGCCA CCCAATCACT GTCCCAACCT CGTGTTGACG
CCACAGGAGA AAGTGGAGTA TCCCAATCAC ACCACCTTTG CCGTTACAGT GGAGCAGAAC
GCTCGCAATC CTCACGTGCG GGATAACTTT GACTCCCTGG CGGATTACTG GACGGCGTGG
TACCAAAGCT ACTGGGATGC CAATATCCCC CGACTCGTGA TACGTTTCGA AGACATGCTC
TTCCATGCCG ATGCGGTCGT CCAGGCACTC TCTGAATGCA CTGGTTCCGA GCGAGTGGAA
CCCTTTCAAT ACTACACCCA GCCGGCCAAG GTTCACGGTG AATCGTCCGA TTTTTTGACG
GCACTCGCCA AGACGGGAAC CGAAAAAGGA CGCTACAGCG GTATGACTGT CGATGATAGG
GCGTATGCCG CCAAAGCTCT CAATGCCGAG TTGATGCAGA AGTTTGGGTA CCGACACTAG
 
Protein sequence
MHRRFRLLGE LAVMAILVSV WRGHKHLFLQ YDEKSFESAW FPPLVPAEPI PNATESPLEE 
FQTLATNCRG KERLLRIAVA TGMNRSAAEA RCDDLPSWKD VVDLYGSQPV ILGLEHCAAY
RAAVRDHVNR TRDAYPKFGG LQIDGMFNVG TNALAQTVLH NLDRGQWVQP DLSLDDPEYW
EKIDTYMHGV GWGKHTLVKY RPHNAILSLP VVIVRDPYRW MKSMCKAMYR AHWIRPPNHC
PNLVLTPQEK VEYPNHTTFA VTVEQNARNP HVRDNFDSLA DYWTAWYQSY WDANIPRLVI
RFEDMLFHAD AVVQALSECT GSERVEPFQY YTQPAKVHGE SSDFLTALAK TGTEKGRYSG
MTVDDRAYAA KALNAELMQK FGYRH