Gene PHATRDRAFT_40473 details


Gene Information

Locus tag: PHATRDRAFT_40473
Symbol:
ID: 7198180
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011691
Strand:
Start bp: 510742
End bp: 512557
Gene Length: 1816 bp
Protein Length: 565 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002184472
Protein GI: 219128546
COG category:
COG ID:
TIGRFAM ID:
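
The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is only an illustration and not part of the original record: the variable names are hypothetical, and the coding-length estimate assumes that the listed gene span runs from the start codon through the stop codon.

```python
# Values copied from the Gene Information table.
start_bp = 510742
end_bp = 512557
protein_length_aa = 565

# Coordinates are inclusive, so both end positions count toward the length.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the span covers start codon through stop codon, so the
# coding portion is one codon per residue plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp)  # 1816
print(coding_bp)       # 1698
print(non_coding_bp)   # 118
```

Under that assumption, the 118 bp difference between the gene span and the coding length is consistent with the "Is gene spliced: Yes" field.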


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACATG GAGACATTCG AAAAGTTCTC GCTTCGGCTT CCTTCTGTAA GCAGAATCCC 
ACGAGCTCAC TACAGTCCAA CATGCTCGAG TACAGTATTT CCCGGCACGC CGTTACTGGG
ACAACATCCT CCCTCATTGA CAGAGGTGCA AACGGTGGAC TCGCTGGGAA TGATGTTAAA
ATCCTGAACA AGACAGGTCG TTTTGCTAGC ATCACTGGTA TCAATGACCA TACCCTGCCT
GATTTAGATA TCGTCACCGC TGCTGGACTT GTTGAATCCC AGAACGGACC TATCATTGTC
ATACTACACC AGTATGCACA CCATGGGAAA GGTAAAACGA TTCATTCTAG TGCGCAACTT
GGATACTACA AGAACGTTGT CGAAGACCGT TCTCGGGTCC TAGGGGGTAA ACAGCGTATC
GTAACTCTAG ACAACTACGT TATTCCTCTT CACATTCGCC AAGGACTGGC TTATATGGAC
ATGCGCCCAC CTTTGGATAC CGAATTTGAC ACACTTCCGC ATGTTGTTCT TACTTCCGAT
GTGGACTGGG ATCCATCTAT CATTGACAAT GAAATTGATC TTGTCACGGA CTGGCATGAT
GCCGTCCAGG ACCTTCCCGG CGATCTGTAC GTTGAACCTC GCTTCAATTC AACCGGGGAA
TACCGACATA GGCACGTTGC CAATTATGAC ACGAATTGGT CGATCCATCC ACGGCTATTG
GCAATATACT CTCGTCAAAC AAGCATGATA TGAGCCGCAA TGCCCACAAT TACGAAGCTT
TGCGCCCTTG TCTTGGTTGG ATCTCTTCCG ACACAGTTCG GAAGACCATC TTGGCCACCA
CACAGTTTGC TCGCGAAGTT TATCATGCAC CTATGCGTAA GCACTTCAAG TCTCATTTTC
CGGCACTTAA TGTTCATCGG CGCAATGAAG CTGTCGCTAT CGATACCATT TGGTCGGACA
CGCCTGCTGT TGACAATGGC GCTAAATTTG CACAACTATT TGTTGGTAGA CGGTCGCTTG
TCACCGACAT TTATCCTATG AAAACAGACA AAGAGTTTGT CAATGCTCTT GAAGACAATA
TTCGTCATTG TGGCGCCATG GATAAGCTCA TTAGTGATCG TGCCAAGGCC GAAGTCAGCA
AGAAGGTTTC TGATATTACC CGTGCTTACC ACATTGATCA ATGGCAAAGC GAGCCCAATC
ACCAGCACCA AAATTATGCT GAACGCCGCA TTGCAACTGT CGAAGCAAAT GCGAATAATA
TCCTAAACAA AACCGGTGCA CCCAATTCTA CATGGTTATT GTGTGTTTCC TACATTTGTT
ATTTGTTCAA TCATTTGGCA CATGAGTCTT TACACGATCG TACTCCCCTT GAAGTCCTCA
ACGGTAGTAC CCCTGATATT AGCGTACTCC TTCAATTCCA TTTCTGGGAA CCGATCTACT
ACCGACTTGA AGACCCTACT TTTCCTTCCG ACGGAACTGA AAAAAGGGGC CACTTTGTTG
GAATTGCTGA TTCCGTTGGT GATGCTCTTA CCTACAAGGT ACTCACCAAC GACTCCCACA
AGATCCTTCT CCGATCTAGT GTTCGCTCTG CGTTGAAACC TAGTGAAACC AATTTGCGTC
TTGAGCCACA TGAAGGGGAG AGTCCTCCTA AGCCCATCAA CTTCACTAAG TCGCGCAGAA
CTGAGGACGG AAATTCTTAT GCCATCCACA CGCTACCTGG TTTCACCCCG GACGATCTCA
TCGGACGCAC CTTTTTAACC GATACCCAGG ACAATGGGGA GCGTTTTCGT GCACGTATTG
CCAGGAAAAT TCTTGA
 
Protein sequence
MAHGDIRKVL ASASFCKQNP TSSLQSNMLE YSISRHAVTG TTSSLIDRGA NGGLAGNDVK 
ILNKTGRFAS ITGINDHTLP DLDIVTAAGL VESQNGPIIV ILHQYAHHGK GKTIHSSAQL
GYYKNVVEDR SRVLGGKQRI VTLDNYVIPL HIRQGLAYMD MRPPLDTEFD TLPHVVLTSD
VDWDPSIIDN EIDLVTDWHD AVQDLPGDLY VEPRFNSTGE YRHRHHDMSR NAHNYEALRP
CLGWISSDTV RKTILATTQF AREVYHAPMR KHFKSHFPAL NVHRRNEAVA IDTIWSDTPA
VDNGAKFAQL FVGRRSLVTD IYPMKTDKEF VNALEDNIRH CGAMDKLISD RAKAEVSKKV
SDITRAYHID QWQSEPNHQH QNYAERRIAT VEANANNILN KTGAPNSTWL LCVSYICYLF
NHLAHESLHD RTPLEVLNGS TPDISVLLQF HFWEPIYYRL EDPTFPSDGT EKRGHFVGIA
DSVGDALTYK VLTNDSHKIL LRSSVRSALK PSETNLRLEP HEGESPPKPI NFTKSRRTED
GNSYAIHTLP GQWGAFSCTY CQENS
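
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. A minimal sketch of how that might be done follows, with a simple GC-percentage helper; the function names are hypothetical, and the example input is only the first line of the gene sequence, not the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of ten).
first_line = (
    "ATGGCACATG GAGACATTCG AAAAGTTCTC GCTTCGGCTT CCTTCTGTAA GCAGAATCCC"
)
sequence = clean_sequence(first_line)
print(len(sequence))  # 60
print(round(gc_percent(sequence), 1))
```

Running the same helpers over the complete gene sequence would let the listed Gene Length and GC content fields be compared against the sequence itself.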