Gene PHATRDRAFT_34011 details

Gene Information

Locus tag: PHATRDRAFT_34011
Symbol:
ID: 7197796
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011672
Strand:
Start bp: 872477
End bp: 873730
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002178594
Protein GI: 219115597
COG category:
COG ID:
TIGRFAM ID:
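
The coordinate and length fields in this record are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 872477
end_bp = 873730

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp;", protein_length, "aa")
```

Under these assumptions the result agrees with the listed Gene Length and Protein Length.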


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGAAAC AAATGGTAGA CCAACAACAT CGGCGGTCGC ATATGTTTGT ATTACGGTCA 
TCGATGCACT TTGTACTGAG GGCCTTTCTG GTGATATGGG CGCTCCTCTC CGCAACTACG
ACCACTCTGC TGCACTCCCA AAATTCCTTC TCCATTCTTC CCGCGGACGT CGCTTACTAC
ATGGCTGTGT CGGGAAATAA TTCAAATTCA CAACAGGAGG AAAGGAAGGA TGAATACAAG
CTCCACGTGG ATAAGGCCTA TCAAAAATAC AATTTTGAAG TCGATACTCC GACAGCTCCG
GTTTGCTATC CGCTAAGGGC TAAAGATGTC GACTTTACCC TTGTGACGCA ATTATCTGAT
GACCGTCTCG CTATGATGCG ACCGCACTGC AAGCGCTGGG GAAAGCATAC TATTTCTCTG
GCAATTGGAA CCAACGAGAG CCGAGACACC GTCGAGCAGG CATTGTCAAA ATCGGGTTGC
GATACAGCTT TGATCACATT AAGTATTGTG CGCGACTTCG ATTCCGATCA AAAGTACCCT
GTAAATAAAT TGCGGAACGT TGCCATGTCC CAGGTCAGAA CAAGCCACGC AGTCATCATC
GACGCGGATT TCGTTCTCTC GCCGAATCTT TACGAGACCC TTCACTTACA CAATAAAACT
CTGGCCGCTG ACTCTACGAA TGCTTTGGTG ATTCCATCGT TTGAGCTGCG GAAAGCTTGC
CGACGACGAA ACAGGCGCTG TATCACCATG TATTCGGCCA TGGTTCCACG CAACAAGGAC
GGGCTTTTGG AGCTGTACGA CCCGATGACG GAAGACTCTG CTGGATACGG TATCGCCCAA
TTCGACATCA GGGGCAATTA CCACGGTCAC GCGAGTACGC GTTACGCCGA CTGGGCGAGC
CAGCCGGCCG AGCAACTGTT GCCCATTGAG TGTGTGACCT CCGACCGGTA CGAGCCTTAC
CTGGTCGTCC GTCATTGCCG AGACCTTCCG CCATTTCAAG AAGCCTTTGT TGGGTATGGC
CAGAATAAAT TGACTTGGAT GCAACAAGTC CGCCGGAGGG GCTACAAGCT GTTTCAAGTG
GGTGAAGTAT TTGCGGTTCA TCTGCCCCAC AGCAAGTCCC CGGCGTTTAA ACAGTGGCAT
ATGGTTGGCA AAGCAAACCG TAGCTTGCTG GCCGTGACGA CAATTGCGGA CGCATTCGGA
CTCTGGATGA ACGAAACCGT GCCAGATTTC TCACAAGTTC CGTATTGCTC ATAG
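
The GC content field of this record can be recomputed from a sequence laid out in 10-base blocks like the one above. The helper below is a hedged sketch: `gc_content` is a hypothetical name, and it assumes GC content means the fraction of G and C bases over all bases, with whitespace ignored. How the database itself rounds the value is not stated in the record.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence, used only as a demonstration input;
# paste the full sequence to compare against the GC content field.
first_row = "ATGGAGAAAC AAATGGTAGA CCAACAACAT CGGCGGTCGC ATATGTTTGT ATTACGGTCA"
print(f"{gc_content(first_row):.0%}")
```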
 
Protein sequence
MEKQMVDQQH RRSHMFVLRS SMHFVLRAFL VIWALLSATT TTLLHSQNSF SILPADVAYY 
MAVSGNNSNS QQEERKDEYK LHVDKAYQKY NFEVDTPTAP VCYPLRAKDV DFTLVTQLSD
DRLAMMRPHC KRWGKHTISL AIGTNESRDT VEQALSKSGC DTALITLSIV RDFDSDQKYP
VNKLRNVAMS QVRTSHAVII DADFVLSPNL YETLHLHNKT LAADSTNALV IPSFELRKAC
RRRNRRCITM YSAMVPRNKD GLLELYDPMT EDSAGYGIAQ FDIRGNYHGH ASTRYADWAS
QPAEQLLPIE CVTSDRYEPY LVVRHCRDLP PFQEAFVGYG QNKLTWMQQV RRRGYKLFQV
GEVFAVHLPH SKSPAFKQWH MVGKANRSLL AVTTIADAFG LWMNETVPDF SQVPYCS
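
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below assumes the standard genetic code, which is an assumption because the Translation table field of this record is empty; `translate` and `CODON_TABLE` are hypothetical names introduced for illustration.

```python
from itertools import product

# Assumption: standard genetic code, written in the usual TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA from its first base, stopping at the first stop codon."""
    sequence = "".join(dna.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(sequence) - 2, 3):
        amino_acid = CODON_TABLE[sequence[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two 10-base blocks of the gene sequence.
print(translate("ATGGAGAAAC AAATGGTAGA"))  # MEKQMV, the start of the protein
```

Passing the full gene sequence to `translate` allows a direct comparison with the protein sequence listed above, again only under the standard-code assumption.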