Gene PHATRDRAFT_35627 details

Gene Information

Locus tag: PHATRDRAFT_35627
Symbol:
ID: 7200940
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011676
Strand:
Start bp: 391756
End bp: 392906
Gene Length: 1151 bp
Protein Length: 345 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002180034
Protein GI: 219118527
COG category:
COG ID:
TIGRFAM ID:
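
The gene length listed above agrees with the start and end coordinates if they are read as 1-based and inclusive (an assumption about the coordinate convention, not something the record states). A minimal Python sketch of that check follows; the function name `span_length` is hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates and protein length taken from the record above.
gene_length = span_length(391756, 392906)
coding_bases = 345 * 3  # three bases per amino acid, stop codon not counted

print(gene_length)                 # 1151
print(gene_length - coding_bases)  # 116
```

The 116 bp not accounted for by the 345 codons would be non-coding sequence (for example intronic), which fits the record marking the gene as spliced. The record itself does not give exon boundaries.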


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0500584
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGGAAC GGGACAACGA AGACGAAGCC ACTCGTCTGC AACAACAAGC CGCCAAACTT 
CGTGAACAAA TACGTCAAAT GGAAGCTAAC TTGGGCGACC AGCGTCCGCG CAATTACGAA
AGGCCTCCGC CACCGCAGTC ACAGCCCGAC CCCACCGATA CCAATCCATC CCTCAAAGGC
AAACGGGTTT TGGTGACGGG CGCCAACGGA CGTCTCGGCA GTATGGTGTG CCGCTATTTG
TTACGGAACC ATCCACAAAC CGAAGTGGTT GCTGCCGTGC ATGTCGTGGG AGAAAACAGT
TCCACCAGTC GTGGTTATGG ACGATTGTCT TACGAAGTCG GAGCCGAAGA TGGGGTGGGG
CGGATTGGGC CAGCCTGGTC CTCCGAAGAC CGGACGGCAA CGTTTGAATG GGACATTTCC
ATGAAAGATT ACAATCTGCA AAATCTACGT CTCGTCGAAG TGGAATTACT AGATCCGGTA
CAGTGTCGGA CTGTGGCGGA AGGTTGTGAT GCCGTCATTT GGTGGTACGT CTGCAAGGAT
GTTTCTCTCG CGTTTGCTGC TTCCACAAAA CAAAGTACAG CCTCTCACCC ATAGTGCGTA
CCGGCTTTCT TCTAATCGAC TTTTTTGACT CCATACACAG CGCCACGGAT TTCAACGGCA
ATCGTCCGCG AGCAATTTCC GGATTGAACG TGGCTTTTCT TTTCCGTGCG GTGGCATCCC
CCACCAAAGG ACGGGTCGAA GTGGAAGGAT TGGAGAATAT GCTGGGGGCC CTCAAAACCG
CCAGACAAGA CAAGCAGCGA GCCACCGGAC GGGTACCGAC GAACGATCCC GTAAACGTTG
TGTTGGTATC CACGGCTCCG GACGCCTACG ACGATTTTGA AACGCCGTTC GGTTCTTTTC
GAGGTATAAA GCGCCAGGGG GAACAAATGC TGCAAAGTGA CTTTCCCAGT TTGAGCCACA
CCATATTACA ATTGAGCCGA TTCGAAGACA ATTTTGTAGA GGAAGGTTTG GATGTTTCCA
CGGAGCCGTC CCGGGCGAAC GATATGGAGG CTCCGGGCGA TGCGGACAAG GCCCGGCGGC
GCATTAACCG AAGAGATGCT GCCAAGGTAG CGGTAGATGC ACTTCTGGAC GAAGAGCTTA
AGGACAAGAC C
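
A GC content figure such as the one in the Gene Information block can be computed from a sequence listing like the one above. The sketch below is one straightforward way to do it, not necessarily the method the database used. The function name `gc_percent` is hypothetical, and the input is only the first 10-base block of the gene sequence.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among all non-whitespace characters.

    Whitespace (the 10-base blocks and line breaks used in the listing)
    is removed first, and case is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence, used as a small input example.
print(round(gc_percent("ATGGCGGAAC"), 1))  # 60.0
```

Passing the full gene sequence as a single string, spaces and line breaks included, gives the value for the whole gene.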
 
Protein sequence
MAERDNEDEA TRLQQQAAKL REQIRQMEAN LGDQRPRNYE RPPPPQSQPD PTDTNPSLKG 
KRVLVTGANG RLGSMVCRYL LRNHPQTEVV AAVHVVGENS STSRGYGRLS YEVGAEDGVG
RIGPAWSSED RTATFEWDIS MKDYNLQNLR LVEVELLDPV QCRTVAEGCD AVIWCATDFN
GNRPRAISGL NVAFLFRAVA SPTKGRVEVE GLENMLGALK TARQDKQRAT GRVPTNDPVN
VVLVSTAPDA YDDFETPFGS FRGIKRQGEQ MLQSDFPSLS HTILQLSRFE DNFVEEGLDV
STEPSRANDM EAPGDADKAR RRINRRDAAK VAVDALLDEE LKDKT