Gene PHATRDRAFT_13606 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_13606 
Symbol 
ID7202097 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp133017 
End bp134171 
Gene Length1155 bp 
Protein Length338 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181142 
Protein GI219121581 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.336999 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAAC AGAACAATCA GCAAGTGAGT CGTTTGCAAC AACTGGTGCC GACAATCGGT 
ATGTTTCACA CCCACCTACC TCTCCGTCGG GCGTTTGAAG TATATAACGA AAAGCACAAA
TTGACGAAAA GACAGCATAT CCAGATTTCA TTCAATGAGA TTCGTCATAT TTTGAACCTA
GCTCAGATTA TGGCTCTCCG CAAAAATGTC TCTGGAGAAA GGCGGCATCC TTTGGATATC
TTTGCGCCGG AGCTAGAGCC AAATTCTTTT GTCAGCTGCA GAAAGTCTGA CGATGTCGAG
ATCGCAGACG GAGATTCGAC CGTGGACAAG GATATTCCTC CGGGTCAGCT CAATGGTCCT
CGTTTGATAA CTTTCGATGG AGATCAGACT CTTTATGCTG ACGGCGCAAA CTTTGACAGT
AATCCTCGAC TCGCCAATTA CCTGTATTTG CTACTTCGCC ATGGAGTATC TGTTGCTGTT
GTTACTGCCG CCGGATACGA GTACAATGTC GAAAAGTATG AATATCGTCT TTCGGGACTT
CTGCATTTTT TCAGACAACG AGGGCTTTCA AATGCCGAAT GTGCGCGATT CTACTTGTTT
GGAGGAGAGT GCAACTACTT ATTTCAGCTA GGACATGGGT ATAGACTGCA GCCTGTGAAG
GAATATGGGC CGGGTGGGTG GATTACGTCT ACTTCATTCA TCAAAGAAAG CCCCGGGAAC
TGGTCCGAAG CCCATATCAA CACGGTTTTG GACTTGGCTG AATCCAACGC CAACGAGACC
TTGAAGGAGC TGAACCTTCG AGGACGCATT GTCCGCAAAC GGCGATCAGT TGGGCTGTGT
CCGAATCATG GACAAGAAAT ACCTAGAGAG AGTCTAGACG AACTAGTTCT CCGCTCTCAC
GAGAAGCTCA ACCGTATGAA TGAAGGTACT GGCCCTGGAA TACCTTACTG CGCCTTCAAT
GGTGGAACTG ATGCTTGGGT CGATGTTGGC AATAAAAGGG TTGGAGTTCA GGTTCTGCAA
TCCTATCTTG GAATTCCAGT GCAAGAAACA CTGCACATTG GTGATCAATT TCTGAACACC
GGCAACGACT ACGCAGCGCG CGACGTCAGC TGCTGTGTAT GGATAATTAG TCCCCAGGAA
ACTACGTATA TTCTT
 
Protein sequence
MSEQNNQQVS RLQQLVPTIG MFHTHLPLRR AFEVYNEKHK LTKRQHIQIS FNEIRHILNL 
AQIMALRKNL NGPRLITFDG DQTLYADGAN FDSNPRLANY LYLLLRHGVS VAVVTAAGYE
YNVEKYEYRL SGLLHFFRQR GLSNAECARF YLFGGECNYL FQLGHGYRLQ PVKEYGPGGW
ITSTSFIKES PGNWSEAHIN TVLDLAESNA NETLKELNLR GRIVRKRRSV GLCPNHGQEI
PRESLDELVL RSHEKLNRMN EGTGPGIPYC AFNGGTDAWV DVGNKRVGVQ VLQSYLGIPV
QETLHIGDQF LNTGNDYAAR DVSCCVWIIS PQETTYIL