Gene PHATRDRAFT_44838 details


Gene Information

Locus tag: PHATRDRAFT_44838
Symbol:
ID: 7199555
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011673
Strand:
Start bp: 392159
End bp: 393547
Gene Length: 1389 bp
Protein Length: 406 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002178764
Protein GI: 219115938
COG category:
COG ID:
TIGRFAM ID:
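
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the coding region ends in a single stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 392159
end_bp = 393547
gene_length_bp = 1389
protein_length_aa = 406

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Nucleotides needed to encode the protein, plus one stop codon (assumption).
coding_bp = protein_length_aa * 3 + 3

# Remainder of the gene record that does not encode protein.
noncoding_bp = gene_length_bp - coding_bp

print(span_bp, coding_bp, noncoding_bp)  # 1389 1221 168
```

Under these assumptions the span matches the stated gene length, and because the gene is listed as unspliced, the 168 bp that do not encode protein would be flanking sequence on either side of the open reading frame.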


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000100834
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAACAGATCG TCACACGTTG TTCACAGTCG CGCGAAACGG TTCTAAGATT GCCTTCATAC 
AGGGATTTCG AATATTCGCT CACACAAATG AAGACTTCGT CGCGTGTTCA CATCCGAAAT
CGTGTGGTCG TCGAACCCGA ACTTCATCTC CATACCGTCG ACATCACCGA AGCCATGAGA
TTACGTGGGG ATTCGGGGAC CTCACGTCGA GCAACAACGC CAATGACTTC GGACTTTGAT
CTTGCCGTAA TGCCGCGCGA GAAAATTTTT TCTCGATCTC TTCATTCATG GTCTGCACAT
CATTCTCTGA CGACGAACAA CTGGATTGCT ACCATAGCTC CCGCGTCGAG CCCTTCCGAA
CCGGGCAGTA GCAACAAGGC CCGCTACGTA CAGATTCCTT TTCCTTCGGA GCGGGAAGCT
CGTAGGTTTT GCAAGGCGTA CTCTCCCCCC AGACTCAGTA CCGCTGTTCT CTGTCAGCTT
TGTCAGCTTA CGCCGCAAGC CGCTCGTCAC TGCCGAAATT GCGGGGTCAC CGTCTGTGAC
AGCTGTTCCA CGCGTTGGGG CATTCGGATG GTTCCCAAGA CCTACAACCC ACAGTTGCTA
ACTACTACTG TCCGGGTGTG CAAGTCCTGC GATTGGCTGT CCAACGCGTT CTGCATGGCG
TTACTTCAAG GTCGTTACGA AGATGTCTTA ACAATCGTCG AAACGGGCAA CGTTAATCTC
CGTACCTGCT TTGCGGACAT TCATCAGGAA GCCATGTTTC CTGTCCACTG TGCCATTCTT
GGGGGGTCGC TCGCCACGCT GCAGTGGTTG GTGGAGATGC AAGGATGTCC CTTGTCAGTC
AAGAAGGATC CCAAAACGAA CCGGGCCTTG TCGTTGCAAA CATCCGCTTC CCGCACCTTG
CTCGATCTTG CGATGAAAGG ACGTCCCAAA ATTGACATCC TAGTGTACCT CATACAAAAC
GGATTGAGCA TTAGTGATGT ACATGATCCA TCGCTGGCCC CCAAAACCCT CGAAGTCGTT
CTAAAAGCGG GCTTTCCCAT TCACTCTATC GATACGCTCA TGCCCGATAT GCCTGTCATA
ACCGACGAAT CGGACAAAAA ATTCGATGTC AGCCGCAACA AATCTTTGGT GTACGAGGAA
TCCGTTGCGA CACTTGAAGA CGCTTGCGTC TTGTGCTGCG AACGCTCTAC GGATTGTGTG
CTTATCCCTT GTGGACATCA GATTTGTTGT ACCGATTGCG GTCACCAACT CACTTCTTGT
CCGGTGTGCA AGATAAATTG CAGCGTACTG CGAGTCTATC GTCAGTAGTG GACGTAGGAG
CCATCAGCAA CGCAGCGCAC TGGAAACCGT TTCTGAGTCT AGATTTGCAG TCCGTACTCA
TTTTTAAAT
 
Protein sequence
MKTSSRVHIR NRVVVEPELH LHTVDITEAM RLRGDSGTSR RATTPMTSDF DLAVMPREKI 
FSRSLHSWSA HHSLTTNNWI ATIAPASSPS EPGSSNKARY VQIPFPSERE ARRFCKAYSP
PRLSTAVLCQ LCQLTPQAAR HCRNCGVTVC DSCSTRWGIR MVPKTYNPQL LTTTVRVCKS
CDWLSNAFCM ALLQGRYEDV LTIVETGNVN LRTCFADIHQ EAMFPVHCAI LGGSLATLQW
LVEMQGCPLS VKKDPKTNRA LSLQTSASRT LLDLAMKGRP KIDILVYLIQ NGLSISDVHD
PSLAPKTLEV VLKAGFPIHS IDTLMPDMPV ITDESDKKFD VSRNKSLVYE ESVATLEDAC
VLCCERSTDC VLIPCGHQIC CTDCGHQLTS CPVCKINCSV LRVYRQ
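
The relationship between the gene sequence and the protein sequence can be illustrated by translating short stretches of the gene sequence. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the record leaves the translation table field blank, so that choice is an assumption. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code (assumed), with amino acids listed in NCBI order
# for codons enumerated over the bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1 until the first stop codon or the end."""
    # Drop the spaces and line breaks used to group the sequence in blocks of 10.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        # Codons with ambiguous bases are reported as "X".
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nt of the open reading frame, starting at the ATG in the
# second line of the gene sequence.
print(translate("ATG AAGACTTCGT CGCGTGTTCA CATCCGAAAT"))  # MKTSSRVHIRN

# Last codons of the open reading frame, followed by the TAG stop codon.
print(translate("TATCGTCAGTAGTGGACG"))  # YRQ
```

Both outputs agree with the start (MKTSSRVHIR N...) and the end (...LRVYRQ) of the protein sequence listed above.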