Gene PHATRDRAFT_43674 details

Gene Information

Locus tag: PHATRDRAFT_43674
Symbol:
ID: 7197519
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011670
Strand:
Start bp: 1154388
End bp: 1157426
Gene Length: 3039 bp
Protein Length: 854 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002178088
Protein GI: 219112673
COG category:
COG ID:
TIGRFAM ID:
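
The length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, which is consistent with the reported gene length. The function names are illustrative and not part of the database. The coding-length estimate further assumes one stop codon follows the 854 codons of the protein.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_nucleotides(protein_length_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon (assumption)."""
    return (protein_length_aa + 1) * 3


length_bp = gene_length(1154388, 1157426)   # coordinates from the record
coding_bp = coding_nucleotides(854)         # protein length from the record

print(length_bp)              # 3039, matching the Gene Length field
print(coding_bp)              # 2565
print(length_bp - coding_bp)  # 474 bp of the span not accounted for by codons
```

Under these assumptions the span is longer than the coding portion alone, as expected for a gene flagged as spliced.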


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.300412
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATTCATCTTC CCACGCTTTT CATCTCTGTT AACCGTAAAG AGAGTTTGGG AATAAAGGTA 
AGAGAATGTT TCGATCACTT TTGAGCACAA ATGTCTTCCA ATACGCACGA CCATGTTTAT
GGACGCGTAA TTGTAGGTTT TGAAGAGCGA TAGTACTGGT CTATCACAGT TTGCTTCGTG
CCAAGACTTA ACTTCTCTCA GACAGGCGAC AATAGATCTC TCTGTCGACC GTCATATACG
CTGACAATTT TTGCTTTTTC CTTATCCACA GTCGAAAATA TCTGATTTTA TTGACCATGC
CTGATTTGAG TGACCAAGCC GAAAAGGGCG AGGTTCAGGA AGGCCCGAGC AACAGCTCTG
TGTCTTCCCC GGCCTCAAAC GCGCGCAAAC GATGGTTTAT CGCAGTCGCC ACGCTTTTGG
CCCTGACAGG TGTTATTCTC GCCATTGCCA TCCCAGTCTC GCGCAAAAAC GACCACGAAA
AGGTCTCCAT TGCATCAGAC GAAGACTCGA ACATGGATGG AACCGTACAG CAAACTGCGA
AGGCAACTGC CGACATAAGT GGATACTTCA ACAGCTCATT TTCTCTTTTT GGTGAAGACA
TAACGAACGG ATACATTTCC CCCGATGAAT TTAAATCGGA CCTGCGTAAC GTCGCACGAT
TTTTACTAGA TAGTGTGGTG AAGCGCAATC TCGGCTCTGA GGTTAACAAT GCTGCAGGTG
GGGACCTTGA AGTTGAGCCC GGCGTAGGTG TTGCCGCTGA AGGAAGCAAC TCAGATATGC
GCGCTCCCGA TGTCGGAGAT AACGTGAACG ACTTTGGAAC AAACAACCAG GAAGATAGCG
TCGAAGAAGG CGACATCATC GTATCCAACG GTAGACACGG TATGTGGAAA AGCATGTGCA
GGTTTTGCTA AAGAGAAACA AAGAAAGACA CACTCGCCAC TTTTGCTGAC AAGCCCTTCG
CGTTCCTGGA TTATAGTGTT TGCAGTATAC GGTGATCGCG TCGTGATTTG GGATGCAACC
ACTGGCGACA TGTTGTCTGA TATCAAGATG CCCACCTTTG ATGAATCTCT AAATAGCACG
AAAGGAAGCG CGGCTGCTAC ATCTCGTTCG GATATCGATT TTTTCTACAG CGGGCCGTTC
ATCAATGATC TTCTATTAGA TGGAGACAGA CTCGTGGTAG TAGTCGGTGG ATATGGTAAT
GCTATGCGAG CAGCTCCTGG TGCTGAACAG CCCATTTTAT ACGACTACAA CGGAGCCCGC
ATAGTGATTT ATGACATTTC CGCGCTCGAC AGTACTGGAA CAATTACTCA GCTCTTCTCA
GAAGACATAA ACGGAAGTTA TAACTCAATG CGAGCCATCG GAAGCAATCT TCACATTGTT
ACTATGTCGG GATTAGACAC ATACACTCAT TTGGTCGCGC CTTTTGAACG CTGGAACTAC
CCAAATGTAA CTGACGAAGA ATACATCGCG CAAGTCCAGG AAGCTGCAGA AGGCAAGGTC
ATTACAAAGT TCGTCGAGCA ACTGGCCAGT GAACTAACCT TTCACGGAAA GCTTCCTGAT
TTTGCTCGGA TTAGTCTCAT GCAAGAAGAA TTTTCGGGTG GTGCGCATGA GAGGGTAACG
TATTCTGACG GTGTAGCGAA CTCTGTCGTT CAAGTCTTCT CTCTTGACTT GGCTCAGGAT
TTTTCAATAC TTGGAATTGG GGAGACGCCC TTTAGCGTGT CGGGTGCTTT CCTTGCCCCT
TATTATGGCG AAGTTTACGC GGCAAATGGC ATGTTGATCA TTGCGAGCAA TGGATGGGGA
TACAACAGCG AAAATGGAAT TTCTGAGGAC TACACGTACA TTTTAGCAAT GGCTCTCAGC
GGCCCTTCTT CGACTCCCCA CTCTGTCGGT ACCGTGAAAG GATACTTTCT CAATAAGAAT
TCAATTGATG TTGTCGGTAA CGTGCTCCGA ATCGCAACAA CAATTCAAAA CAGGTGGCGT
TGGCTGATGC CTGAGCCTCT GATTCCTATC GACGGCGATG GAACGGACGG AAACGGAACC
TTGTCTCGGC CGGCCGTCAT GCCTGAGCCA GTCCAAGATG AGCCTTCCAC TGAAAACTAC
ATTATTATGT TGCAAATGCC GGGTGTAGAT GGCACAGACC CAGGTACGAT GCAGGAGCTT
TCTCGGCTTC AGCTTGGAAA AATTAACGAG GTCTTTACAG CCGTCCGCTT CTTTGACAAT
ATCGCCTATG CCGTGACATT TGAAAGAACG GATCCGTTTT ATGTCCTCGA CCTAAATGAC
CCATCCAATC CCGAAATTCT CGCCGAGTAC AATATCACCG GCTTCTCTAG TTACTTGCAC
TCCATGAACA CCGATAACAG TCTTATCTTG GCTATTGGAG AGGAGGCCGA CGGGGATGGA
ATGCCCATTG GTCTTCAGAT CACAGTCTTT GACGTTCTGG ATCCTCGCAA TCCAGTTGCT
GTCCAACGCC ACCTTATTGA GAACGATCCA GATACTTACT CGAGCACTGA TGGTGCATGG
CAATTCAAAG CCGTTCGATA TGAAAAGACA TCTCAACGTC TCATTATTCC TGTGAACATC
AACAACTGGA ATGATCCGAC CTCGAACTAT AATGGGTTCA TTGCATACTA TGTCAGTGCT
ACTTTGATCG AAGAAAGTTG CCGCATTGAG CACGATGCAG GCTACGATGT CTTTATCGAT
CCTATCTTCG TCGATCCGGA TTCCAATGAG ACCGCTGTCG AAAACGAAAC GCTTGTTGGT
CCAGCTGATA CTATCGACGT TGCCCCTTCG GATTGTGTTT ACTGTGCCTC ACTCCAGCCT
CGATCAATGA TCTTCAACGG AAACGTTATG ACAAGCAGTG GCCACTTTAT CCGTAGTACA
GACTTGAACA CATGCGAGCA AGCTTGGAAA TTGGATATCG CCGAGGGCGA GTCAAACTGC
TGCGGTGCCT GGTTCTAGAG AAATATTCCT CACGCAAAAG TTACGGCATA AAATGCCTTC
ATAAAACGTA TGCATAATTA GCAAAGATCG ATGTCATCT
 
Protein sequence
MPDLSDQAEK GEVQEGPSNS SVSSPASNAR KRWFIAVATL LALTGVILAI AIPVSRKNDH 
EKVSIASDED SNMDGTVQQT AKATADISGY FNSSFSLFGE DITNGYISPD EFKSDLRNVA
RFLLDSVVKR NLGSEVNNAA GGDLEVEPGV GVAAEGSNSD MRAPDVGDNV NDFGTNNQED
SVEEGDIIVS NGRHVFAVYG DRVVIWDATT GDMLSDIKMP TFDESLNSTK GSAAATSRSD
IDFFYSGPFI NDLLLDGDRL VVVVGGYGNA MRAAPGAEQP ILYDYNGARI VIYDISALDS
TGTITQLFSE DINGSYNSMR AIGSNLHIVT MSGLDTYTHL VAPFERWNYP NVTDEEYIAQ
VQEAAEGKVI TKFVEQLASE LTFHGKLPDF ARISLMQEEF SGGAHERVTY SDGVANSVVQ
VFSLDLAQDF SILGIGETPF SVSGAFLAPY YGEVYAANGM LIIASNGWGY NSENGISEDY
TYILAMALSG PSSTPHSVGT VKGYFLNKNS IDVVGNVLRI ATTIQNRWRW LMPEPLIPID
GDGTDGNGTL SRPAVMPEPV QDEPSTENYI IMLQMPGVDG TDPGTMQELS RLQLGKINEV
FTAVRFFDNI AYAVTFERTD PFYVLDLNDP SNPEILAEYN ITGFSSYLHS MNTDNSLILA
IGEEADGDGM PIGLQITVFD VLDPRNPVAV QRHLIENDPD TYSSTDGAWQ FKAVRYEKTS
QRLIIPVNIN NWNDPTSNYN GFIAYYVSAT LIEESCRIEH DAGYDVFIDP IFVDPDSNET
AVENETLVGP ADTIDVAPSD CVYCASLQPR SMIFNGNVMT SSGHFIRSTD LNTCEQAWKL
DIAEGESNCC GAWF
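
Both listings above are printed in blocks of 10 characters, 60 per line, so they need to be joined before lengths or base composition can be checked against the record. The following is a minimal sketch with hypothetical helper names (`clean_sequence`, `gc_fraction`). It assumes the only formatting characters are spaces and line breaks.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked listing and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C among all bases in `dna` (0.0 for empty input)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence listing, used here as sample input.
first_line = "ATTCATCTTC CCACGCTTTT CATCTCTGTT AACCGTAAAG AGAGTTTGGG AATAAAGGTA"
dna = clean_sequence(first_line)

print(len(dna))                    # number of bases on that line
print(round(gc_fraction(dna), 2))  # GC fraction of that line only
```

Applied to the full gene sequence, `len` should match the Gene Length field, and `gc_fraction` can be compared with the reported GC content. Applied to the protein listing, `clean_sequence` alone gives the string whose length should match the Protein Length field.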