Gene PHATRDRAFT_43579 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43579 
Symbol 
ID7197312 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp866571 
End bp868676 
Gene Length2106 bp 
Protein Length701 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177704 
Protein GI219111905 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTCTA CATCACTTGA TCCAAAAAGC TCCAAAAAGC GCCCCCGTTC CATATCTCTG 
AGAAGCAATA ATGGCTGTAG CACCAAAGGA CGACTCGCAG ACAATCCAAC GCAGCAACGT
AATCGGCAAC GGAAGAATCA GACCAAGCCG ACGGCATGGG TCGACCACTG GGAACTGAAC
GCTGTGGGTA AATCACTGTG TGCAGCATCG GAGCTTTTCG TCGAGAATGC TGATAATTTG
CTCACTACGA AGCAAGGGCA CGAATTGGGT CAAGCCGAAT CAATGGCGTT GTCAGAAGCC
ATTGAGCGTG TCGCGGTATG GAGAACGCGA CAGACACAAC TGCCACACAT TGTCGAATCA
TCGGCGGCCC TGGCTGAAAT ACTGTTGCGC GAGGCCACGA CAGCAACTGG TTGTAGCGCC
ATGGAACTAC GACACGCCTA CGCATGTGCC GTGATCAGAT CGATCAACGG TCTGGCGGAC
CCACTGCAAC AACAACGATC GGTAGCCACC AGCATTGCTT TACTCTGTCG CCAATTGGGT
CTTCCATCCT GGCTAGTAGA TGTTCGTCAC GAGGCAACCC ACAATCAAAT GCCGACCTTG
GCTGTACTCC GCATGGCGGC CAAAACGTTG CTGCAGTACT TTCAAGCTGT CTACTGGAAA
CCAATCGCAA ATATTCCCGT GAACTACAAT GAGGAAGCAT ACACCTTGCT GGAAGAATAT
GCAAGAAGAA CGCTGCCGCG GCAAGCAGAA TTCATTTCCG AAGAAGACAA TGCTACTTCA
CAGCAAGACT CTGATAAGTC GGTAGACGAT AATGGCAGTC GCGAGGAAGA AGAAGCTATG
GTGGGCGGTC GAAGGCTCGG AACAACGACA AATAGCTTTG CCCTACTTAT GGAACCGAAA
AAGATCAAAC CCGTTAAGCT CGAAAAAAAG GAAAAAATAA AAAAGAAGAA AAGAAAGAAT
GTTTCTGTCG ACGCCCTGAC GAAGTCAGCT TTGGCATTTG TTCAAGCTAA AATCCCAGTG
TCGGAGGCTT TTCGGTCATT GCTTTCCTTT CTCGTATGGG GATTCGGCGG AACTTCAGGA
GTCCTAGTTC CGCAGGCTTC CGGCGTCTTT TCGGTGAATA GCTTTGGAGT ACAGCAAATA
CGAAAACATT ATATTCCCCT GATTTCGGCG GCTGTCAAGG AATGGCCTGG ATTTCATCGA
GCACTTTTTA TCCATTTGTT TGAGTTTCTC TTTGTATCTG AGCAACGGGT TGTTATTCGA
CCGGTAGAAC CATCATTGGC ACGCTCATTT TTCTTTGCTA AGTCGTGGAT TCAGCATTTG
TTGACTCGCC AATTCTTTGA CTTTGTCGAT CCCGAACTCG TTCGTACAGA CACCGCATTC
GAAGAGACTG ACTTGATGCC GCTTTCTATC CTTGAAGATG TTGGCTATCC GCTAAATTCT
TTGTGCGATC GGCTCGAAAT AGCTAAACTA AAAGGATTCG CTCATTTCGA TTGCATTGCC
TCGGTCGTTG AAACGTTCCG GCACATTCTC GGAGATACGC GAGTCGCTCG CCACGGCTAT
CCAGAGAGTC AGGCTTTTAC CCTACCTGAA AAAGAGTTTA CTGCAATTGA CCCCTTACAG
CTTCTAGAAA CTGAGACACT GGAAAACAAT GCGTCACTGG ATGAGATCGA AGCGATGTTG
GCTGGGCAAG TATCGACGAC ATGCGAAGAC GATGACATGA GAAACAAGGC AATCAAATGC
AACCCGCCGC TTACAAAGGA AGGAGGAGCT GATTGTAACA TTGCACACGG TGCTGTTGTG
AATGAAAAAC AAATGAACTG CGAATTGGTG GACAATAATT TTTTCTGCGG TCAATCCGAC
CCACAACCAC ACCGTAATCG TGAAGAAACA ACTGATTTGA GAGGGACGGC TGTTACATCG
GAATTAAGTC AGCTACCCGA GGAAGTTTCA TCTCTTGACA TCAAGTGTGC CAACGAAAGC
ACAACATTAC AGACAGGCTA TTCGTTCGGC AATCCGAATA GTGGGCGAAA AACGCGCCCG
GTATGGAAAT ATTGCAAGTC ATGGGATGAA TGTTCCGTCG GTGCCCTACC GGGACGTCCC
TCATAA
 
Protein sequence
MISTSLDPKS SKKRPRSISL RSNNGCSTKG RLADNPTQQR NRQRKNQTKP TAWVDHWELN 
AVGKSLCAAS ELFVENADNL LTTKQGHELG QAESMALSEA IERVAVWRTR QTQLPHIVES
SAALAEILLR EATTATGCSA MELRHAYACA VIRSINGLAD PLQQQRSVAT SIALLCRQLG
LPSWLVDVRH EATHNQMPTL AVLRMAAKTL LQYFQAVYWK PIANIPVNYN EEAYTLLEEY
ARRTLPRQAE FISEEDNATS QQDSDKSVDD NGSREEEEAM VGGRRLGTTT NSFALLMEPK
KIKPVKLEKK EKIKKKKRKN VSVDALTKSA LAFVQAKIPV SEAFRSLLSF LVWGFGGTSG
VLVPQASGVF SVNSFGVQQI RKHYIPLISA AVKEWPGFHR ALFIHLFEFL FVSEQRVVIR
PVEPSLARSF FFAKSWIQHL LTRQFFDFVD PELVRTDTAF EETDLMPLSI LEDVGYPLNS
LCDRLEIAKL KGFAHFDCIA SVVETFRHIL GDTRVARHGY PESQAFTLPE KEFTAIDPLQ
LLETETLENN ASLDEIEAML AGQVSTTCED DDMRNKAIKC NPPLTKEGGA DCNIAHGAVV
NEKQMNCELV DNNFFCGQSD PQPHRNREET TDLRGTAVTS ELSQLPEEVS SLDIKCANES
TTLQTGYSFG NPNSGRKTRP VWKYCKSWDE CSVGALPGRP S