Gene PHATRDRAFT_47736 details

Gene Information

Locus tag: PHATRDRAFT_47736
Symbol:
ID: 7202726
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 683768
End bp: 686788
Gene Length: 3021 bp
Protein Length: 1006 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002181955
Protein GI: 219123279
COG category:
COG ID:
TIGRFAM ID:
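
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon; the record does not state either convention, so both are assumptions. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 683768
end_bp = 686788

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# The record lists the gene as unspliced, so the whole span is coding.
# Assumption: the final codon is a stop codon and is not counted as a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 3021 0 1006
```

Under these assumptions the result agrees with the listed gene length (3021 bp) and protein length (1006 aa).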


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGGCGA CAGAAAAAAG CTGCGCAGAC GAAATTGATC GAGAACTAAG TCTCATCTCC 
CTTTATGAGA GTGAACTAAA ACGTCTAAAG CGTAAGCTGA ATGCTCAAAA GGTAGCGCAG
CCCTTGCATT CGGCAGCAGC ACCGGAACAC ACCGGTACCG AGTACGGCAA CGATTCCGAG
CAGGATCCAC TCGACTGGAT TTTGCAGCAA CAGCAAACGT CGGCAGTTTC AAACTTGTTA
GCTTCCATTC AATCACACGA AAGCCTTTTA CGGGATGATA TTGAACCCCT GCAAACAGCA
ATTCGCCAGT TTGAGTTTAC TACCGTCGAT AAAGTACCGC CTCGCGATGG CAAGCCGTCA
TCTCCTCACT CAGCCGTAAG CTATTCTCTA AAAGGCCACT TCCTGGCAAA TGAGTCTATC
CATGCGGACT TTGTGATTGA CTTTCAACTA CAGAATTCCC GCGCCGAAGA CAAGATGACT
CAAACGACAA AAATTCTGGG CATTATCTGC AACTTATCCA GCATAAATGA ACCAGATCGA
GACTTGTCGT GGTTGGCTCG AGAAGCCAAG CCTACTGATT CCGGTAATTT TGCCAGTTTC
GTAACAAAAA TATGTTCCTA CTTGGAATTT GATGTCCGTC GTGAAGCTTG TTTAACGAAA
TGGGGCCATG TCACCGTAAC AAGGGATCAT TCGAAATACC TTATTGAGAT CCCTCTTGAT
CATGATAATA CCGGTACGAA CTTTCATTTG TCGACGCTTT CCATCGTCTG GGGCTGGAAG
TGGAAAGACG AGCATGATGT ATTGCGTCTA ACACAGACGG CTTTTGAGTT GGGGCTCAAA
CAACAAGATT TGGACTTCCT TGTACAAGCA TGCGGCACCT GTGAGAAGGC GATTGGAATC
GTTCTGGCAC AGACGAGCGG GGACACGATG CTTTTCCCCA ATGACGAGCA AGAGTCGAGG
ACGCCAACTT TGCCAGATGT CGAACAAGAC GCTGATCAAG AGGATAATGA CAGTTTGGAC
GGCGGCGTTC TGCGCGAAAA TGCTTCACTT GCGTCAAGCA CGCAATCCCT TACTAAATCC
CCTAGTGGGC GTCGACGCTC CGACTACGAA GTACAGCGGC TTCTTAAAAT CCAGCGCAAC
CAGCAGTTGC TGGAAAAATT GGGTCTCTCA CATTCCTCAA GGTCGAAACC AACCACGGCT
GAGAAGAGGC GCCCTGCGGA AGAGCAAGCA AATCAGGAGC TGGAAACGAA GCGCAGGAAA
CGAAAGAAAG AATTGGACGG AAAGCGACGG TTGTCGGGAC GAGTTCGCCT GAAGCCTGTC
ACCTTTGCGG AAGAACAAGT ATTCCATCGG AAAGGGTCTT GGAAAGATTC GTTCTCGAAA
AGGACAAATC AGATTCAAAA CGGTGAACAC AGGAGTTTAT CTGGACGCAT TGCTCAACCG
CGCGGACGAC CACCCACCGG GTGCGCATGG GATACCAGAA TTGGAATGTG GGTAAAATTC
AACGACACCA GCGATAGGAA GCAAGATGCA GCTGAGCTTA TGAGGCCGTC CTCAAGTGTG
GCAGCCCAGA GCTCGTCAAC GCTAGAAGCA AATTATTCCT TGGTTGATGG TGCTTTTTCT
TCCACTCGAG ATGGGACTGC TCCTCAGATG AACGAGTACA GCAGCCCGTC GAATGGTACA
TTTCCTCGTC CGCGTGGAAG GCCACTTAGT GGGCATTCCT GGGACGAAAG ACATGGTATT
TGGGTGCCGG GAACCAGCCC TGGAAAAGTC AGTCAAAACG ATTATGGTCG CATAGCACCA
TCGATACGCG GTCCGACAGA AAGATTGCCA GAGACACCCA ACCATCCAAG CTTTGAAAAG
TCGCCGTCCA ATCTTCGAAA TCTTGACCAT TCGAAAAGTG GCGTAAAATT CAGCACTCCG
TTCGCCAAGA CAATTCCTCG ACCCCGCGGA AGGCCACGTA GTGGGCATTC ATGGGACGAA
GTGCATGGTG TTTGGGTGCC CCTGGGAAAA CGGAGTCAAA GCAAGTTTGC GTATGTCATA
CAACCGTTAC ACGACAGAAC AGAAAAACTG CCCGAGACGC CCAATCATCC AGGCTTTAAA
AAGTCGGTGT CCAAGTCCAG CAATCGTGAT CTTCCGAAAA GTGACGAGGA GATAAAAAGT
CGTTCCGCCA CGACGTTTCC TCGGCCCCGT GGACGACCAC CAATAGGATG CTACTGGGAC
GAGACACGGG GTTGTTGGGC TACGCAACTG AGTACTCGAG AAGTCGACCA GCCGACAGGG
GCATTGGCAC CCAAACTTTC CCGTAATTCC ATATTCGACA CGGGCGCATT GCATGTTAGC
GCTCGATCAG CTCATATCAA TGTTACAGAT CATCCAAGAA AACCGACAGA TAATTTTTCC
GGTCGCTTTC CGCGACCGCG GGGAAGACCA CGGGCGGGAT GTATCTGGGA CGAGGTCCGT
GGGCTCTGGA TCCCTGAGAT GAAAGCCCCA GAGACAAAGC CAGTATTGAC GGAGCGTCCC
ACCATTCCAC TTCCGATCCC ACGAACGAGT TCCAAAAGTC CGTCAGTCAC GCGGCTGCCA
TTCTCGTTAC GTCCCGACAC TATCCAGCCA AAATCGTCTC CGGTAGCAAA CAGCGACGGT
ACGTACGGCC GACCCCGGGG AAAAGCTCCA CTGTACTATG GATGGGATAC CCACCGCGGA
GTGTGGGTAT CACAGTCAAA CCCGTCCGTC AGCTCGTCAC GGATAGCAGA CCGTTCTCCC
GTTTACGAAT CACCACCCGG CCAGAAATAT AGCAATGCGA TAGCGATGCC ATCTCCAGCA
GTCGTGAAGG TCGGTTCCGA GATCGATCGA ACGCGAGCAG CAGAAGGTTT CGTCATTCGG
GAGCAAGGGA ATCCTATGGG CACGGGAAAT AACGTGGCTG TTGATAGGAA GCAGTCGGAA
ATCGACAAAG AAACTGCACT TTACGAATTG TATCAAGAAA AGCTACGGCA TCTGGAAAAA
CCGCGGAAAT CAAATGGGTA G
 
Protein sequence
MAATEKSCAD EIDRELSLIS LYESELKRLK RKLNAQKVAQ PLHSAAAPEH TGTEYGNDSE 
QDPLDWILQQ QQTSAVSNLL ASIQSHESLL RDDIEPLQTA IRQFEFTTVD KVPPRDGKPS
SPHSAVSYSL KGHFLANESI HADFVIDFQL QNSRAEDKMT QTTKILGIIC NLSSINEPDR
DLSWLAREAK PTDSGNFASF VTKICSYLEF DVRREACLTK WGHVTVTRDH SKYLIEIPLD
HDNTGTNFHL STLSIVWGWK WKDEHDVLRL TQTAFELGLK QQDLDFLVQA CGTCEKAIGI
VLAQTSGDTM LFPNDEQESR TPTLPDVEQD ADQEDNDSLD GGVLRENASL ASSTQSLTKS
PSGRRRSDYE VQRLLKIQRN QQLLEKLGLS HSSRSKPTTA EKRRPAEEQA NQELETKRRK
RKKELDGKRR LSGRVRLKPV TFAEEQVFHR KGSWKDSFSK RTNQIQNGEH RSLSGRIAQP
RGRPPTGCAW DTRIGMWVKF NDTSDRKQDA AELMRPSSSV AAQSSSTLEA NYSLVDGAFS
STRDGTAPQM NEYSSPSNGT FPRPRGRPLS GHSWDERHGI WVPGTSPGKV SQNDYGRIAP
SIRGPTERLP ETPNHPSFEK SPSNLRNLDH SKSGVKFSTP FAKTIPRPRG RPRSGHSWDE
VHGVWVPLGK RSQSKFAYVI QPLHDRTEKL PETPNHPGFK KSVSKSSNRD LPKSDEEIKS
RSATTFPRPR GRPPIGCYWD ETRGCWATQL STREVDQPTG ALAPKLSRNS IFDTGALHVS
ARSAHINVTD HPRKPTDNFS GRFPRPRGRP RAGCIWDEVR GLWIPEMKAP ETKPVLTERP
TIPLPIPRTS SKSPSVTRLP FSLRPDTIQP KSSPVANSDG TYGRPRGKAP LYYGWDTHRG
VWVSQSNPSV SSSRIADRSP VYESPPGQKY SNAIAMPSPA VVKVGSEIDR TRAAEGFVIR
EQGNPMGTGN NVAVDRKQSE IDKETALYEL YQEKLRHLEK PRKSNG
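
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a listing could be cleaned, translated, and checked for GC content. It assumes the standard genetic code (NCBI translation table 1), since the record leaves the translation table field empty. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of any database tool.

```python
# Standard genetic code (NCBI table 1), bases ordered T, C, A, G.
# Assumption: the record does not name its translation table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a DNA sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence as printed in the record.
first_row = "ATGGCGGCGA CAGAAAAAAG CTGCGCAGAC GAAATTGATC GAGAACTAAG TCTCATCTCC"
print(translate(first_row))  # prints: MAATEKSCADEIDRELSLIS
```

The translated first row matches the first twenty residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would allow a comparison against the 51% GC content reported in the record.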