Gene PHATRDRAFT_47891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47891 
Symbol 
ID7203103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp364354 
End bp367296 
Gene Length2943 bp 
Protein Length867 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182379 
Protein GI219124162 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACAAC ATCCACGTCT TCAACGTTCG GTGTTTGACC CCCGCTATAT CGGGGCAATG 
ATTCTAGTTA CGATCCACGG AGTCTCTTCC GACGCCCCCG TCTGTGAGGG CGACCCGGGG
CAGTGCGAAA GTGGCATTTT GGCACCGTGT GGTCTCTACT TGGCCCCGTC AACGAAGAAT
CCGGAAACTC TTACACTCCA TTCGGGTGTA GACCGCGACG CACACGAGCT TGTGGGTGAG
CCAGACATTG CTCTCCCCTT TTTCGATCCC AATAAGAACG AATGGAGCGC ATGGCACGAT
ATGGTATGGA ATATAGACGT GCTAGATGGC CTTATTCTAG AGAACTCTTT TCTTTCGGAG
CTGCTACTAC CCGGAGTAGG AACTTTACCA GCCTGCTCGG TCTTTCAGGG GGAAAACGTT
CGACTGAAAA GAAATCACGT TATAGACAGT TTAGATGTGC ATCGAGCGAA GGATGCGACA
GCCGGCTCGT TCTCGTATCA CCATGGCGTG ACGTACGAGA CTGTACGGTC CATGGCGGCA
GGAGAGGAGC TTTTCTTAGA CTGCTTAGGT CCACCTCCGC CCTTCCGAAG AGACAAAGAA
AAAGACGAGG ACGATGGTAA CGGTGACCTT GACAATGATG TTTACGAGGA TGAAAATGGC
GGCGGTGAAG ACGATGAAAT CCGGTCACTT GAATGGCTGC AAGAAAACGG CGTATGCGTC
GACAACATCT GGATTGGTCC TTCAACCAAA CTGGGAATCG GCAATGGCGC CTTCACCAAG
CGAGCAGTAG CGAAAGGGAC TGTGATCGCT CCTTCGCCTG TGCTTCACTT GGACCGTTCC
CAACTGCAAA TTGTCGAACA GCGTTTTCGG GAGGACCCTT TTCCTCCATT TTTTCGAGAA
CATGGTGTTG AATATTCAGA TTACGTTGTT GGTCAACAAT TGGCATTGAA CTATTGCTAT
GGTCATCCTG ATTCCAATGT TTTGTTGCTA CCTTTAGCGC CCGGGATCAA CTTCATCAAC
CACGATGCCA TAAGTCCCAA TGCATTCGTT CGGTGGTCAA CTTCATTGAC GGAACCATCT
GACTGGCTAG AGGAGACCGC GCACCAGCTG TTCGCAGAGT CTGTCGACGG GACACTGCTT
ATCGAATTTG TAGCATTGCG CGAAATTGCG GCTGGGGAAG AGATATTTAT TGACTATGGG
GAAACATGGA GCACAGCCTG GAATAGCCAT GTAAAGGAAT GGACATTCGA TGGTGCGAGC
TACATATCGG CTGCTAAGTT TGAAGATCTG TACGGAAACG ATGCTATTCG CACGCATATG
GAACAAAGCA AGAATCCCTA TCCAGACAAT CTGACAACTG CCTGCTACTT TGTCGCTATT
GAGGTGGACG ACGAAGAGGA GCTAGTCGAG TGGGAAAACG AAGCGCTTCA TTGCTTGCGA
CCATGCAGTA TCAAGACTCG ATATAAAGAA GATGGTATTA CCTTCTATAC CGCTATTGTT
TATCCTTTGA AGAGCCCTGC TGAGCCACAG TATTGTGGTG AAATCCCAGA TTCGGGATTG
TTCGTGACCG GGATACCTCT CCAGGCTGTA AAAGTTGTGG ACAAGGCGTA TTCATCTGAT
GTCAATCAAA GAAATACTTT CCGACATGAA ATTGGTATCC CCAAAAGATT CTACCCTTCG
AATTGGATGT CTGCCGACAG CCGACCTCTA GGCGATTTTT CTCCAGACCC GTTGAAGCCC
GGTGAAATGG CCGAAATTCG TTGGGCTAGT TCTGGAGACG TAGCTACAAA ATGGGCCTAT
CGCCTTGGTC TCCACGAAAG TATTCGCAAA ACACTGTTGG AATATTGCGA TAGAATGGGG
ATTACCGACA TATTTCGGCA TGTAACTACA AGAGACAACG CACTCCTCCC CGGTGCCGAC
AAAAATTTAG AGCTGAATGG ACATAATTGG TTCTTACAAC GGCCGGACAA GAAATGGCGG
TCAAATCTCC ATTGGCTGAG TCCCGGTGAC AACGCTGCAC ATGAAGACTA CTTGCAAGCC
TTGAGTGTCT CAGGGTTTGA CACAATTCTC AGGGGAATTG GAGAACAGAT GGGCATGGAC
GGCCTAGTGG CATTCCACGT TACTTTCATT GCTGTGTCAT ATTCGACTGA AGGATACATG
CACTACGATG TCACCGCGAC AGGGGGTAAA GCATACAACA TCATAATTCC TCTTATTTTG
GCCAACGAGA CGGGTCCCGA ACTAGACTTA CGGAGCTCAT CAATACTAGG AGAAGATGAG
ACCGAGTCTC TTGTCGGAAG ATATCGATAC GAGTACGAGG TGGCATCTAT GCTGGGTGAC
GACGCTTACC ACGCTACGTC GGCGGTGGAC TATCGGGCCA GTAAGGAAAT GCGAATGGCA
GCGACCATCT ATGTTGCGGA TGTCAACGAG GAGAATGCTG GTGCCATACT GAACGAGTAC
ACACAGGCTT ATCCGCCCGA TGACCGGGAT CTTCTGATGA GTTGGTCTGG ACGACACTGG
CGAAAAGACG ATACAACAGC AAAGCTACCT GCTCCTGTCA GCGGCCATAT TCTCCTTGAG
GCGAATACGG ATAACACTAG CTAACTGGCC AAATCGCTTC ACAGTCAAGC CGCGAGCAAG
CTGTTACAGA ATCCTTTGCA TCTTTACCCG AACTTAAGTA CGCACTGCAG CATTGTAAAT
ATCTATCCTT GAAAATATGC CGCTGTATAT AGCTTCCTAA CCCGTTTATG GTGCACTTGC
TTTTGCAAAG AACAAAGAAA CCTGCAGGCA TAGCTAGGAT ACATATGACA CAATCAAGAA
CGAAGTGAAA ACAACAGAAG GGAGCACGTA TGAGTACTCA ATATCTCACA GTAACCACTT
GTTCTCTTTG TGCTCTTGGA AGTCCTTACC TCACTGACCG ACGAAAAAAT TATTTATACT
TTT
 
Protein sequence
MQQHPRLQRS VFDPRYIGAM ILVTIHGVSS DAPVCEGDPG QCESGILAPC GLYLAPSTKN 
PETLTLHSGV DRDAHELVGE PDIALPFFDP NKNEWSAWHD MVWNIDVLDG LILENSFLSE
LLLPGVGTLP ACSVFQGENV RLKRNHVIDS LDVHRAKDAT AGSFSYHHGV TYETVRSMAA
GEELFLDCLG PPPPFRRDKE KDEDDGNGDL DNDVYEDENG GGEDDEIRSL EWLQENGVCV
DNIWIGPSTK LGIGNGAFTK RAVAKGTVIA PSPVLHLDRS QLQIVEQRFR EDPFPPFFRE
HGVEYSDYVV GQQLALNYCY GHPDSNVLLL PLAPGINFIN HDAISPNAFV RWSTSLTEPS
DWLEETAHQL FAESVDGTLL IEFVALREIA AGEEIFIDYG ETWSTAWNSH VKEWTFDGAS
YISAAKFEDL YGNDAIRTHM EQSKNPYPDN LTTACYFVAI EVDDEEELVE WENEALHCLR
PCSIKTRYKE DGITFYTAIV YPLKSPAEPQ YCGEIPDSGL FVTGIPLQAV KVVDKAYSSD
VNQRNTFRHE IGIPKRFYPS NWMSADSRPL GDFSPDPLKP GEMAEIRWAS SGDVATKWAY
RLGLHESIRK TLLEYCDRMG ITDIFRHVTT RDNALLPGAD KNLELNGHNW FLQRPDKKWR
SNLHWLSPGD NAAHEDYLQA LSVSGFDTIL RGIGEQMGMD GLVAFHVTFI AVSYSTEGYM
HYDVTATGGK AYNIIIPLIL ANETGPELDL RSSSILGEDE TESLVGRYRY EYEVASMLGD
DAYHATSAVD YRASKEMRMA ATIYVADVNE ENAGAILNEY TQAYPPDDRD LLMSWSGRHW
RKDDTTAKLP APVSGHILLE ANTDNTS