Gene PHATRDRAFT_47664 details

Gene Information

Locus tag: PHATRDRAFT_47664
Symbol:
ID: 7202854
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 455339
End bp: 459241
Gene Length: 3903 bp
Protein Length: 1300 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002181911
Protein GI: 219123187
COG category:
COG ID:
TIGRFAM ID:
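The coordinates and lengths listed above agree with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length counts the terminal stop codon (the gene sequence in this record ends in TAA). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    # Assumption: the stop codon is part of the CDS but encodes no residue.
    return codons - 1 if includes_stop else codons


length_bp = gene_length(455339, 459241)
print(length_bp)                  # 3903
print(protein_length(length_bp))  # 1300
```

Both values match the Gene Length and Protein Length fields of the record.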


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00840283
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATCTAG AAGAATGCTC GACCAACAAC GAAAAACCAC ATATTCAACT ACTCCATGAT 
GAAAAGAACT GTGTCGGGCT TTCTCGTCTG CCTTTCCGAT CGTCGCTCAT TTCTTCGGCT
GCGATGCCTC TTTTAGTCAT GATCAGCTTG CTTCTTTTGC AGCCAGTAGC CTCGCAGACA
CCCTATCGGA CGTGTTATGA GGCCCTAAGT AGTGCCGATG GTGATCGAAA TAGTGTTCTG
ACGCAGGCAG AATACGTCGA GTCCTTGAGG ATTTTGACCT TTGGTGCAGT CAGCGTCAAC
TCCTACGAAA ATCTTTCCGA AGAGCTAAAA TCGGGTTTCA CCTTTCGTGC TCCAGACGGT
TCACCGGGAG TCGACATAAG TGAAGTAGCA TCCAGTAGTA GCTCCAGTAT GTCCCTATTT
TGCACCAGTA TATACGCTGG TCTCGTGGAA AGCCTAGGTA TTGCAACTTC TCAACAAGCT
TGCTTCATTG CCATGTCCGT TGGTGATATT GGACGAGACG ATGCTCTGGC AGCCGAGCCG
GATTTTGTCC GATATGCTAA CCAGATGGCG GGAGGGTCCT ACGGAATTTC CATTTCCTTT
GCATCTTTAC CGCCACCCCT GCAAGCGGTC TACAACGACT TTTCAGACGG TGATAATGGG
ATTCCGGTCA TCGGTTCCAA ACCGGGCACG ACGCCAACAA TCGAAGACCA GAGCTTTTTA
ACCAATCTTT GTCGACAGAC CGCTGTGGCT GTTGTTGCGG GGGAACAGCC AATGGCTACA
CCAGCAACCA TGGCTGGAAC GACTCCGGCC ACGTCAGCAC CTGTTTCTTC TCCAACAGAC
GGCGCTGGAT CTATCACTCC CCTTTTTACG TTTTCCGATT GCACAACGGC GATGCTGGTG
TCCGATTTGA ACCGTGATGA TTTCTTCGCC CAAGCGGAGT ATTTAAGATT CTTAAATCGA
CTAACTACCA ACGCATTTTC TGATCAGACG TTCGGTACGC TTCCCAGTCG ACTCCAGAGT
AACTACAACG TCTTGGCAAC TAAGGATGGT CAGATTCCAG TCAACGGGTC TAAGCCAGGT
TCTGCGCCGA GTGCTGAGGA ATTAGCAAGC CTGGAAAATG TCTGCGATTC CACGGAAACA
GCCTTACGTG AGGAAAACAG CGGCGGAGGC GTAGTTCCAA CGTTTGCTCC TACCATGACA
ACGTCCGGGA TAACTATGGC TCCAGTGCCA ACTGAGCAGA CGACCGGTAG TCCGGATCCC
GTTGCCATCC CAACACTTCG TCCAGTCGCA GCTCCATCAG TACCGACTAT TCCTTTTAAT
ATCTGCACAC TCAATATGGC CACGTCGGAT TTGGATCGGA GTGGTAGTCT TAGCTCGAAC
GAGTATTTTA ATTTTGTCAA CAAAATTGCT GGAAATACTT ATGATGGCTT AACATTTGAC
ACCTTACCCG ATGCCGTGCA AGAGGCTTTT GATGGTCTAT CTGATGGCAA ATTGATTGAC
GTTTCGGGCT CGTTTCCAGG TCAACGTCCC AATGAGGCAC GTGAAGCCGA ACTGGAGGCC
ATTTGTGCTA CCTCCTTGGA GGCGATATAC GGTCCTCCGC TTGTGACCGC CACGACGACA
CCATCAGTAG ACCCCGCTCC AACAACGTCA CCAAGCGCTG CGCAAGTCCA GAACATAACA
GTGTTCAACG GCTTTTATAT CTTCAATGTC AAGGGTGTTC GAGCAGCTAC TTTGATTTCA
GGTCAAAATC GAGATGGGCT GAACCGTGGG TACGAAGCTT TTGTCAGGAA CATCACGGCC
GAGTTTATAG CAACTACACA GGTGCCCGGC GAGCCTCAGC GGCATCTTCG ATCGCGCCAT
TTAGAAGAAC CAGGCCTGGC GCCCGAGGCG GCACGAATTT ACGAAATAGA TGATGTTGAC
TGTCCTCCTG TAATCAACGT CGAGAGTACT TTTTGTCAAA TAGTGTACGC AGAGTTCGAG
GTACACCACA TTCGAGAGCC GGCCGCTGAT GCATTTTTTG AATCTTTGAC GAGGATAACT
CAAGACGCAA TTGTCGGCGA CAAGACTGGT CTTAATGCTT TTGTGCTCGC GGCAAATCCC
TATTCGGATG TGGAGGTTTT GGGACCGGCC GAACAGCTTC GTCCGATCAA CCTACCCGGG
GTTCAAGAAC CCGTTCCAGG CAACCCTGTC GGAGCAGAAA GCGGAGGCAA CCAAGCAGGC
CTTATTGCGG GTATTTTCGT CGCGATTGCA GTCATCGTAA CCGTAGCAGG AATTGTACAC
TACCGTAAAC GGAAAGGTGA CAGTTATGGT TTGAACTCAG GTTTCAGTAC GCCATCCTTT
TTCAAGCGAA AACCAAAGTC AAATCCAAAT GTACCAGAAG TCAATTCATT GGGGAGCGCG
CAAAATCATG GCGAGCATCA CGATTTAGAA GACGGATTCG GGCGCTTTGA AACCATTGAA
ACCAAATCTC CTATGAAACC TGGCGGAATG GCAGGCTTCT TCGGGGGTTC GCATCATAGT
AAATCTGGAT CGGACGACGA GGAAAGCGGA GGATTTAGTG TACAGGAACA GTCACCAATC
AAACGAGATG CACGAAATGC TCTAGGAGAC ATTGAAGTAT CCGACAATGG CCATTTGAAA
GGAAACGCTT TCGGTGGGTT CGGTTTTGGA AAGAAAAAAT TGAACAAACT TCAACACTCT
ACCAACGAGC TCGATAGTAG TGAATACGAC GACAATGAAG ACGATTTCGC GAATTATGGA
TTTGAAGAGC CGGAAGAGCA GCGCCATGAT TCTGTGGGGG ACATGTTTGA TATTCTCAAT
CAAAATGAAG GAGAAACAGA CTGGGATCCC CAACACGTTT CTAGCGCAGC ATGGCATGGA
GACGTCAAAG GTACCTGGGG GGCCCAAGGC CACAACCATG ACTTCGGTGA CGACCATACC
GTTTCACAAA GCGGTTCGGA GAGCCAGAGC CACGATAACG AAGAGGAATC CGGAAGCTAC
TCAGAGGATG ATGATTCCGG CTCGAGTGAT GACAACTCGT ACAACGGAGA CGATGGTGAT
GACACGACTC GACGAACTCG AGAACCTTTA AATTCTCGTG CCATTGGCGC GAGCGTCACG
GAGAACATGC GGCACCTCGA CGCAATGGTG CATCATGGGC ACTGGAATGG TTTTGTTCAG
AAAACTGCGG AGCTTGCGGA AGATCGAAGC GAAGGCTCTG AAGACAAAAG CGAAGAAGAG
TCGTTTTCCG GCTCAGGGAC AAGTAGGACA GACTCTTTTG TGGATGATGG ACTAGATGGG
GGTGAAGATG GTCTGAGCTT GAACTCTACT GAGAAGTCTA CACGAGAAAA GTATAGACTT
CAAGTGGAAC AATTGGTTCA AAAAGTTGCA CCAGAGGAGT CAGACAACAT AAATGCAATG
TTCGACAAAT TTCTTGGCCG AGAGGCTGAG CTCTTACAGA CGTTAGAATC CATGAATGAT
CGTTCCGCTT CGCAAAGAGC CCGAAAAGCA GTACACAGGT CGAAAGCTTT TCCTCAACAA
TCTGGGCGGC TTTCTGCGGG AGGGTTGGAT GGTTCGGCCG CCATTGCAGC AGCTAGTACA
CTTGGTGGTG GATTCTACGA TAAAGGTGAT GATGAGCACG ATGAAAATCG TAGCAGTGAC
GAGAACAGTC ACTCATATGA AAGCAGTGAT GAATTTAGCG GCAATGCTAG TGGCAGCGGC
AGCTACGAAG ATGGTCAAGG AAGCCACCGT TCAGACATCT ACAACAAGCC GGAAGGGAGC
TCCCGCTCTG GTTCCGCAAG CTTTGACGTT GTTGATGGGA GCTACCGTTC CGAGTCTGGG
AGTTATGATG ATGCAGGTAG CGATAGCTAT GCCTCTGATA GCGGCAGCGA TGAAAACTCA
TAA
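The gene sequence above is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. The following is a minimal sketch of stripping that formatting and computing a length and GC fraction. The helper names are hypothetical, and the record's "GC content" field is presumably a rounded percentage (an assumption; the record does not say how it was computed).

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as it appears in this record.
first_line = "ATGGATCTAG AAGAATGCTC GACCAACAAC GAAAAACCAC ATATTCAACT ACTCCATGAT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

Applying `clean_sequence` to the full block and then `gc_fraction` gives a value that can be compared with the GC content field.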
 
Protein sequence
MDLEECSTNN EKPHIQLLHD EKNCVGLSRL PFRSSLISSA AMPLLVMISL LLLQPVASQT 
PYRTCYEALS SADGDRNSVL TQAEYVESLR ILTFGAVSVN SYENLSEELK SGFTFRAPDG
SPGVDISEVA SSSSSSMSLF CTSIYAGLVE SLGIATSQQA CFIAMSVGDI GRDDALAAEP
DFVRYANQMA GGSYGISISF ASLPPPLQAV YNDFSDGDNG IPVIGSKPGT TPTIEDQSFL
TNLCRQTAVA VVAGEQPMAT PATMAGTTPA TSAPVSSPTD GAGSITPLFT FSDCTTAMLV
SDLNRDDFFA QAEYLRFLNR LTTNAFSDQT FGTLPSRLQS NYNVLATKDG QIPVNGSKPG
SAPSAEELAS LENVCDSTET ALREENSGGG VVPTFAPTMT TSGITMAPVP TEQTTGSPDP
VAIPTLRPVA APSVPTIPFN ICTLNMATSD LDRSGSLSSN EYFNFVNKIA GNTYDGLTFD
TLPDAVQEAF DGLSDGKLID VSGSFPGQRP NEAREAELEA ICATSLEAIY GPPLVTATTT
PSVDPAPTTS PSAAQVQNIT VFNGFYIFNV KGVRAATLIS GQNRDGLNRG YEAFVRNITA
EFIATTQVPG EPQRHLRSRH LEEPGLAPEA ARIYEIDDVD CPPVINVEST FCQIVYAEFE
VHHIREPAAD AFFESLTRIT QDAIVGDKTG LNAFVLAANP YSDVEVLGPA EQLRPINLPG
VQEPVPGNPV GAESGGNQAG LIAGIFVAIA VIVTVAGIVH YRKRKGDSYG LNSGFSTPSF
FKRKPKSNPN VPEVNSLGSA QNHGEHHDLE DGFGRFETIE TKSPMKPGGM AGFFGGSHHS
KSGSDDEESG GFSVQEQSPI KRDARNALGD IEVSDNGHLK GNAFGGFGFG KKKLNKLQHS
TNELDSSEYD DNEDDFANYG FEEPEEQRHD SVGDMFDILN QNEGETDWDP QHVSSAAWHG
DVKGTWGAQG HNHDFGDDHT VSQSGSESQS HDNEEESGSY SEDDDSGSSD DNSYNGDDGD
DTTRRTREPL NSRAIGASVT ENMRHLDAMV HHGHWNGFVQ KTAELAEDRS EGSEDKSEEE
SFSGSGTSRT DSFVDDGLDG GEDGLSLNST EKSTREKYRL QVEQLVQKVA PEESDNINAM
FDKFLGREAE LLQTLESMND RSASQRARKA VHRSKAFPQQ SGRLSAGGLD GSAAIAAAST
LGGGFYDKGD DEHDENRSSD ENSHSYESSD EFSGNASGSG SYEDGQGSHR SDIYNKPEGS
SRSGSASFDV VDGSYRSESG SYDDAGSDSY ASDSGSDENS