Gene PHATRDRAFT_47417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47417 
Symbol 
ID7202552 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp576027 
End bp579128 
Gene Length3102 bp 
Protein Length1009 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181586 
Protein GI219122509 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.143888 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACAATG GGGAAAAGAA GTCGACGACC TGGCAAACCC TCGCTTGGTG CACTCGCAAT 
TCTTTTCATA TGAGATTGAA GAGAGACAAG AAATTGATCT GTCGAATGCG GCAAAACAAC
GTCCCAACGG TTTTAAATGT CGCCGAAAAG CCATCCGTGG CCCGGGCTTT GGCTTCTGTC
TTTGCCCGAT TACCCAACGC CGTTGATCGT GGAATGCGTC GAGAGGGTAA TCAAGTGTTC
AGCCACGAAA ATGTCTGTTT TCCTAGCGTG TTTTCTCAAG GAAATGGTGT ATGCGTTCAA
GGACCTAGTA CGTCGCAACG AGCTGCTTTG TAGTACGTCA AGCCGCTTTC ATTTTGCCTC
TCACAAACTG GTTACGCGAT ATGCTAAGTA GTAGTCCCAC ACAGTATGAT AACAACATCC
GTAAGGGGTC ATTTAGCGTC CCAAGACTTT CCTCCGGCGT ACGGATGGTC CAAGTGCGAC
CCAATTGCCT TGTTTGAAGC TCCGATTGAA ACGTCGTATC GCGACGACAT GCAGCCCCTT
GAACGCATGT TGAAGTCATT GTCACGTCAA GCCCAGGTCC TAATTCTCTG GTTAGATTGC
GATCGGGAAG GAGAAGCCAT TGGCGATGAA GTACGTACCG TGTGCTTGGG GTCAAATTCA
AGGCTGCAGG TGTACCGGGC ACGGTTTTCG ACCGTGTTGC CAGTTGAAAT TGAGAGAGCC
CTACAAACAC TAGGTCGAGT GAACGAGTAC ATGGTGGCAG CGGTCCAAGC CCGATCTACG
CTAGATCTAC GAGTAGGTGC CGCCTTTACA CGATTTCAAA CCCTACGATT GCAACGCAAA
TTTGACGGCT TTGCTGAGCA GGGTGTCATC TCCTACGGCC CATGCCAGTT TCCGACTTTG
GGATTTGTTG TGGAGCGATG GGCACGGATT GAGACGTTCA TCCCAGAGGA CTTTTGGCAT
TTAGAGCTAG CGATTTCTGT GGATGACACT TACCACCAGC AATCGCAAGA AGAAGACCAA
AGTGCTGCTC GTGGACGTAC CCAACAAAAT CGGACAATTC ACTTTTCGTG GAAACGCGGA
TACTTGTACG ATAAACTTCT GACCACCGTG CTATTCGAAG AGTGTTTGGA AGCTGGAGAA
GCGGTCGTTA CCGCAATGAA TGGGCGCACT AGGAATAAGT GGCGCCCCGT CCCACTAGCA
ACGGTCGAAC TACAAAAGCG GGCTTCAATG TACTTGCGAA TTGGATCCGA AACCTTGATG
TCGGCCGCAG AAGAATTATA TCAACAAGGT TATATATCCT ACCCCCGTAC CGAAACAGAA
CGTTTCAGAC CAGAGTTTGA GCACCGCCCA TTAATACAGC AATTTTCCTC ACTTCAAGGG
GAGTTCGGGG CCTATGCTTC CAAGTTGCTG AATGAAAATG GGTTTCAAAT TCCTCGTGCG
GGGAAAAGCG ATGATCAGGC CCATCCTCCC ATCACGCCTG CCAAAGCCGT TGATCCAAAT
ACTATTCAGG ACCAGATACA GCGAAAAATA TATTCTTTGA TTGTAAAGCA CTACTTGGCC
TGCTGCTCTC GTGATGCTGT TGGGAGAGGA ACAACGTTGA CAGTCCGAAT GGGCACCGAA
GAGTTCAATG CAACTGGTTT GATGGTTATC GAGAAGAATT GGTTAGAAAT ATATTCCCCC
TGGGAACGTT GGGGCTCAGG GCAGGGAGAA TTACCTCCGC TCCAAGTCGG TAGTCGCATA
AGGCCGACAT CGTTCCTGAT GAAGGAGGGT CGCTCAGGCC CTCCACAACC CATTTCAGAG
GTGGAGCTTA TTTCGCTCAT GGATCGCAAT GGTATTGGAA CTGATGCCAC GATTGCGCAA
CACATATCCA CCATTCTCGA TCGCGAGTAT GCTAGAAAGG ACGGCAGGCA AAAATTTCTA
CCAACACCTC TGGGAATCGC ACTCGTGGAA GGCTACAATT CCATGGGATA TCAACTGAAC
AAGCCCGACT TGCGCCGTGA GATGGAGGCC GAATGCAACG AAGTCGCTTC TGGACGTAAA
ACTAAGGAAG AAATTATGGT GCCCATTCTT GCGAAAATGA AAAGCTGCTA CGAAACGGCA
AGAGCTGAAG CTCGCAAGCT GGACGAAGCT GTTGCACGAC ATTTTCCTCG ACTCGGTGCC
GGTGAGAGCA CATCTCAAGT TGTGGAAGAG AGTTTCAGCG AATGCGGAGT CTGTCGCAAC
AGCATGGCGT TGAAGCAAGA ACGAGAAAAT AACAACCGTA CAACAGCTCG CAACACTGTG
CGGCGCAAAC TGTTGTACTG CAGCACATGC CGGGCAGGCT GGACTTTACC ACGGGGTGTA
GTCCGACCAA AAACAGAACA AGAGGACAAT GGTCCTCCTG TCAAATGCCC CATATGTCAA
TTTCAGGTGA TTCGGATATT GCGAGGGGAG GGCTATGAAG GCAACGGTTA TCACGTTTGC
CCCAAGTGCT TTTCGGATCC ACCTTCCGAT CACGGTGGTG CCAGCAACGC TGGCGACTTC
CGCTGCTTTG CTTGTCAACA TCCAACCTGT GCTCTCGCCA GCGGAACACC GGGAGGTGAC
GTTGAAGTCT TTCGATGCCC CTTTTGCCAT CCATCGGCAC AACCAACTTC GACCTCTGAT
TCCGGGAAAG TATGCGTACG CAAAACATCA CGCGGATACG TACTTTCTTG CAACAAGTAT
GTACGAGGTC AGGACCGATG CTCGTATACA ATCTGGCTCC CCAAGGAATG CCACAAAGTC
TCTGTGCTCT CGGGCGATGA AAACCAAAAC GAGATCTGTG GTCGATGTTC CTCGCCGCGT
GCTGTCATTC GCAAGGTCCA TTTCGTCTGG AAACCCGGTA GCGTTCCGCC GCACTTGGGG
CGTGAATGCA CCGTGTGCGT GCTATGCGAT GCCGATTTTC GTCGTGAACT CAATATTTCG
TTGCCACAGA TGAACCAAGT ACAAAGTCGA CCCCGCACGA CAGCCGGTCG GGCAGGGCAT
CGCGGTGGAG GTGGAACAGA GACAGGGCAG GGAGGTGCAG GAAACACTTG TTTCCACTGC
GGCCAGCCCG GTCATTTTGC CAACAGCTGT CCAAATAGAT AG
 
Protein sequence
MDNGEKKSTT WQTLAWCTRN SFHMRLKRDK KLICRMRQNN VPTVLNVAEK PSVARALASV 
FARLPNAVDR GMRREGNQVF SHENVCFPSV FSQGNGVCVQ GPSTSQRAAL YMITTSVRGH
LASQDFPPAY GWSKCDPIAL FEAPIETSYR DDMQPLERML KSLSRQAQVL ILWLDCDREG
EAIGDEVRTV CLGSNSRLQV YRARFSTVLP VEIERALQTL GRVNEYMVAA VQARSTLDLR
VGAAFTRFQT LRLQRKFDGF AEQGVISYGP CQFPTLGFVV ERWARIETFI PEDFWHLELA
ISVDDTYHQQ SQEEDQSAAR GRTQQNRTIH FSWKRGYLYD KLLTTVLFEE CLEAGEAVVT
AMNGRTRNKW RPVPLATVEL QKRASMYLRI GSETLMSAAE ELYQQGYISY PRTETERFRP
EFEHRPLIQQ FSSLQGEFGA YASKLLNENG FQIPRAGKSD DQAHPPITPA KAVDPNTIQD
QIQRKIYSLI VKHYLACCSR DAVGRGTTLT VRMGTEEFNA TGLMVIEKNW LEIYSPWERW
GSGQGELPPL QVGSRIRPTS FLMKEGRSGP PQPISEVELI SLMDRNGIGT DATIAQHIST
ILDREYARKD GRQKFLPTPL GIALVEGYNS MGYQLNKPDL RREMEAECNE VASGRKTKEE
IMVPILAKMK SCYETARAEA RKLDEAVARH FPRLGAGEST SQVVEESFSE CGVCRNSMAL
KQERENNNRT TARNTVRRKL LYCSTCRAGW TLPRGVVRPK TEQEDNGPPV KCPICQFQVI
RILRGEGYEG NGYHVCPKCF SDPPSDHGGA SNAGDFRCFA CQHPTCALAS GTPGGDVEVF
RCPFCHPSAQ PTSTSDSGKV CVRKTSRGYV LSCNKYVRGQ DRCSYTIWLP KECHKVSVLS
GDENQNEICG RCSSPRAVIR KVHFVWKPGS VPPHLGRECT VCVLCDADFR RELNISLPQM
NQVQSRPRTT AGRAGHRGGG GTETGQGGAG NTCFHCGQPG HFANSCPNR