Gene PHATRDRAFT_34924 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_34924 
Symbol 
ID7200134 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp668946 
End bp671546 
Gene Length2601 bp 
Protein Length528 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179265 
Protein GI219116941 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0285995 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCAAC TGATAGCTTC CGATACAGTG GCGGAAGGGG ACACTTTCGT GTGTCGACGA 
CGACGGCTTC GTCAGAGTTA CTCCTCCTCT GGCAGCGCAT CGGCCGCAAT AGGACCCATA
GTAGCAATAC TGCTGCTTCT GTCTCATCAG GGAGAAGCCT TTTTGCACTC GCAAGAAAGC
TGTCTTCGCC GGGTCTCCCC GCGCTACATG GTGAGCTCTT CCCAAAGAAT ACGAAGCAAC
AAATCCGTCG TCTTGAGGAA TCGTCTACAG GCGACGTCAA TCGACTACGA GGATGGCAGC
CAAAGCCGGC TTATACCAGA AGATGTGTCG GATATGCAAG CACTACGCCA GAAGATATGG
GATCTCGAGC AACAAACCCA TATGCTAATA AAGGCGAACG ACTTGCAAGG TGCCTTGGAG
GGAATACAAA GACTACTGGA AGTCATCCGC GTTTCTTCTG CCAATGCTGC GACGGCCAGT
GATAATACGA TGATGCGCTC TATGAGTCAA TGCCTGGACC GAAGCGTTCA AACTTTCGCC
GCCAGGACAT TTAATGTAAA AATGGAATCA CCATCCCGAG CACGAAAGCA TGTCATGATG
GGCGTCGAGG CTTTACAACT TCAGCTTTCG TCGCAGTTCC TGTCGGAACC GTACAACTTG
CTGCCCAAGA TGACCTTTTT GAATGCCTTA AAGGCACTCA CACAACTAAT TGAGGTGGGA
CGAGGTGAAC AGCACGACCC ACTTCTTTCA AACATGTCCG CAGCTGCCTT TAGGATTTTG
CAACGTCTGG TAACCGGTGT GGGCATTCGG AACAAATCTT CCCCTTTGGT GGTATACGAA
AAGGATTTTT GCATGGTTCT CAACGCCTTC ACCGAGTCAG GAAGGATGGA CATGGCGCAT
CGGATTATTG CTTTGCAAGA GCGGACCGAG CATGCGCCGC CACTATCGCC AGTGGCCTTT
TCGATTCTAC TTAAAGGATA CGGTAGATTG AAGGATTTGC AGCAGGTAGA GATGGTCCTC
CAACATTCCG AAAGAAGCAA AATTACTCCG GATACGGTCA TGTTCAATAG CCTGATTGAT
GCGTACGTCA ACTGCAACGC TATCGACAAG GCCCGTGGCG TATTTGATCG AATGCAACGT
CCACAGGATA TGCTCAAGGA CGCAATTGCC ACATCCTTTA CTTGTCCACC TCCAAACAAG
AGAACTTACA ATACCATGCT CAAAGGCTAT GCTAACTTGG GTATGCTCGG TGCGGCATTA
GAACTGTGTG AACAAATGCG GAGGCGGCGC ATGTGTGACG CTGTGACCAC CAACACTTTG
GTCCACGCAG CGGTAGTAGC GGGTGACTTT GGTATGGCCG AACGCGTCTT GTCGGAACAG
ACTGAACGAC AACCTAAAGA AGCAGGCTCA CAGCATCCAA ATGTGGAAGC TTATACAGAG
CTACTGGACG CATATGCGAA GTCTGAGCAA CTAGATAAAG CAGTTTCAAT CCTTCCACTC
ATGCAGTCCC GTGGAGTAGA AGCGAATGAG TATACTTACA CGTGCTTGAT TGCAGGCTTT
GGACGGGCCA AGCGTATGGA AGAAGCAAAG AAAATGATGG CTTACATGAG AAAGATTGGA
ATGCAACCTA GCGTCATCAC GTACAATGCA CTCATTTCAG CTGTGTTGGA GCTGGAAGCC
TCTAACGATG ACTTGGATAG ATGGGTTGAT CTTGGGCTGA AAATATTACG CGAGATGATT
CACGCACAAG TTCGTCCCAA TGCCGTGACG GTATCTGCGT TGGTGGAAGC TCTTGGTCGC
TGTGACGAGC CTCGTGTCAA AGAAGCATGT ACGCTTGTGA GCAAGCTCGA GAAAGAGAGA
ATCATTTCGA AAGGAACTCC CCGTGTGGTG ACTGCACTTG TTCAGACTTG CGGTGTGGGC
GGAGATATCA AGGCATCTCT GGAGGCATTT AGAACGCTGA GAAAACCAGA CACAATTGCA
GTGAATGCGT TTCTTGATGC ATGCTACCGT TGTTGTCAGG ATCGGTTAGC TTTGGAGACG
TTCAAATACT ACTTTCACAA ACGAAACGGC CAAGCTAAAT TAAAGCCTGA TGTAGTTTCT
TTCTCGACGC TGATATCTGC ACTCCTGAAA AAGAACACAA GCGACAGTCG GGGAAGCGCA
CTGCATTTAT ACAATGAAAT GCAATTGAAG GCTTTGATAA AACCTGACAA TGCTCTCGTC
GACATAGTCT TGAAAGCTTT GCTGAAAACG GCACAAACAA ACTGGCTTAC TGACAGTGAC
GTTCGATTCG TTGCCAATGT CCTTCGAGAC GCCGAAAACT TAGGATGGGC GGATGGCCAG
CTTTATCGTC GAAAGCGCGC TGTCCGGGCT GTGCTCGCCG ATCGGTTGCG GGAAACGTTT
AATCAGGACG ACGATCTCTA CAGATTAGTT TCTCCGGATG TTGGAGTCGA TGAGTTATTT
CAGAAGCACG GGTGGAATCA GGTGGACTCC GGATTTCGAT TATGGGGGAG AAACAATGAC
GTGGCCGATG GAGAAGGAGT CGACAAGTTC CTTCAGTCCA AAGGCTGGAA TAACGTCGAT
TCAGGATTCC GAATATTCTA A
 
Protein sequence
MTQLIASDTV AEGDTFVCRR RRLRQSYSSS GSASAAIGPI VAILLLLSHQ GEAFLHSQES 
CLRRVSPRYM VSSSQRIRSN KSVVLRNRLQ ATSIDYEDGS QSRLIPEDVS DMQALRQKIW
DLEQQTHMLI KANDLQGALE GIQRLLEVIR VSSANAATAS DNTMMRSMSQ CLDRSVQTFA
ARTFNVKMES PSRARKHVMM GVEALQLQLS SQFLSEPYNL LPKMTFLNAL KALTQLIEVG
RGEQHDPLLS NMSAAAFRIL QRLVTGVGIR NKSSPLVVYE KDFCMVLNAF TESGRMDMAH
RIIALQERTE HAPPLSPVAF SILLKGYGRL KDLQQDRLAL ETFKYYFHKR NGQAKLKPDV
VSFSTLISAL LKKNTSDSRG SALHLYNEMQ LKALIKPDNA LVDIVLKALL KTAQTNWLTD
SDVRFVANVL RDAENLGWAD GQLYRRKRAV RAVLADRLRE TFNQDDDLYR LVSPDVGVDE
LFQKHGWNQV DSGFRLWGRN NDVADGEGVD KFLQSKGWNN VDSGFRIF