Gene PHATRDRAFT_42858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42858 
Symbol 
ID7196447 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1334860 
End bp1338398 
Gene Length3539 bp 
Protein Length1105 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177271 
Protein GI219111039 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAGTAAATCA TCCTCTACCA AGAATCAGTA CAGTGCCAAA GATCTCTCTG TAGCATTCCA 
AGCGAGCTCC TTCAACCTAT ACACAACTGC AGTTGTTGTT GTTAGATTGC ATTCACAAAC
GCGAACGCGA ATAACAGCAT GGTGGATGGC CAACGGGAAG TCTCGTACCG CCCGACCCCG
ACGGGCAGTC CGTCGCGTTC CGGACCTCCC AGTATGACGG TCCGAGCCCG TCGACGCCGC
GAAGAATCCG TACGTCGTTT GGTTGCCGAT ATGCAAGCTC GTCGAGGAGA AGACGAAACG
CACGAATGGT TGGTCGCTTC CACAATCGCG ACGGCCCTAG AACAAGGATT GGACCGAGAG
TTGCACACGG AATTGGTGCA AGAAACAAAA GACAACGCCT CCCGGATTGG CCAAATCTGT
CACGATCACG CCAACGTCTT CCTCTCCAGT GTTGCTCAAG TAGCGGCTTT GGGCGAGCCT
TCGGCGCAGC TGGCGCAGGG ATTAGAAGAA GCTTACCAGA CGCTGGAATC GCAAACGGCC
GGACCCCTGC AACAAGCCGT GGATCAACAC AAACAAGCCC ACAAATCCCT GACGCAGGCC
AAAGCACTTT CGGAAACGCT CGTGGCTTGT CAGCTTCTCG CGACGCAACT CGAAAAAGCC
CGTAAACAGG CGTTGCTGGG ACGTCCCCGG GCCGCACTGG CGGCCGTCGA TCAAGCCCGC
ACCGTTCTAA CACAACCACT CCTCGTGGAC GCGAGTTTGA ATCAAAATAA AGAAACACGA
TTGACGCTGG AACAAACCCC GCTTGGGCGT CGCGCCCAAA TTGTTCTACC CAAGCTCGAA
ACCGAAGTCC TGCAAGCCGC TCGGCGGGCC TGGAATCGTT GGTGGGTACA ACTCCGCAAT
GGCGAACAAG CGAAGGCTGG TAAGGCTGTA CTCCGACAGG TGGGTCACGC CGTTGCCACG
GGTCCGTCGC AGCTAAGTCT TGGGGGTAAT CTGCCCTCGA GCTACGTCTG GCGGGCCCAG
ACTGCTCATA ATCTCGTATC TCGGGTGGAT CAGAAAACGT CCGTCGCCCG CGCGGCACGG
GCGGCCTACT GGTTGGACCG GGACGCCGCT AAGGAAGCGC AACGCATCGC CACAATCACT
TCTCACGGAC TAGCCCGCAA ACTCGAATCG ATCGCCGCCT GTCTCGGATG GTACCGTTGC
TGGGATGCCT CGGCTTCGCT GCTTTTGGAT TTGTCGGAGT TCAACGGGAC GGACGCGGAA
GGCAACTTGT TGGTGGGGAG TGGCTCTCGA CACGGATCCC GACACGGATT GGCGGGATCG
CGTCACGGTT TGCGGGGGTC ACGACACGGA AAAGCCCGAT CGCTGGGTTT TCGGTCCACA
GCGTCGCGCT CCCAAACTAC CGCTCCGAGC ACGCTGAGTG GTGCTGCGTC AGGGACCAAT
GTGGCTACCG GCAAATGGGC CGAAGTATTG CTTCCGGCAG TGTTACTTTC GCAAACACCC
TCCCGGTATG TCTCTCCTAT ACAATTGCTG ATGCGCGATG CTTTCTCTTT TTGTTTGCTA
ACTCGATGCG ATATTACTAT AAAAATAGCC GTGAGGAAGA CGAAATCCTC ATGTCCCTTC
CTGAATCGGT ACACCCTGTA AGACGGGCAG AATTAGCGTA TCGTCTCTTG GGTCGGACGG
ACGAGTTTGT GCAGTACTAT GAACAGAATC GCTTTAATGG TGGTGCCGAG GACGCAGAAA
GCGAGCGATC GGCACTCTCT GCGCTTACAG GCGACGACAT CACCTTGGGA TCTGATCGTA
CGTTCTTCAG CAAGACCCTA CCAACCTTGT GCGCTTCGAT TGTCGGATTT ACCGCCGTCG
AAGCGGCCTT GGAAGTCGGG ACGTTCGCTG GAGATGAAGA GGAAAACACT GAAGAAAGCA
AAGAATCACG AGCGTTTGCC GCGGTTTCTC GTTCGCCTGC TGCAGCGACA GGTAGCACGA
CCTTGACAGC CAGTCGACTA CGGGAATCTT CCGAACGCTA TGAACGAGCC CTGACGAGCG
AGCTGGGTGA TTTGATTCGC GAGCGTGCCA AGCGTAGCAA TCTAGGTGAA CTGGTCCGCT
CCTCAATTCT CATGTCGAGC TTCCGTTCAT CTCTCAAAGT AGTCCATCCG AGTTCGTCTT
CCCGTCGGCA CGACAAGGAC CTACTGGCTT TGGACACGGA AATCCTCCTA AGAGCCCTCA
AAATTTCGCA AGATGAACAG CTCCGCGCAA CTACTGCAAT TGTGGCGGAG GATCGCAAGG
TCCCCATGCT AGTAGCGGAT TCTTCTGCTG CTCAAAAAGG AAGATCCATG CAGCAACCCA
CATCGGGAAT CCCCGACCCC GAAGAAATTG GCTTGCCTTT TGGTTTGAAC CAAATGAAGC
AGCAACCTAC AAAATCCAGT TTGCAGTTTC AAGAACAATC TCAAGCATCT TTCAATCGAT
CGGCCGTCGA CCAAGCTTAC ACCTTTTCCG ACTCCTTACC TACTGTGATT CGGTCCTTGC
ACGCCCGAGC CATTGCTATG GTTGTATTTG CTCTGAGTCA GGAGGAGTTG GGTCAATCCT
TTTCTGCAAA GAAGGGAAGT AATGCGGCCG GATACGTCTT GGATGCCATT GGCGAGTTTG
TCAACGTGAC ATCTGTTGGC ATGAAGGACA GTGATAATGT CGTGGACGAA GGATCCGTCG
AAAAGGCCGT ACAAATTATG GCCAATATTG CTGCTCTGCA GCATTGTCTT CCTCGCTTTC
TAGGTACTAT TCTCCGAGGC ATGTGTCATA TAGGAATGAT TAAAGCTGAG GAATTGGACG
AAACGTTTGT GTACGCTGAA ATGACGCTCA AGTCAGCTGA CAAAGCTTGC GACGCGCAGA
TGGGTAGCAC GTACAGTTTG GTCTATGAGA TTTGCCGTAA CAAGATTGAT TCGCACATCA
ACTATGCCTT GGAAAACTTC AACTGGGTCG CAAAATCGGT GCGTGATATG CCGAACGCCT
ATTGCGAAGG GCTGATCGGG TACATGCGAT CCGTTTTCAA TTCGTTGGGT CCAATGGACG
AAGGATCTAG GGCTGGACTC CATTTTTCAT GCTGTGGTCA TGTTTCGGAA CGTCTTGTCA
AGTTGTTAGC TGGAAAACCC GGTGATACCG CCACATTCGA CGACTCTGGT CTACCGCCAA
TCGCACGCAT TGATGCCTTT GGTATAAAAA ATTTGGCGTT GGACTGCGAT GAGTTGGAAA
AGTTTGCCGA TTCAACGGCC ATTCCTCAGT TGCGCGACTG TTTCAACGAG CTTCGAGTGC
TGACTTCTGT GATGCTGGAC AAGGACCTTC CTATGCTTGT TATGCCAGAG AATGTAGCTC
AACGTCGACG AAAATACCCT ATTCTGAGTA TGGACAAGGT TGGTAACATT TTGGAAAAGT
ACGTAGGCAC TGGGTTGGGT GACAAACTTA TGGGAGGCTC TCGCAAAGTG GATATTCTTT
TTATTGACAA AAAAGAAGTT CAACAACTGA TCAAAATTGT GCGATCACAG GGTATTTAA
 
Protein sequence
MVDGQREVSY RPTPTGSPSR SGPPSMTVRA RRRREESVRR LVADMQARRG EDETHEWLVA 
STIATALEQG LDRELHTELV QETKDNASRI GQICHDHANV FLSSVAQVAA LGEPSAQLAQ
GLEEAYQTLE SQTAGPLQQA VDQHKQAHKS LTQAKALSET LVACQLLATQ LEKARKQALL
GRPRAALAAV DQARTVLTQP LLVDASLNQN KETRLTLEQT PLGRRAQIVL PKLETEVLQA
ARRAWNRWWV QLRNGEQAKA GKAVLRQVGH AVATGPSQLS LGGNLPSSYV WRAQTAHNLV
SRVDQKTSVA RAARAAYWLD RDAAKEAQRI ATITSHGLAR KLESIAACLG WYRCWDASAS
LLLDLSEFNG TDAEGNLLVG SGSRHGSRHG LAGSRHGLRG SRHGKARSLG FRSTASRSQT
TAPSTLSGAA SGTNVATGKW AEVLLPAVLL SQTPSRREED EILMSLPESV HPVRRAELAY
RLLGRTDEFV QYYEQNRFNG GAEDAESERS ALSALTGDDI TLGSDRTFFS KTLPTLCASI
VGFTAVEAAL EVGTFAGDEE ENTEESKESR AFAAVSRSPA AATGSTTLTA SRLRESSERY
ERALTSELGD LIRERAKRSN LGELVRSSIL MSSFRSSLKV VHPSSSSRRH DKDLLALDTE
ILLRALKISQ DEQLRATTAI VAEDRKVPML VADSSAAQKG RSMQQPTSGI PDPEEIGLPF
GLNQMKQQPT KSSLQFQEQS QASFNRSAVD QAYTFSDSLP TVIRSLHARA IAMVVFALSQ
EELGQSFSAK KGSNAAGYVL DAIGEFVNVT SVGMKDSDNV VDEGSVEKAV QIMANIAALQ
HCLPRFLGTI LRGMCHIGMI KAEELDETFV YAEMTLKSAD KACDAQMGST YSLVYEICRN
KIDSHINYAL ENFNWVAKSV RDMPNAYCEG LIGYMRSVFN SLGPMDEGSR AGLHFSCCGH
VSERLVKLLA GKPGDTATFD DSGLPPIARI DAFGIKNLAL DCDELEKFAD STAIPQLRDC
FNELRVLTSV MLDKDLPMLV MPENVAQRRR KYPILSMDKV GNILEKYVGT GLGDKLMGGS
RKVDILFIDK KEVQQLIKIV RSQGI