Gene PHATR_46872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_46872 
Symbol 
ID7204427 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011679 
Strand
Start bp675231 
End bp677680 
Gene Length2450 bp 
Protein Length670 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185929 
Protein GI219121409 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.916479 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGTTGACCGA ACTCAAAAGA CCTGCACGAC AATTTTTTGC AGCAACCACC AAAAACACAA 
AGTAGGAAGT ATCCTATCGG ACAAAGGCAA TACTATTCAC CGCATGCAGA GGAACGCTAT
GCATTCTCCC TCCAGATCAT CGCGGACAAC GGTGTCATCC AGAAATCATA GACAGCGGAG
CTGCTGCAGT GTCGCGACGT TCGCCCAGTG CCTCGTGTTC GCTTCGCTAC TCCACCAAGG
TGCGGAAGCG CGCCTTCACG ATGCCGGCAA CCGTTCGAGG CGAGCGTCGG CATCCAACGA
AACGGGTGTC GATCTCGTAG ACGTAGTGAT CGGTTTCGAA AAGGGCGAAG TACAACGCGG
CGGAATCAGT GACTTGGAGT TGCTGGAGCG CGGTGGATTG CTCGGACTCA CCGCAGACGG
CGCGCGTTCG GATTTTCGAC GCACCAATGC CGTGTCGATG CAGATTCCGG CTAGTGAAGT
GAACAGTCTG CGTAGTACGA CGGGAATCTC CTTTGTAGAG TACGATGCCA AGGCTTTTAT
GCACGAAGAT GAGATCGTAA CAACCCGTGC TACTCAGGAA ACTAATCTTC CCTGGGGGCT
CGCAGCGATC CAAGCCAACA GCAAAAAGAT TCCGTTACCC TCACCCAGTA GCGACTGTTT
CAAGATCTGT ATCGTTGATT CAGGTCTTCT TGTGGAACAT CCAGACATTG TAAGTCGGAT
TTGTTTCTTG TCAACAGAAC TTTTTGTTGT TGAAGTTCGA TTGCTCACCA TTGTTTTCTA
TGGCACTCTT TTTTAGCCGT ATACACGAGG AGCTACGAAC ATTGCCGGGG AAGAGTTCAA
CATTGGAGCA CTTAGTAAAT GGGATGAACC TATTGCAATT GCCAACCATG GAACACACGT
TACGGTACGT ACTTTTGTTT GGCCATTGAA ATACTCCAGA TGTGACACGC ACGCTTCGCA
AAAAATCTGA CTCAAAACCT CGTATTGCTG CATTTTCAGG GTACCATGTT GGCTACCGGC
ACTCGCAACC CGAGGGTGGT TGGTGTAATA CCCAACAACC AGAATATTTG TCTGCTCGTG
GCTCGTGTAT TTGGTGACGA AGTCAACCCA GGTGCCTCCA CTTCGACTAC CGATCGTGCG
GCGGAATGGT GTGGAGACAT GGGTGCTAAG GTTATCAACC TTTCCTTCGG TAGCCCCACA
TTTTCGACCA ACTCGGCCGC CATCTACAGA AGCTTGCGAT CCGAAGGAGT TCTGTTGGTT
TCCTCGGCCG GAAATGCACA AGACAATTCC AAGTCGTACC CGGCATCATA CTCGGACGTT
ATTTCCGTTT CATCCGTCGA CAAAGACCTA CAGCGATTCT GGACGTCCCA ACACAACGAC
CAAGTGGACT TGGCCGCCCC CGGAGTCGGC ATTGTATCGA CCGTTCCCGG TATGGGTTTG
TTCGACGAAC TTGGTAGAGA GTACAGCACG GCGCTCATTG ACTACTCTCC TTTTATTACA
TCTCCCCTTA CTGCGGGTGT GGTCCAATGT GGACTAGGCA CAGAGCCGTG CAAGGATGCT
GAAGGGCGGA TCTGTCTAAT ACAAGTTGGT TCCAACGCAA TAGACGACAA GGCTTTTCAT
TGCGAAATGG GTGGCGGCGT TGCAGCCGTT ATTTATAATA CTCAGTTTAC AACCGCTCCG
ACCCGCGGCG CTATTAACAG CTTTGTTTCC ATCCCCGTCA TGGGCGTGAG TTACGCAGAT
GGCCTTGCAT TGTTATCGAA AGAATCTATT ACTGTCGACC TACAAGTCCC AAGCTACACC
GAAAAACGCG GCACCAGCAT GGCCGCCCCG CACGTTGCGG GAGTTGCCGC CAAGATCTGG
GCCGCGCGAC CTGCATGTTC CAATAATCAA GTACGCGACG CTCTGGAAGC TACCGCGAAG
GATCTTGGCG ACCCTGGTCG CGACGACATG TACGGACACG GACTCATCCA AGCGGAAGCC
GCCTACGAAT ATCTTCTTGC CTTGCCAGCC CCTTGCGGAA TTGGCACTAC CGGTACGCCC
AACACGATCC AAGGCTCCGA TAGTGATAGC ACTCCTACGG CAAGTAGCGG TGGGAACACT
CCTACGGCAG GTGGAGGCGG AAGCACCCCT ACCATTGATC GGGAAACCTT TCTGGCAAAC
TTGGCCAAGG AGGGGACCCG ATCACAGGCG AATGATAAGA ATTCTTTGCT CAAGTCCATT
CAAAATAAAA CAGGCCGGGT CCGGGGTGGC GGAGGTGTTC GTCGCCGGTT CCTGAAGGGT
AGCAAGGCGA AACAATAAAA CAAAGCACCG GTGCAACCGT TGCAATTGAC GATTCCGGCA
AAATTTAGGC GCCAGTGTTT TGCCGGAATG TTACCACTTA GCGTCCTCTC TTATTATATC
CATAGATTCT TTCAAGAGGA AATAACACAG TAAGATCATT TTTTCCTCTT
 
Protein sequence
MQRNAMHSPS RSSRTTVSSR NHRQRSCCSV ATFAQCLVFA SLLHQGAEAR LHDAGNRSRR 
ASASNETGVD LVDVVIGFEK GEVQRGGISD LELLERGGLL GLTADGARSD FRRTNAVSMQ
IPASEVNSLR STTGISFVEY DAKAFMHEDE IVTTRATQET NLPWGLAAIQ ANSKKIPLPS
PSSDCFKICI VDSGLLVEHP DIPYTRGATN IAGEEFNIGA LSKWDEPIAI ANHGTHVTGT
MLATGTRNPR VVGVIPNNQN ICLLVARVFG DEVNPGASTS TTDRAAEWCG DMGAKVINLS
FGSPTFSTNS AAIYRSLRSE GVLLVSSAGN AQDNSKSYPA SYSDVISVSS VDKDLQRFWT
SQHNDQVDLA APGVGIVSTV PGMGLFDELG REYSTALIDY SPFITSPLTA GVVQCGLGTE
PCKDAEGRIC LIQVGSNAID DKAFHCEMGG GVAAVIYNTQ FTTAPTRGAI NSFVSIPVMG
VSYADGLALL SKESITVDLQ VPSYTEKRGT SMAAPHVAGV AAKIWAARPA CSNNQVRDAL
EATAKDLGDP GRDDMYGHGL IQAEAAYEYL LALPAPCGIG TTGTPNTIQG SDSDSTPTAS
SGGNTPTAGG GGSTPTIDRE TFLANLAKEG TRSQANDKNS LLKSIQNKTG RVRGGGGVRR
RFLKGSKAKQ