Gene PHATRDRAFT_42100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42100 
SymbolRPC157 
ID7202167 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp373133 
End bp377428 
Gene Length4296 bp 
Protein Length1281 aa 
Translation table 
GC content48% 
IMG OID 
Productrna polymerase C 157 kDa 
Protein accessionXP_002181195 
Protein GI219121691 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCAGTC GGAAACCCGC GCACGGGGGC TGTCTAGACC CACGTCTCGG AGTGTCAGAC 
AAGGTCTCGG CCTGTGCAAC TTGCAAACGA AAATTGGTGG ATTGTGCGGG ACATTTTGGC
TACATTAGAT TAGCCTTGCC CGTCTTTCAT ATTGGATTCT TACGCCATAC ACTACATCTG
CTCCAATGCG TCTGCAAAAC CTGTTCCAGG GTACTCATGC CCGAGTCGGA TCGGCAAAAG
CATTTGAGCC GCATGCGCAG TCCAAAGACT GATGCGTTGA GTAAGGCGGC ACTCTTCAAA
AAGGTGGTGG AAAAATGCAA AAAAGTTCGC GTGTGTCCGC ATTGTGGAGG ACACAACGGT
ACGGTAAAAA AGATCACCGG TGTGCCGACG CTTAAAATCG TACACGAGCG CTACAAGGGA
CGAAACGCCG AGGACGAGCT GGACGAACTA ATTGCCAATC TAGAAGTCTC CTTGAAGATG
AATAAGGATG TCGGGACTGC CATTTCTGGG TCGAGTCTTC CCTACGAGGA TTTGTTGCCA
ACTCGAGCTT TGGAGATCTT TACGCACATC TCCGACGATG ACTGCGAAGT CTTATGGATC
GATCCGCTCA TTGGGCGACC AGAGACGCTC ATTTTGCAGA GTATTCTCGT CCCGCCAGTT
CCCATTCGAC CGTCGGTGGC GATGGATGTT GGTGGAGGTA GCAACGAGGA TGATCTTACC
GTCAAGTTAC AGGAGATAAT TGATGTCAAT GTGGCTTTGG AACTGGCCTT GATGAAGGGT
CCCCAGACAC GCACCATTAT GGAAGAGTGG GACTTTTTGC AAGTACAAGT TGCGCAGTAC
ATCAATGGCG AGATGCCAGG TTTGCAAAGA CCGATCGGGT CGAAACCCAT GAGGGGCCTA
TGTCAGCGTC TTAAAGGAAA GCAAGGTCGA TTTCGAGGGA ATCTTTCGGG AAAGCGGGTA
GACTTTTCCG CCCGTACCGT AATTTCTCCT GATCCCAACC TTCGTGTAGA TCAAGTTGGT
GTCCCGGAGC GCGTCGCTAA AACCATGACG TATCCGGAAC GAGTGTCACG TTACAATATC
GAGAAGCTTC GTCAAAGGGT CCGGAACGGG CCGGACGTTC ACCCTGGGGC CAATCTTATC
CGCATGAAGG ACGGCTCGTT CGTCAAGTCG CTATCGTTCG GCGACCGAGA GCTTGTGGCC
AAGAATTTGC GGTACGGAGA TGTTGTGGAG CGTCACATGG AAGACGATGA CGTCGTATTG
TTTAATCGAC AGCCATCGCT ACACAAGGTT TCAATTATGG CTCATCGTGC CAAGGTCATG
GAATGGAAGA CCTTCCGCTT CAATACCTGC GTGTGCGCTC CCTACAACGC CGATTTCGAC
GGAGACGAGA TGAATATGCA TTTACCGCAA ACGGAAGAAG CACGTACGGA AGCCTCACTA
CTGATGGGTG TAAAACACAA CCTTACGACG CCTCGCAATG GAGAGCCTTT GGTTGCGGCC
AGTCAGGATT TTCTCTCGGC GTCATACATG CTGACGCAGC GGGATCGCTT TTTTACCCGG
GAGCAGTTCT GCCAGTTGGT GTCTTACTAC AGTGATGCGT CAGAAGACAT TGACATACCT
TTCCCCACAA TTCTTAGACC TGTAGAGTTG TGGACGGGCA AGCAAGTATT TGGCATGATG
TTGCGACCAA ACAGGAAGTC CTCCGTTCTC GTCAGTTTTG AAAACAAAGA AAAAAATTAC
ACAACGAACA AATACTTTTG TAAGAATGAT GGGTGGGTTG CGTTCCGAAA CAGCGAACTT
GTAAGTGGCA ACATTGCAAA GAAATCTATT GGGGACGGCA GTAAAAGTGG CTTGCTGTAC
ATTCTACTCC GAGATTGCGG TGTGCACGAA GCCGCGAGCT GTATGGACCG ATGGGCAAAG
TTCTGTTCTC GTTTTATGGG TGGTCATCGT GGATTGTCGA TCGGAATTTC GGATGTCACA
CCGTCCGCTC GCTTACGAGA TATAAAACAT GGGATTCTTT CTGAGGGATA CAAAAAGGAA
AGATTGATCA CGCCGATGAA ACGTTTTTGC CCACGTTTTC TCACATGTAT ACTTTAGGCC
GATGACAGTA TCCTCCAATA CGAAGAAGGA AGGCTAGAAC TCCGTCCTGG TTGTGATTTG
TTGCAATCTC TAGAAGAGAT CTTGAACGGA ATTCTTGGCC GACTACGAGA ATCAGCAGGT
CAGGAAGCCA TGAAGGCTCT ACCGTGGACG AATACACCTC GCATCATGGC AGAATGTGGG
TCAAAAGGCA GTCCATTGAA CATTTCGCAG ATGATATCTT GTGTCGGACA GCAAGCCGTA
GGCGGCATGC GTATCCAAGA TGGGTTTGTA AACAGGTGTC TCCCACATTT CGAATATCAC
AGCCTTATAC CATCAGCCAA AGGATTTGTA GCCAATTCGT TTTACACCGG TTTAACGGCA
ACCGAATTCT TTTTTCACGC CATGGGAGGG CGAGAGGGCT TGGTAGACAC GGCTGTTAAA
ACAGCTGAGA CTGGTTATAT GGCCCGTCGA TTAATGAAAG TATGTATACC TGATTTGGAA
AGTATTTTTG TCAAGCCAAA TACATTCTCA CGCTTTGTTC ACTCTCAGGC TCTTGAGGAT
CTGTCTTTAC AGTATGACTC ATCGGTTCGC AATAGCGAGA ACACGGTTGT GCAGTTTACG
TACGGCGACG ATGGTTTAAA TCCAAACATG ATGGAGAACA ACGATAGGCC AGTCGATTTT
GAACGCCTAC GCTTGCACAT AAGCCAAACA ACTCCTTGTC CAACTGAGGA TTGTTTGAGT
GCCGCTGCTT TAAGCACTAC CGTTGAGAAG AAGTTGGCGG AGCCGAGATT CCAGGCGTTA
CTACCAACAG GTCGTGTCTT TATGCAGGAA ATTCGAGACT TTTTCAACTC ATTGGCGGAG
AACCAAAAGT CGCTAGTTGT GGGGTCTGGG TCTGACGAGA GAATAGGTGT TCGTACATGG
AATAGCTCCC GCATGACGGA GACGCAACTC GAGTTGCTGT TGACGGAGGC ATTGGACAAG
TGTATGCTTG CATATGTAGA GCCTGGCGAA GCAGTTGGTG CGATTGGAGC TCAGAGCATT
AGCGAACCCG GTACACAGAT GACCTTGAAG GTGTGTCGTC GAATATTTGC TTGTACCTGT
GTTTTTGAAT CTGTTTCTAA TTATCGCTGT TCGTTTTCAG ACTTTCCATT TCAGTGGTAT
CAGCTCCATG AACGTGACGC TCGGCGTGCC GCGATTAAAG GAGATCATTA ACGCGGCCAA
ACTGATCTCA ACACCTATCA TCACAGCGAA GCTCGAATGC GACAATAACA AAGTTGCTGC
ACGTATCGTG AAAGCGGTGA TTGAGAAAAC TACTCTGGGT GAGGTGTCGA AGTACATGAA
AGAGGTGTAC GCTCCTGGGA GCTGCTATAT AAGTGTTGAA CTAGATATGG ATGCTATTGA
ACAGTTGAAG CTCAACGTCG ACGTACATAG TGTCCGTCGC TCCATTCTTT ACGGAACCAA
AGGGATCGCA AAGAATGCGG TTCTTCGAGG TTTGCGGGAC AGTGATGTGC TTGTTAAAAA
GGGCTTCAGC CCTAAGCTTC GAATCAATGT ACCCCGCCCA GATGAGAAGA GAGACAAATC
TAGCGCAGTT CCCAGCTACT TCGTAAGTTT TCAAATTCCT TGTCAGGAGT ATCTCGACTA
TTGCTAATCT GTCTTTTTCG ACATACTCAG GGAATGCAAA TGCTCAAAGC GGCGTTACCT
AATGTAATTG TTCAAGGAAT TCCGACTGTA GCGCGAGCTG TTATCAACGA AAGCAACCAA
TCGGGTACAC CGACATACAA TTTATTGATG GAGGGCTACG GTCTACAAGA CGTTATGGGA
AGCCCCGGTA TAGACGGATT ACATACAACA ACAAATCACG TCTTAGAAGT TGAAGATGTT
CTTGGAGTCG AAGCCGCGAG GACACAGATA TCTGCTGAGA TCGACAATAT TATGAGCGCA
TACGGTATCG GTATTGATCA ACGACATCTA CTTTTGCTTT CGGATGTTAT GACGTTCAAG
GGAGAAGTTT TAGGCATCAC CCGGTTTGGT GTTTCTAAAA TGAGGGAGAG CGTTCTGATG
TTGGCGTCGT TCGAGAAGAC AACTGACCAT TTATTCGACG CGGCTGTACA TGGTCGAACA
GACACAATCG TTGGAGTCAG TGAATGCATT ATTATGGGAA TGGAAGTACC AGTTGGAACT
GGGCTACCCG CATTGTACTG GAAATGCACA AACTGA
 
Protein sequence
MPSRKPAHGG CLDPRLGVSD KVSACATCKR KLVDCAGHFG YIRLALPVFH IGFLRHTLHL 
LQCVCKTCSR VLMPESDRQK HLSRMRSPKT DALSKAALFK KVVEKCKKVR VCPHCGGHNG
TVKKITGVPT LKIVHERYKG RNAEDELDEL IANLEVSLKM NKDVGTAISG SSLPYEDLLP
TRALEIFTHI SDDDCEVLWI DPLIGRPETL ILQSILVPPV PIRPSVAMDV GGGSNEDDLT
VKLQEIIDVN VALELALMKG PQTRTIMEEW DFLQVQVAQY INGEMPGLQR PIGSKPMRGL
CQRLKGKQGR FRGNLSGKRV DFSARTVISP DPNLRVDQVG VPERVAKTMT YPERVSRYNI
EKLRQRVRNG PDVHPGANLI RMKDGSFVKS LSFGDRELVA KNLRYGDVVE RHMEDDDVVL
FNRQPSLHKV SIMAHRAKVM EWKTFRFNTC VCAPYNADFD GDEMNMHLPQ TEEARTEASL
LMGVKHNLTT PRNGEPLVAA SQDFLSASYM LTQRDRFFTR EQFCQLVSYY SDASEDIDIP
FPTILRPVEL WTGKQVFGMM LRPNRKSSVL VSFENKEKNY TTNKYFCKND GWVAFRNSEL
VSGNIAKKSI GDGSKSGLLY ILLRDCGVHE AASCMDRWAK FCSRFMGGHR GLSIGISDVT
PSARLRDIKH GILSEGYKKE RLITPMKQGR LELRPGCDLL QSLEEILNGI LGRLRESAGQ
EAMKALPWTN TPRIMAECGS KGSPLNISQM ISCVGQQAVG GMRIQDGFVN RCLPHFEYHS
LIPSAKGFVA NSFYTGLTAT EFFFHAMGGR EGLVDTAVKT AETGYMARRL MKALEDLSLQ
YDSSVRNSEN TVVQFTYGDD GLNPNMMENN DRPVDFERLR LHISQTTPCP TEDCLSAAAL
STTVEKKLAE PRFQALLPTG RVFMQEIRDF FNSLAENQNS RMTETQLELL LTEALDKCML
AYVEPGEAVG AIGAQSISEP GTQMTLKTFH FSGISSMNVT LGVPRLKEII NAAKLISTPI
ITAKLECDNN KVAARIVKAV IEKTTLGEVS KYMKEVYAPG SCYISVELDM DAIEQLKLNV
DVHSVRRSIL YGTKGIAKNA GMQMLKAALP NVIVQGIPTV ARAVINESNQ SGTPTYNLLM
EGYGLQDVMG SPGIDGLHTT TNHVLEVEDV LGVEAARTQI SAEIDNIMSA YGIGIDQRHL
LLLSDVMTFK GEVLGITRFG VSKMRESVLM LASFEKTTDH LFDAAVHGRT DTIVGVSECI
IMGMEVPVGT GLPALYWKCT N