Gene PHATRDRAFT_54289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_54289 
SymbolRNAP-II_2 
ID7199698 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp2690 
End bp6619 
Gene Length3930 bp 
Protein Length1211 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178910 
Protein GI219116230 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGGGT TTGACCACGT TCATGGACAA GACGTGATCG ACGCAGTCGA TCAGCACCCT 
TTAGGTTCCG CAGGCCGAAA GCTTTTGGAA GACGACGACC ACGGCAAGGG AGGCAAGCAG
CACATCCAAC AACCTCAAGT TCACGACTAT GGATCTACCA AGACACCCGA TGACGCCCCA
GTTGGAAGTC TTCGGGATAA ATGGAGATTG CTTCCGTACT TTCTAAGACT ACGATCGCTT
CTCCGCCAGC ACATTGACTC CTTCGACCAT TTTGTCGATG TCGAAATGCA GCAAATTGTC
CAAAGTCCTT CGGCGTGCGA GATTCGTTCC GAACACGATC CAAAATTTTA TTTGCGTTAT
GAGGCTTGCT GGGTCGGAGA ACCCTCTGTG GAGGAAGATT CTTATTCGGT GAAATCCGCC
ACGCCCTTCC AATGCCGACT TCGCGACTGT ACTTACTCGG CACCTATCTA CGTCAACGTT
CGCTACACCC GGGGACGGCA GATTGTCGTC AAAAGAAAGG TCATGATTGG GCGAATGCCA
ATCATGTTGC GCTCCAAGAA ATGTCTTCTC CGGGACAGGT CGGAAGACGA TCTGGCGGGT
ATGAAGGAAT GCCCTTACGA TCCTGGCGGC TACTTCGTTA TTAAGGGCGT GGAGAAGGTA
ATACTTATTC AGGAGCAATT GTCCAAGAAT CGAGTCATCT TAGAAGAAGA CAATAAAGGT
ATCATGGCGT CGATCACGTC CTCCACCCAT GAAAGGAAAT CGAAGGCCTA TATTCTGATT
AAGAACGGCC GCGTGTATCT TAAAAACAAT ACTCTCGGCG ACGATATTCC AGTTGCTGTT
GTACTGAAGG CAATGGGCAT AGAGTCTGAT TTGGAGCTTG TGCAGCTTGT TGGTAGCGAG
TCACCGATCA TCAACGCACT GGCCCTTTCG CTGGAAGAGC CTAGCCGTCT CGGTATACAC
ACACAAGCTC AGGCACTACG TTTCATTGGT GCCAAAATTC GTGGCCGATC GGGGCCCAGT
GGGCCAAGCA GTAGCTATAG AAAGAATGTG TCTCCCGAAG ACGAGGCGAG AGAGGTGCTG
GCGAACGTTG TACTGAGCCA CGTTCCCGTT ACACATTTTG ATTTTCGTGA GAAGGCTCTC
TATATTGGCC ATATTGTTCG GCGAGTATTG TTGGTACATC TTGGCAAGAT GCCGTTGGAT
GACAAGGATT ACTACGGAAA CAAACGCCTC GAACTAGCTG GAAATCTATT GAGCTTACTT
TTTGAGGATT TGTTCAAGAT TTTCAACAAA GATTTGAAGC GACAGGCCGA TCTCGTCCTG
TCAAAGCCAA ATCGCACACA AGCCTTTGAT GTTGTCAAAA CGATTCGCCC TGATACAATA
ACAAACGGAA TGATCAACGC TATTGCAACA GGAAATTGGG TGTTAAAGCG ATTTCGTATG
GACCGAGCCG GGGTCACGCA AGTTCTGTCT CGTCTTTCGT ATATGTCTGC CCTTGGCATG
ATGACCCGAA TCAACAGCCA ATTTGAGAAA ACTCGGAAAG TGAGCGGCCC GAGATCATTG
CAGCCTTCGC AATGGGGTAT GTTGTGTCCC GCAGATACTC CCGAAGGTGA AGCCTGCGGT
TTGGTGAAAA ATTTGGCGTT GCTTGGACAC ATTACAACCG ATGAAGCTGA CACAAGACCT
ATTGAACGGA TTTGCAGAGA CCTTGGAGTT GAAGACGTCA AGCGGATGAC GGGCCACGAA
ATTAATTCGC ACCAAGCCTT TCTAGTGTTT TTGAACGGAC TGATACTAGG GGTGCACACT
CGGCCGAGAG AACTTGTTCG GAACCTCCGA AAGATGCGAC GTAGAGGGCT GGCGGGAGAG
TTTGTCTCCG TCTATCTCCA CGACGAACAG TGCGCGGTTC ACATCGCCAC TGATGGTGGT
AGAGTTTGCC GGCCACTTTT GATCGTGGAC GAAGAAACTG GCCTACCCAA ACTACAACAA
ATTCATATGG AATCGTTGGC ATTGGGTACG ATGGACATCA CCGACTTGAT GCGGCAAGGA
ATCGTTGAAT ATGTAGACTG CAACGAAGAA AATAACACGC TGATCGCGGT CACAGAACGT
GATTTGGAAG TCGCTATCTT GCACGGGTTG GAAACGCGTA AAATGCGTTA TACACATTTG
GAGGTAGATC CATTCACCAT TTTGGGTGTC GTTGGGGGGA TCATTCCGGT ATGTATGATT
GATACGGTCG AATGATGATG GATCGCACCA TCCAAACGAA TGTCTCACCC GAAGGATTCT
TCGCTTCAGT TTCCGCACCA CAACCAGTCT CCGCGAAACA CCTATACTGT TGCTATGGCA
AAACAGGCTA TGGGTTCTGT GTCAATGAAC CAATACGAAC GAATGGACGG TCTTATCTAT
ACCCTAATTT ATCCACAAAA ACCAATGGTG AAAAGTAGAA CCCTGGATTT GGTAAATTTC
GACAATATAC CAGGTGGTCA TAACGCGTGC ATTGCCGTGA TGAGTTACAG TGGCTATGAT
ATTGAAGATG CAATCATTCT GAACAAGGCA GCAGTTGATC GAGGGTTTGG GCGTTGTATG
GTTCTTCGAA AACACCAAAC AAGTGTTCGG CGCTACGCCA ATGGAACTAT GGACCGGACT
TGTGCTCCTC CTGATCCAGG TAGTTTTCCG GATGGCGAAG ACGATAAGCG TTTCGCTCGA
TACATGGCAA TTGACAAAGA TGGAATTTGC CGAGTCGGGG AAAAGATTGA AAATGGCTCA
GTAATGGTTA ACAAGGAATC CCCTGCAGAC ACCACTAGCA ATATGGCTGG CGTCGATTTC
GGATTTAACA GCGGCATGTC GATGGCCCAC TTGAATTACA AACCCTCTGG AGTATCTTAT
CGCGGAAGTG CGTCTACCTA TGTCGATAAA GTGCTGATAA CATCAAACGA AAACGAGCAA
TTTCTGATCA AAGTTATGCT TCGTCAAGTT CGCCGGCCAG AAATCGGGGA TAAATTTGCT
TCACGGCATG GGCAAAAAGG AGTCTGTGGT TTAATTGTAC CGGAAGAAGA CCTTCCTTTC
AACGAATTTG GGCACGTCCC CGATCTAATA ATGAACCCAC ATGGCTTTCC TTCTCGGATG
ACTGTCGGGA AGCTTTTAGA GCTTTTGGTA GGAAAGGCAG GCCTGTACGA AGGTCGTCAA
GGATATTCCT CGGCTTTCGG AGAGGAATTT GGTTCTGCTG ACACTGCACA ATCTGCGTCG
GAGGCTTTGC TGAGAAATGG TCTCAACTAT ACCGGTAAAG ATATTCTCTA TAGTGGCACT
AACGGCGAGC CATTGGATGC ATACATTTTC TCCGGCCCCG TGTTTTATCA AAAGCTCAAG
CACATGGTTT TGGACAAGGC ACATGCGCGG GCCCGGGGTC CTCGCGCCGT GCTAACACGT
CAGCCGACAG AAGGTAGATC TCGTGATGGT GGGTTGCGTT TGGGTGAAAT GGAACGCGAT
TGTTTAATTG CGTATGGAGT ATCAAACTTG ATCATGGAGC GCCTTATGCA TTCTTCGGAT
GCCTTCAGTG CCAACGTTTG CCTCACTTGT GGGCTACTAC AATACGAAGG TTGGTGTCAA
TACTGCCGGT CCGGCGAAAA AGTGGCTGAT ATCCGTTTAC CCTATGCTTG TAAGCTTCTA
TTCCAGGAGC TCCAATCGAT GAATGTCTTA CCTCGTCTCC GACTGCAAGA TAAATGAATA
CGCTGCCAAT TGCCGTCGGG GAAGGTAGTG TTTACTGTTA ATTGTGTTGG TGTTGACCGT
GATAAATAAT AGGTGGGCAC AATCGGACAG TAACCCCTTA TTCAAAATTC ACTGTCACGT
CATCTGTTTC CTGTCTGTTA TGCCAGCAAA TCGGTTTTTC AAAAAAAGGA ATATCCTAGA
CGACAAATAA TTTATGTTTC TTATAAATTT
 
Protein sequence
MEGFDHVHGQ DVIDAVDQHP LGSAGRKLLE DDDHGKGGKQ HIQQPQVHDY GSTKTPDDAP 
VGSLRDKWRL LPYFLRLRSL LRQHIDSFDH FVDVEMQQIV QSPSACEIRS EHDPKFYLRY
EACWVGEPSV EEDSYSVKSA TPFQCRLRDC TYSAPIYVNV RYTRGRQIVV KRKVMIGRMP
IMLRSKKCLL RDRSEDDLAG MKECPYDPGG YFVIKGVEKV ILIQEQLSKN RVILEEDNKG
IMASITSSTH ERKSKAYILI KNGRVYLKNN TLGDDIPVAV VLKAMGIESD LELVQLVGSE
SPIINALALS LEEPSRLGIH TQAQALRFIG AKIRGRSGPS GPSSSYRKNV SPEDEAREVL
ANVVLSHVPV THFDFREKAL YIGHIVRRVL LVHLGKMPLD DKDYYGNKRL ELAGNLLSLL
FEDLFKIFNK DLKRQADLVL SKPNRTQAFD VVKTIRPDTI TNGMINAIAT GNWVLKRFRM
DRAGVTQVLS RLSYMSALGM MTRINSQFEK TRKVSGPRSL QPSQWGMLCP ADTPEGEACG
LVKNLALLGH ITTDEADTRP IERICRDLGV EDVKRMTGHE INSHQAFLVF LNGLILGVHT
RPRELVRNLR KMRRRGLAGE FVSVYLHDEQ CAVHIATDGG RVCRPLLIVD EETGLPKLQQ
IHMESLALGT MDITDLMRQG IVEYVDCNEE NNTLIAVTER DLEVAILHGL ETRKMRYTHL
EVDPFTILGV VGGIIPFPHH NQSPRNTYTV AMAKQAMGSV SMNQYERMDG LIYTLIYPQK
PMVKSRTLDL VNFDNIPGGH NACIAVMSYS GYDIEDAIIL NKAAVDRGFG RCMVLRKHQT
SVRRYANGTM DRTCAPPDPG SFPDGEDDKR FARYMAIDKD GICRVGEKIE NGSVMVNKES
PADTTSNMAG VDFGFNSGMS MAHLNYKPSG VSYRGSASTY VDKVLITSNE NEQFLIKVML
RQVRRPEIGD KFASRHGQKG VCGLIVPEED LPFNEFGHVP DLIMNPHGFP SRMTVGKLLE
LLVGKAGLYE GRQGYSSAFG EEFGSADTAQ SASEALLRNG LNYTGKDILY SGTNGEPLDA
YIFSGPVFYQ KLKHMVLDKA HARARGPRAV LTRQPTEGRS RDGGLRLGEM ERDCLIAYGV
SNLIMERLMH SSDAFSANVC LTCGLLQYEG WCQYCRSGEK VADIRLPYAC KLLFQELQSM
NVLPRLRLQD K