Gene PHATRDRAFT_45953 details


Gene Information

Locus tag: PHATRDRAFT_45953
Symbol:
ID: 7201026
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011676
Strand:
Start bp: 762588
End bp: 765872
Gene Length: 3285 bp
Protein Length: 857 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002180112
Protein GI: 219118689
COG category:
COG ID:
TIGRFAM ID:
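
The listed gene length matches the start and end coordinates if they are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal sketch of the check, using the hypothetical helper name span_length:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates (not stated in the record)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Start bp and End bp as listed in the record above.
gene_length = span_length(762588, 765872)
print(gene_length)  # 3285, the listed Gene Length
```

Because the gene is listed as spliced, the 3285 bp span includes non-coding sequence, so it is not expected to equal three times the 857 aa protein length.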


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.535873
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCGAGCGAAA GCGTAGGAAA GAAACGAGTG CATTGACTGT GAGTTGCTAT TCGATCTTCC 
CTACATTTGC GGGATCATCT GGTTCGAGAA ATCTCGCACT TCTTGTATCG ATCCGCCATG
ACGACACTTT CCTCTACTAC ACCTAGACCA AGCGTGCCCA AACGAGCGGA TCAACGATCG
ACAATCACAA TTCTTTTATT AGGCGATGGT ACGTGGAAAA ATCAGTGAGA ACGATGAAAA
AATGTGACCC ATTGAAAATG TGGATAGCTT GATTGAGTCA CTGAAGGCAG CAATGTATCC
AAATGTTTGC CTATTGAGAG ATTTGTAACA AAATCACGTT TTTACAGCGG CCTGAAGAAA
ACCGCAAGCC GTTTTCTGCA GTAGAATTAG CTCGATTCCC CTTTCATTTT TATTTGCCTA
CATCTCTATC GAACGGAATC TTTCTGATCT CGTATTTCTT ATATTGCATC TTTTCTCTAC
AGAGGGAGTT GGGAAATCCT CGTTGATATC CACCTTTGTC TCTCGATACT TTTCTGAGGT
CGTCCCTGGC ATCATGACTC GTGTTCGTTT GCCGCCGGAT CCAGAACTGT CCTGTGTCAC
CACTATAGTA GACTCGCAGG GGGGCGACCT TGCCTTGCTA CAGGCAATGG CCACTCGCCG
CTCTATGATG CAACACCATT CGTCGGTGCA CGGTAGCACG GACTCACTAG CCGCGCTCAT
GGAGCGGGCG GAAACCAGTA TGATGACCCA GCAATCGTCC GCTCCAGAAC AAACAACCAC
GCCGACCGTT AAATCTTCTG GTATCGAAAA CGTCGACTCA ATTGTTTTGG TGTACGATTT
GGACCGAGTA GAAACTTTTT TTCGTTTGGA GAATCATTGG TTGCCTTTGA TTGAAAGATG
TTACAACGGG AAGGTAAGCC GATTCTCACA CTGTTCGTTC CGGATCACTG TGTTGCTACG
CTGCTCCAGC GAACCATATC TTACTCACAG TCACAATGCC GCTGCAGGTT CCAATCATTG
TGGCGGAAAA CAAACTAGAT CTCTTTCGCC CTTCCAGTAC GGCGGGGATG ACGGACGAGC
AAGCTGTAGC GCGACAACGA CAACAGATTG TCTCCCTCCT ACAACGATTC CCATTTGTCC
GACAATGCAT CAAGTGTAGT GCCAAGAACT TGGTACGGGT TGATGATGTC TTTCTGAAGG
CGCAACAGGC AGTCCTCTAC CCCTTCACTC CGCCCTTGTA CGATCTCGAA CATGGACGCT
TGACAGAGGA GTGCAAAAGA GCCTTTACTA GGATCTTTCG AATGTACGAT TCGGATCGTG
ATGGATTGTT GAGCGACGTT GAATTGAATC GCTTTCAGAT CGAAACCTAT CACGTAGCAG
TCTTTGATCG GGATTTTTCG GCCTGGAAGA AGGTAGTGTC GCGCAACAAC CCCACCGACG
AAGTTGTGAT TCAAGACGGC AAATTCACAA TCGCTGGTTT TTTTGCAATT TTCGATCTCT
TCATCAGTCA AAATCGACTT GATGTTGTAT GGCAAGCTTT GCGCGAGTTC AATTATGATG
ACGATTTGAA TTTGCATATA CCTGAAATTG TTACAGCCCC AACCGACGAC ACCAGTTGGA
AGTTGTCATC GGGCGCGAAA AGATTCTTGT CAGGTGTCTT TCGTCAATTT GACCAGGACC
AAGATGATGT TTTGACTGCA GATGATATAG GGAACATTTT TTCGATTCTG CATCCACCCG
CTCTTCCTCC GTGGCATCCA GCTCGCGCTC CATTTTTGTT CGCGGGTTGC TTTTCACTGC
CCAAGCAGAA ATATTCGCCA GGCACCGAAA GTCCTAACTT TGGAGGCAAT GTATCCTTGA
TTCTCCCAGG TTCTACCCCA ATGGCCCAGT CTCTATCAAA CAGTGGAATT TCTATTTTAA
GTGCTTCAGA TTCCCTACCG AGCGTTGCCT TGTCGGGAAT AAGTGTTTCG GAACCTCTCA
CGTTCTTGGA ATGGATGGGA CACTGGCACA CAATTGCTGC TATTTCGCCG TCAGTGACTA
GAGCGGAACT GTTTAGGTTG GGGCATAGCG AGGAGTCTCG CAAAACTGAT CCTCGGCCTC
GTCGAAGTCG TAAGAAGAAA TCAGCTTCTA TCACCCCAAG TCAAGCGCCA TCCGATGCCA
CTTTTCCCTC CAGTGCCATT AGGGTTTTGG TGCTAGGCAG CGGCTCTTGT GGCAAAACAG
CTCTATTAAA TGCACTATGT GGCTCGATGG AAAGCACCGA AGTTTCAGCT ACCAACACAA
CAAGCACTCT GCATCCCGAG ACAAGCAGCA CATACGTAAA GATCGGTAGG GGGCAATCGC
TTGGCCATCA CGGTACCTGC AGCCCGTCCA AGTCGCATGA TGTAGTTGAG GAAATTGTAG
CTCATCTCGT TTTCACAGAC GTCCCTGAGA CTGCTGCTGT CAGTCAGAGG GAACATTATC
GAGAATTATC CGAGCTCTTT GGCTCGACCG CGTCTCCAAA AGATCGCGTC TGCGACCTTG
CGATGCTAGT GTTTGACGCT TCGAGTCCTT CGAGCTTTGA ATTTGCCAGA GAACTAGAAG
CAAAGCTATT GACACAGGAG ACTCCTCGTG TTTTTATCGC TACGAAATCA GACAAGATAT
CTGCTCCCGA ACCAGAGGAT GGCGACGCAC AGGCTGCGAA TGTGTTGGAA ACTGCCACGA
TTCATTGTCG AGAATCCGAC TTGGAACTGC CGCTCTTGAC GTCGGCCGCC GACGGCTCAC
TGCTGAATTT TGAAAAGCGC AATGCTACTC TTGACCACTT GGCGCGTTGT GCCCTGGTCG
AAGCTGGAGT GACACGCCTA AAGTCGAGGC CGCACGAAGA GAAGCAACGC CGCGAGACTA
ACCGCCGTCG CAAGATGATG TGGCTCGGTG GTATCGTAAG CGTCGGTGTC GTTGTTGCTG
CTGGTGTAGG TCTCCTTTGG GGCAGTCATG CGACAAAAAA GGAGCAGACG AGTGGCTTTG
GATGGTTGCG TAACTGGTTT GGAGGTACAA CCCGGGGTAA TTCACCGCAG GCCATGTAGT
TACAGTGATG TTGCTATTGG CAAGACTTTC CATCTCTCGC TTACCCTGCC ATACTAGGTA
AAGACTGAGA AATTTATAAT TGCTTTCGAA AAAAAGCCTT GCACCCGATT TCGGTATGCA
TTACGTCGTC ATAAGTGTTA TCCTTTGGTC ATGAAACTAG TCTAAACCTC GGGCACATTG
CAATATGTGC TCACTTATTT TATAATAAAG CACTTTGCAA ACTGT
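
The gene sequence above is printed in space-separated blocks of 10 bases, 60 per line, so the whitespace has to be removed before any length or GC calculation. The following is a minimal sketch of that clean-up; the function names clean_sequence and gc_fraction are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "CCGAGCGAAA GCGTAGGAAA GAAACGAGTG CATTGACTGT GAGTTGCTAT TCGATCTTCC"
seq = clean_sequence(first_line)
print(len(seq))  # 60 bases per full line
print(f"{gc_fraction(seq):.0%}")
```

Applying the same two functions to the complete sequence text would give the total base count and overall GC percentage for comparison with the Gene Length and GC content fields.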
 
Protein sequence
MTTLSSTTPR PSVPKRADQR STITILLLGD EGVGKSSLIS TFVSRYFSEV VPGIMTRVRL 
PPDPELSCVT TIVDSQGGDL ALLQAMATRR SMMQHHSSVH GSTDSLAALM ERAETSMMTQ
QSSAPEQTTT PTVKSSGIEN VDSIVLVYDL DRVETFFRLE NHWLPLIERC YNGKVPIIVA
ENKLDLFRPS STAGMTDEQA VARQRQQIVS LLQRFPFVRQ CIKCSAKNLV RVDDVFLKAQ
QAVLYPFTPP LYDLEHGRLT EECKRAFTRI FRMYDSDRDG LLSDVELNRF QIETYHVAVF
DRDFSAWKKV VSRNNPTDEV VIQDGKFTIA GFFAIFDLFI SQNRLDVVWQ ALREFNYDDD
LNLHIPEIVT APTDDTSWKL SSGAKRFLSG VFRQFDQDQD DVLTADDIGN IFSILHPPAL
PPWHPARAPF LFAGCFSLPK QKYSPGTESP NFGGNVSLIL PGSTPMAQSL SNSGISILSA
SDSLPSVALS GISVSEPLTF LEWMGHWHTI AAISPSVTRA ELFRLGHSEE SRKTDPRPRR
SRKKKSASIT PSQAPSDATF PSSAIRVLVL GSGSCGKTAL LNALCGSMES TEVSATNTTS
TLHPETSSTY VKIGRGQSLG HHGTCSPSKS HDVVEEIVAH LVFTDVPETA AVSQREHYRE
LSELFGSTAS PKDRVCDLAM LVFDASSPSS FEFARELEAK LLTQETPRVF IATKSDKISA
PEPEDGDAQA ANVLETATIH CRESDLELPL LTSAADGSLL NFEKRNATLD HLARCALVEA
GVTRLKSRPH EEKQRRETNR RRKMMWLGGI VSVGVVVAAG VGLLWGSHAT KKEQTSGFGW
LRNWFGGTTR GNSPQAM