Gene PHATRDRAFT_47710 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47710 
Symbol 
ID7202711 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp601318 
End bp604584 
Gene Length3267 bp 
Protein Length1041 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182097 
Protein GI219123573 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTTTGGGCGT GAGTGGAAAT AGCGACGGGC CAAACGCATC CCAATAGTGA CGGGTACCGT 
GGGAGTATTG GGGTGAGTGA GTGAACGAGT GAGGGAAGGA TCTAGTGTCA AACCGAAAAG
GCGCGTTCGG CTAACGGCAC AATGCATCCG CGACGCGGTA TGTTCGACGA TTCGAGACGT
CTTCCACCGA CGTCTAGTTC AACCTTGTCC CGATCGACTC AGCGAAACGG TGGGAAAGAA
TGGCGGAAGG AACCGCGTGC CGATGTGGAC GACAGGATCC AGGACGACTT TCCCACGCCG
CCCGTTACCC CGGAGAGTTC TGCCGGGGGT TCACCGCCTC AAAACTTAAC GAGCTCGACA
ATGAATCGAG AAAGGAGACA GTCCAAGGCA GCCTATAGTA GAGCAACGGG ACCGTTTGTC
GGGTCGCCCG GAAGCGCCAG TCCCTCGCAC GCCAACGCCT TTTCTCCGAA AGAGATGAGG
AAGGAACCAA TTATAGTAAA AGGGCAGCAG CAGCGAAGCG AGAAATTGGA TGCGACGGTG
AGGAGCCCGA ACACTGCTTC GGACACCGTC CTGGGACGAC TTTACGAAGC GGTCGACCAA
GTATGCGCGC CGCGCACCTC TTCCCCGACG GCAATTGTCG ACTACACCAC GCCGGAAAGA
ACCGAAGAGG TGCGCCAACG GCTCACGTTC GAAAGAGAAT GCGGGACGTC GCCGGTGGAC
GACGGAGATG ACTCTGCTGG GCGATCGCGC ACAACCAGTT TTTTAGACTA CCTTACGGGG
GGGACGGCGG GGACTACGAT CGCCGACGCT GACGACCAAG GCTTTGATCT CCTACTCGAC
GAAAGCAACT CGCAGCCTTT CAGCGATACG AGAGAGAAGG ATGTTGGAAG GCAGCGCCAG
CGCACGCTGA AAGCAACGCG CCGTGGCAAC AAAAGTGGAC CAAATACCAA TGCCACAGCA
ACTCTGGCGC AATCATTTTC TGCTGCCCTA GCGTTTTACC AAGGCTCGTC ATCCCCACCG
AGATCACCAC TTAAGACGAA TAACAATCTT CCCATGCGCA AAATCAAATC AGCAGCCAAA
GCCGCTTTGG CTGTCAAGGC ATTGAACACC AAGAAGGCAA TTGCGTACCA GGAAGCTGCC
CACACCAAAC ATACATCGTC TACCAGCAAT GGAAGTGCTG CACATGCCCG AGCCACCGCT
GCGGACGAGA CCACCATTTC AGCGGTAGTG ATGGCTTTGG ATAACAGTGT GGAAACACTG
CGATCTCCCT CAGGAACGAT GGTGAAACAG ACCAACGAAG AGACGGAAGC CCATGTCAAG
CAAGTCTTGG TGGCTTTTAA AGGAGCAGCC AAGGGACCAT TGAGCAGTAT TTCGGAGGAA
GAAATCATGT CTCGGGAGTC GGCAGAGTTC AAGGTTCACG TATCGGGCGA AACAGCGAAC
GTTTGGGGCG ACAGCGGTTT GATCGAAGTG GAAATGATCG ATGAGGTAGA AGAAGACAAA
GCCTACCCGA TTGATTCGGT GCAAAACAAT CTCACGAGTG AAGCAACACC TTCTTCGATG
GCCGAAATTG CATCTGGTGG AGACGAGATA GGCGATCGAT ATTCATCAAA CACAGCGAGT
GCATCGTCCA CGTTTCGCTC CAGAGTACAG CAAGTAGATT CCAATTCGGA TCTACCGACA
AGACTGCTTC GCAAAAATAG ACCTTGGAAG ACTAAGCCTT GGAGGAGTAC TAGCGGGCTG
TTCAAGACCA ACTCATTTGT TGCGAGCAAG ATTGGAGAAA CGAAACAGGA CGATCCTGTA
ACGTCACAAA AGTCCAAGTC AATATCGGGT GGAAAACCAC AGTGGAAAAT TGCCGTCGAC
GCTGAATCTG GTCGTACATA TTACTACCAT CGAATTTCAC GTGTGACTTC CTGGACAAAG
CCACCTGACG GTGAAGTGGG TATTGAAGTT GAAACCCAAA GTAAAAACGA AAGTTGCAAG
GATGTGGCAA AACCAGATTT CGATAATGTC GTATGGCAGA AAAAGGAAGA AATTTCTGCA
TTGCTCGAAA CGTTGACTCC TTCTGACTAC GAGAATGCGA GACGATTAAT GGTGCGGTAT
AGTGGGAACG AGGACGAGTT GCTAGCGCAG CTACGTAATT TGGCCCAGTC CCAGCCTTTC
GATGAGTCCT CGGTCAATGC TGGAGAATCT GCTAGTTTCG ACAACGCATT GGATACTGAT
ATAGCACTTT CGAGGCCGAC GAGTATGAAG TCTCGCACTG TGACATTGTC TAGTTTAAAA
AGTGGGACCA GCGTCTCTAC GAGAGTCTCC GAACAGACTG ACGTGATACG GAACACCGCA
AATGGACGTC GCCGCGGTTC GAACAAGATG GAAAGCGACA GTTCTGTCAC GAGCATCTCG
AGTCAACATG ATGACATTTG GAGACCCGGT GCACGTATTC CATACTCGGG AAAGCCATTA
CGTATGGATC GCATTCCGAG CCGAATTCCT GTTCCGCGCG TACGTGAACT CGTGGCGGAA
GATCTCTCCT CGCCAAAGGG CTTCCGCATT AGCCAAAAGA TAAGCGTTGC CCACAATTCT
CAGCCCACAA TTTCCAGGAG AGTAAAGTCC CTTTCTCCCC CGGAAGGAAA TGAAAATACA
GGCTCAAAGG ATGATTTAAA ACAGACGGAT GAAATCAAAG ACTTCGATTC CATGGGTTTG
AATGACGACA TTTCCGCATT GAGCATGGCT GATATCGACT ACCCAGGACA CAGAATCTGT
GACACTCACG GAGCCCGTCG CCGCCCGGTT GATGATGTGT TTGCACGTAA AGAATCACAT
TTGGTGGCAG CGCAGTCCGG CGGAACTCTT CACTCCAAGC AACCTGTAAC GGGCTCCCCA
GCACATCGAT GGACTCAAGC ACAGTTAGAT GGCTTTATTG CGCTGAACGA CTGGGACGCG
GTAGCGAAGC ATATTTCCCA AGTCCAAGGC ACTAATAGGA AAGTTAAGAC TGAGAAGAAT
GCTGTGGCTT TTCATTCAAG GATCGCATTT GAAATGCAAC AAGAGCCGCT GGTTCAACGC
AGCCAGTATG ATGAAGTGAA CGGCGGTCAC GTACAAAAGC GTCTCGGAGG GCGCTTTCAA
CGGCGACATG ACGGCATGCA CTCCGCTTCA AGCCGAGATA TGTCTTCTGT TGATGATACA
GACGCGTTCA GCACGGTGAG CGAATACGCG GAAGAACGAC GGAGACGTTC CAACAGGATA
CGACGAGCTA CGAGAGGATT TCATTGA
 
Protein sequence
MHPRRGMFDD SRRLPPTSSS TLSRSTQRNG GKEWRKEPRA DVDDRIQDDF PTPPVTPESS 
AGGSPPQNLT SSTMNRERRQ SKAAYSRATG PFVGSPGSAS PSHANAFSPK EMRKEPIIVK
GQQQRSEKLD ATVRSPNTAS DTVLGRLYEA VDQVCAPRTS SPTAIVDYTT PERTEEVRQR
LTFERECGTS PVDDGDDSAG RSRTTSFLDY LTGGTAGTTI ADADDQGFDL LLDESNSQPF
SDTREKDVGR QRQRTLKATR RGNKSGPNTN ATATLAQSFS AALAFYQGSS SPPRSPLKTN
NNLPMRKIKS AAKAALAVKA LNTKKAIAYQ EAAHTKHTSS TSNGSAAHAR ATAADETTIS
AVVMALDNSV ETLRSPSGTM VKQTNEETEA HVKQVLVAFK GAAKGPLSSI SEEEIMSRES
AEFKVHVSGE TANVWGDSGL IEVEMIDEVE EDKAYPIDSV QNNLTSEATP SSMAEIASGG
DEIGDRYSSN TASASSTFRS RVQQVDSNSD LPTRLLRKNR PWKTKPWRST SGLFKTNSFV
ASKIGETKQD DPVTSQKSKS ISGGKPQWKI AVDAESGRTY YYHRISRVTS WTKPPDGEVG
IEVETQSKNE SCKDVAKPDF DNVVWQKKEE ISALLETLTP SDYENARRLM VRYSGNEDEL
LAQLRNLAQS QPFDESSVNA GESASFDNAL DTDIALSRPT SMKSRTVTLS SLKSGTSVST
RVSEQTDVIR NTANGRRRGS NKMESDSSVT SISSQHDDIW RPGARIPYSG KPLRMDRIPS
RIPVPRVREL VAEDLSSPKG FRISQKISVA HNSQPTISRR VKSLSPPEGN ENTGSKDDLK
QTDEIKDFDS MGLNDDISAL SMADIDYPGH RICDTHGARR RPVDDVFARK ESHLVAAQSG
GTLHSKQPVT GSPAHRWTQA QLDGFIALND WDAVAKHISQ VQGTNRKVKT EKNAVAFHSR
IAFEMQQEPL VQRSQYDEVN GGHVQKRLGG RFQRRHDGMH SASSRDMSSV DDTDAFSTVS
EYAEERRRRS NRIRRATRGF H