Gene Information |
Locus tag | PHATRDRAFT_47710 |
Symbol | |
ID | 7202711 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 601318 |
End bp | 604584 |
Gene Length | 3267 bp |
Protein Length | 1041 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182097 |
Protein GI | 219123573 |
COG category | |
COG ID | |
TIGRFAM ID | |
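The length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates (the record does not say so) and that the gene span may include untranslated sequence in addition to the coding region. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 601318
end_bp = 604584
gene_length_bp = 3267
protein_length_aa = 1041

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Coding bases implied by the protein: one codon per residue plus one stop codon.
cds_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that the coding region does not account for.
non_coding_bp = gene_length_bp - cds_bp

print("span matches listed gene length:", span_bp == gene_length_bp)
print("implied coding bases:", cds_bp)
print("remaining non-coding bases:", non_coding_bp)
```

Under these assumptions the coordinate span agrees with the listed gene length, and the gene span is longer than the coding region implied by the protein length. That is consistent with the listed gene sequence beginning upstream of the first codon of the protein.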
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | CTTTGGGCGT GAGTGGAAAT AGCGACGGGC CAAACGCATC CCAATAGTGA CGGGTACCGT GGGAGTATTG GGGTGAGTGA GTGAACGAGT GAGGGAAGGA TCTAGTGTCA AACCGAAAAG GCGCGTTCGG CTAACGGCAC AATGCATCCG CGACGCGGTA TGTTCGACGA TTCGAGACGT CTTCCACCGA CGTCTAGTTC AACCTTGTCC CGATCGACTC AGCGAAACGG TGGGAAAGAA TGGCGGAAGG AACCGCGTGC CGATGTGGAC GACAGGATCC AGGACGACTT TCCCACGCCG CCCGTTACCC CGGAGAGTTC TGCCGGGGGT TCACCGCCTC AAAACTTAAC GAGCTCGACA ATGAATCGAG AAAGGAGACA GTCCAAGGCA GCCTATAGTA GAGCAACGGG ACCGTTTGTC GGGTCGCCCG GAAGCGCCAG TCCCTCGCAC GCCAACGCCT TTTCTCCGAA AGAGATGAGG AAGGAACCAA TTATAGTAAA AGGGCAGCAG CAGCGAAGCG AGAAATTGGA TGCGACGGTG AGGAGCCCGA ACACTGCTTC GGACACCGTC CTGGGACGAC TTTACGAAGC GGTCGACCAA GTATGCGCGC CGCGCACCTC TTCCCCGACG GCAATTGTCG ACTACACCAC GCCGGAAAGA ACCGAAGAGG TGCGCCAACG GCTCACGTTC GAAAGAGAAT GCGGGACGTC GCCGGTGGAC GACGGAGATG ACTCTGCTGG GCGATCGCGC ACAACCAGTT TTTTAGACTA CCTTACGGGG GGGACGGCGG GGACTACGAT CGCCGACGCT GACGACCAAG GCTTTGATCT CCTACTCGAC GAAAGCAACT CGCAGCCTTT CAGCGATACG AGAGAGAAGG ATGTTGGAAG GCAGCGCCAG CGCACGCTGA AAGCAACGCG CCGTGGCAAC AAAAGTGGAC CAAATACCAA TGCCACAGCA ACTCTGGCGC AATCATTTTC TGCTGCCCTA GCGTTTTACC AAGGCTCGTC ATCCCCACCG AGATCACCAC TTAAGACGAA TAACAATCTT CCCATGCGCA AAATCAAATC AGCAGCCAAA GCCGCTTTGG CTGTCAAGGC ATTGAACACC AAGAAGGCAA TTGCGTACCA GGAAGCTGCC CACACCAAAC ATACATCGTC TACCAGCAAT GGAAGTGCTG CACATGCCCG AGCCACCGCT GCGGACGAGA CCACCATTTC AGCGGTAGTG ATGGCTTTGG ATAACAGTGT GGAAACACTG CGATCTCCCT CAGGAACGAT GGTGAAACAG ACCAACGAAG AGACGGAAGC CCATGTCAAG CAAGTCTTGG TGGCTTTTAA AGGAGCAGCC AAGGGACCAT TGAGCAGTAT TTCGGAGGAA GAAATCATGT CTCGGGAGTC GGCAGAGTTC AAGGTTCACG TATCGGGCGA AACAGCGAAC GTTTGGGGCG ACAGCGGTTT GATCGAAGTG GAAATGATCG ATGAGGTAGA AGAAGACAAA GCCTACCCGA TTGATTCGGT GCAAAACAAT CTCACGAGTG AAGCAACACC TTCTTCGATG GCCGAAATTG CATCTGGTGG AGACGAGATA GGCGATCGAT ATTCATCAAA CACAGCGAGT GCATCGTCCA CGTTTCGCTC CAGAGTACAG CAAGTAGATT CCAATTCGGA TCTACCGACA AGACTGCTTC GCAAAAATAG ACCTTGGAAG ACTAAGCCTT GGAGGAGTAC TAGCGGGCTG TTCAAGACCA ACTCATTTGT TGCGAGCAAG ATTGGAGAAA CGAAACAGGA CGATCCTGTA 
ACGTCACAAA AGTCCAAGTC AATATCGGGT GGAAAACCAC AGTGGAAAAT TGCCGTCGAC GCTGAATCTG GTCGTACATA TTACTACCAT CGAATTTCAC GTGTGACTTC CTGGACAAAG CCACCTGACG GTGAAGTGGG TATTGAAGTT GAAACCCAAA GTAAAAACGA AAGTTGCAAG GATGTGGCAA AACCAGATTT CGATAATGTC GTATGGCAGA AAAAGGAAGA AATTTCTGCA TTGCTCGAAA CGTTGACTCC TTCTGACTAC GAGAATGCGA GACGATTAAT GGTGCGGTAT AGTGGGAACG AGGACGAGTT GCTAGCGCAG CTACGTAATT TGGCCCAGTC CCAGCCTTTC GATGAGTCCT CGGTCAATGC TGGAGAATCT GCTAGTTTCG ACAACGCATT GGATACTGAT ATAGCACTTT CGAGGCCGAC GAGTATGAAG TCTCGCACTG TGACATTGTC TAGTTTAAAA AGTGGGACCA GCGTCTCTAC GAGAGTCTCC GAACAGACTG ACGTGATACG GAACACCGCA AATGGACGTC GCCGCGGTTC GAACAAGATG GAAAGCGACA GTTCTGTCAC GAGCATCTCG AGTCAACATG ATGACATTTG GAGACCCGGT GCACGTATTC CATACTCGGG AAAGCCATTA CGTATGGATC GCATTCCGAG CCGAATTCCT GTTCCGCGCG TACGTGAACT CGTGGCGGAA GATCTCTCCT CGCCAAAGGG CTTCCGCATT AGCCAAAAGA TAAGCGTTGC CCACAATTCT CAGCCCACAA TTTCCAGGAG AGTAAAGTCC CTTTCTCCCC CGGAAGGAAA TGAAAATACA GGCTCAAAGG ATGATTTAAA ACAGACGGAT GAAATCAAAG ACTTCGATTC CATGGGTTTG AATGACGACA TTTCCGCATT GAGCATGGCT GATATCGACT ACCCAGGACA CAGAATCTGT GACACTCACG GAGCCCGTCG CCGCCCGGTT GATGATGTGT TTGCACGTAA AGAATCACAT TTGGTGGCAG CGCAGTCCGG CGGAACTCTT CACTCCAAGC AACCTGTAAC GGGCTCCCCA GCACATCGAT GGACTCAAGC ACAGTTAGAT GGCTTTATTG CGCTGAACGA CTGGGACGCG GTAGCGAAGC ATATTTCCCA AGTCCAAGGC ACTAATAGGA AAGTTAAGAC TGAGAAGAAT GCTGTGGCTT TTCATTCAAG GATCGCATTT GAAATGCAAC AAGAGCCGCT GGTTCAACGC AGCCAGTATG ATGAAGTGAA CGGCGGTCAC GTACAAAAGC GTCTCGGAGG GCGCTTTCAA CGGCGACATG ACGGCATGCA CTCCGCTTCA AGCCGAGATA TGTCTTCTGT TGATGATACA GACGCGTTCA GCACGGTGAG CGAATACGCG GAAGAACGAC GGAGACGTTC CAACAGGATA CGACGAGCTA CGAGAGGATT TCATTGA
|
Protein sequence | MHPRRGMFDD SRRLPPTSSS TLSRSTQRNG GKEWRKEPRA DVDDRIQDDF PTPPVTPESS AGGSPPQNLT SSTMNRERRQ SKAAYSRATG PFVGSPGSAS PSHANAFSPK EMRKEPIIVK GQQQRSEKLD ATVRSPNTAS DTVLGRLYEA VDQVCAPRTS SPTAIVDYTT PERTEEVRQR LTFERECGTS PVDDGDDSAG RSRTTSFLDY LTGGTAGTTI ADADDQGFDL LLDESNSQPF SDTREKDVGR QRQRTLKATR RGNKSGPNTN ATATLAQSFS AALAFYQGSS SPPRSPLKTN NNLPMRKIKS AAKAALAVKA LNTKKAIAYQ EAAHTKHTSS TSNGSAAHAR ATAADETTIS AVVMALDNSV ETLRSPSGTM VKQTNEETEA HVKQVLVAFK GAAKGPLSSI SEEEIMSRES AEFKVHVSGE TANVWGDSGL IEVEMIDEVE EDKAYPIDSV QNNLTSEATP SSMAEIASGG DEIGDRYSSN TASASSTFRS RVQQVDSNSD LPTRLLRKNR PWKTKPWRST SGLFKTNSFV ASKIGETKQD DPVTSQKSKS ISGGKPQWKI AVDAESGRTY YYHRISRVTS WTKPPDGEVG IEVETQSKNE SCKDVAKPDF DNVVWQKKEE ISALLETLTP SDYENARRLM VRYSGNEDEL LAQLRNLAQS QPFDESSVNA GESASFDNAL DTDIALSRPT SMKSRTVTLS SLKSGTSVST RVSEQTDVIR NTANGRRRGS NKMESDSSVT SISSQHDDIW RPGARIPYSG KPLRMDRIPS RIPVPRVREL VAEDLSSPKG FRISQKISVA HNSQPTISRR VKSLSPPEGN ENTGSKDDLK QTDEIKDFDS MGLNDDISAL SMADIDYPGH RICDTHGARR RPVDDVFARK ESHLVAAQSG GTLHSKQPVT GSPAHRWTQA QLDGFIALND WDAVAKHISQ VQGTNRKVKT EKNAVAFHSR IAFEMQQEPL VQRSQYDEVN GGHVQKRLGG RFQRRHDGMH SASSRDMSSV DDTDAFSTVS EYAEERRRRS NRIRRATRGF H
|
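The relationship between the two sequences above can be spot-checked by translating part of the gene sequence and comparing it with the protein sequence. This is a minimal sketch, not the database's own procedure. It assumes the standard genetic code (NCBI translation table 1), because the record's Translation table field is blank. The `translate` helper is hypothetical and written only for this illustration.

```python
from itertools import product

# Assumption: standard genetic code (NCBI translation table 1); the record's
# "Translation table" field is blank, so this choice is not confirmed by it.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 coding bases, copied from the gene sequence starting at the ATG
# that encodes the first residue of the protein sequence.
cds_start = "ATGCATCCG CGACGCGGTA TGTTCGACGA T"
# Last 15 bases of the gene sequence, ending in the stop codon.
cds_end = "AGAGGATT TCATTGA"

print(translate(cds_start))
print(translate(cds_end))
```

The two printed fragments can be compared by eye with the first ten and last four residues of the protein sequence. Translating the full coding region works the same way once the bases upstream of the first ATG are trimmed.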