Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_34031 |
Symbol | |
ID | 7198084 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 923085 |
End bp | 927630 |
Gene Length | 4546 bp |
Protein Length | 1284 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178605 |
Protein GI | 219115619 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACCA AGTCTACTCC CAAAGACCTC ATTGACTCGT TTCCGCACAG CAAACTCACC CCGATTGCTA CCGCGACAAC CGAACCCGAT TACTTGTCGC TCCATCAGCT TCAGTATGAA ATCAACGACA ATGCGGAGAC CCTTTCCTCC ACTCTTGGAG ATGGCCAACA CGGTCACCTT TTTCTCGTAA TTTCCGAAAC CGAGTACCTC GAAATGACCG ACGGCATTCC ATGCATTCCT CCTGTGCAAC CGCCTTTCGA CCCAGTTCAC GCTGCCAACG CCACAGCCCC TCAAATTGTC GAAGCGAACC GCCAGAACGA CAAACGACAA AAGCTGTTTG ACCTCTATCA CAACGCCATT AAAGCGTTTC GCAATCAACT CCTTGAAGCC ATTCCCATTG AATACATCAA ATCTCTCGGT CATCCTACAC GAGGCTTTAA CAAAGTCTCT CCCCTCGAAA TCCTTTCTCA TCTCTGGGAA ACTTTTGGTA AAATTCAGGC TTCGGATCTC ATCGCCAACG ACGAACGCAT GAAAGCCGCC TGGCATCCAC CAACGCCTAT CCAGCAACTC TTCCAGCAGC TTGAAAAAGG CAATCAGTTT ATCATCGCGT CTGGCCAAGT CATGGACGAA CGTATTATCG CTCGCATCGG CTACCAGATC ATCGAAAAAA CCGGACTCTT TGATCTTGCT TCTCGCGACT GGCGTTATAA AGATGAAGCC GATAAAACTT TGGCAAATTT CAAAAAACAT TTCCAGAAGG CCAACAAGGA TCTCGCCGTC ACCGCCACCA GCAGCTCTGC GGGTTACCAC ACCGCAAATC AGAGTACTGT CACCAAGGGA AAATCGTATT GCTGGACCCA CGGCATCGTT CACAACACAA AGCACACCAG TGCGACATGT GAAAAACAGG CCCCGGGGCA CAAAACCGGC GCTACATTGC ATGACAAACA AGGCGGGTCG ACTAAGACCT ATCAATACAC GCCACCGGTG CCCAAATAGG AAAGAGGGAC GGCCAAACTG TTGAGTGTGC CGCTGAATAC TTATCATAAC AAAAATAAGT CTTCAGTTGC ACCAAACACT CCTCCGTTAG CTTCCTCCCC GCCATTTTTC CCTCCCGACG CCATTGCAGA CACTGGCTGT ACCGGACATT TTTTGAGCAC CAACATTGCT CACATACATT GCCAACCGAC GGTCCCCGGC ATCAACGTGG TCCTCCCTGA TGGTCGCACA ATCACTTCGA GTCATATCAC CGAACTCAAC ATTCCCTCGC TTCCTCCGGC AGCTCGTACC GCCCATATCT TTCCCGGTCT CTCGAATGGA TCCCTCATTT CCATTGGCCA ACTTTGTGAC CACGGCTGTA CCGCCACGTT CACATCTGAC ACAGTCCGCA TTGAGCTCAA TAACACTGTC GTTCTCCGCG GCGGCCGTTC TCCTTACACC CGATTGTGGA CCCTCGACTC CCCTGTAACG CCCAATCCGC CCGCCACTGA ATTGCATGCG CCTGTGCACG ACAAAAATTT TGCGAATCAC CTCGGAGACC ACTCAGGGAC CCTTGCCGAC CGCATTGCCT TTGTTCATGC ATCCTTATTC TCACCACAAC TTTCAACATG GTGCAAGGCC ATTGACGAAG GCCACCTCAC AACCTTTCCG GACATCACGT CTGCACAGGT AAAACGGCAC CCCCCCCAGT CCGTCCCTAT GGTCAAGGGA CACCTTGACC AGCAACGGTC CAACCTACGC TCAACCAAGC CCAAGGTCAC CCTGTCTGCC TCTGTTGATC CTGACGACAT CAATTTCGAC ACCAATCCCG TCGTACAAGA CCCTCCAGCC GCCAGGACGC AGTTTTTGTA CGCCGATTTC GCCGAAGTCA CCGGAAAAAT TTTTACTGAC CCTACCGGCC GCTTTGTTAC CACTTCAAGC TCCGGCAATG CATACATGCT AGTGGTTTAT GACTACAATA GCAATTTTAT TCATGTCGAA GCCATGAAGA ACCGCACCGG TCCCGAGATT TTGAGCGCCT ACAAGCGTGC TCACGCCATG CTATCCTCCA AAGGTTTGCG CCCCCAACTC CAACGCTTAG ACAACGAAGC CTCCACTGCG TTACAACAAT TCATGTCCTC TGTTGACATT GATTTTCAAT TAGCTCCTCC GCACGTGCAC CGTCGGAACG CCGCCGAACG GGCAATCCGC ACGTTCAAAA ACCACTTCAT TGCAGGTTTG TGCAGCACCG ACAAGAACTT TCCGCTTCAC CTTTGGGATC GCTTACTCCC ACAAGCCATC ATGACTCTCA ACCTTCTTCG AGGGTCTCGT ATCAACCCAA ATCTGTCGTC CTGGGCCCAA CTCCATGGCT TGTTCGACTA CAATCGTACC CCTTTGGCTC CCCCGGGCAT CCACGTACTT GTACACGAAA AACCGACAAT TCGCAGAACT TGGGCCCCCC ACGCAGCCGA CGGCTGGTAC GTTGGTCCCG CCATGAACCA TTACCAATGT TATCGCGTCT GGATCAAGGA GACCACCAGC GAACGCATTT CTGACACTCT GACATGGTTT CCCAGCCAAG TCAAAATGCC CAGCACCTCG TCTCGCGACA CAATTGTCGC TGCTGCTCAC GATCTTGCCC ATGCTCTGGC ACATCCATCT CCCGCGTCCC CCTTGTCGCC TCTTTTGGTC CACGAACGCG AAGCCCTCTC GCAACTTTCA GATATTTTTT CGAAAGCCGC TAACCCAGTT GACTCGTCCC TCCCAGTTGC TCCCACGGCA ACCCTAAGTC CGCCAACTGC ATCGACTTCT TCACCTCGTC AAGTCCGCTT CCGAGACCCG GTCACTGAAT CACTTCCGAG GGTGCCGACC ACCACAGCCG CCCCTCCGCA GTCACTTCCG AGGGTGCCTC CCCCAAACTC CGAGGCCGAG ACATACAAGC TTGTCACCTG CAACCCTCGC CAAGCACGTC GTAGGGCCGC TCGAAAACTG AAAGAAAAAA TTTCCGCTTC AACATCCGTT GTTCCTACCC AAGCAACACC TGCACCCGTC GTACCTTCTC CCGACAAGTA ATCGGGTTTG GTTGTCGCGG TAGCAATCGG GGTGAGTTTG CTGTGCGGAA ACGAGTCAAT GAGGTCTTTG GGAGTAGACT TGGTGGTCAT GATGACGAAA GTGATCGGAT CTTTCGGATG TGGTTTTGTT GGCCTTGCAG GTTAGACTGG AACTTGTTTG GCAGACAACA GAGGAGAGGA CGAGATGTCG TCGTGTGCGA TAGGTGATAT CGTGAGGAAA TTGGAAAATT GACAATCTTA TTAGTTCTAT TGATGAGCTC TACAAACGCA AGAACGCGAA TGATGGAAAT CGATGAACTA ATAAGATTTA TGACTAGTTT GAGAATCGTG ATTGAGTTTG TATGATGCAG CTGAATCGCC AATGACTGTT TCATATGGCG ATGACTTGTT TGAAATGTAA ACGGGAAGAC TAGAACGTTC GTAGGACGTT AAAGCTCTGT ATGTAACCGT GATAGGTTCT TTGGTACTAT CAATCTTGTC ACAACGGATT GGGAAAGTAT CAGTCTTGTC ACAACGGATT GGGAAAGATT TAGTAGGCCC AATGGAATTA TCAACACAAA GTTCCGTCCT TTACGGTATG TTAATCCAAC AGAAAGACTG CTGAAAGGGT CATTCTGATC CTGTCAAGGG TCTTTTTTCC GAAGACACAA TTTCCATCAA ACTTCTCCTC TTTTCTCATT ATCAACGCAA ACGTAAACCC TGGGACCTTC TGACCGTGTT GATTCTCCCG TACTAGCCGA CCACTTTGCA ACTGCAAGAA ACCTCCGCAG CTGTCTTTGA GCATCGGTGA AATCATGACA GCTTCTTGCT TTTCGTTTTT TATGCTATTC TTCCTATCAA CTGTCGACAC AGTGGAAGGC GAGGCGATTG TAACAACTCG AGGTGCCGAT CGCGAGCTAG CATTGGTAGG TGCTACCTGT GCAGATGTTA TAAAAGAAAT ACGCTCCCCA CGGTTCAATA GATGTACGTG CAGTCTTGCC GGTAGCACAG GGGGTGCAAC GGTGAATGCT ACATGCCGAG GTTGCTCCGA AGTATGCATC CTTGGAGCTT GTGGATCGGG ATATGACCAG GTCTCTTACG AGTTCGAGAC CGACGGAAGT GCTACCGGAG AAAGACGACA GTGCTTCGAG TACGTCTCAG GGGTGAACGG TACGATTTGT CGAATCGAAA ACCGCTTGAG AACCTCGTGT GCGATTACAT TGGACAATGA AGCATGCAGC TCATGTGAAA TGCGCGATTG TGGCGAGAAT GCCAATGGTA GCCAGTTGCC TAGGCAGGCA TTCGCAAACT GTACAAATCT CTTGGGAGGG GATTTTTTTG ACTTTTGTGA ACCTGTTACC GTCTCCGACA CGTCATCAGC GTTCATAGCG ATGGACTCCA CTTTTGCGGA GGACATTGAC GAATGCAACA GCGGCAGCAC AGCACATAAG CTATACTTCA ACGCATTAGG ATCCCTCTCT ATGGTTTCGG CTTGGCTCAT ATTGCAAACG CAATAA
|
Protein sequence | MTTKSTPKDL IDSFPHSKLT PIATATTEPD YLSLHQLQYE INDNAETLSS TLGDGQHGHL FLVISETEYL EMTDGIPCIP PVQPPFDPVH AANATAPQIV EANRQNDKRQ KLFDLYHNAI KAFRNQLLEA IPIEYIKSLG HPTRGFNKVS PLEILSHLWE TFGKIQASDL IANDERMKAA WHPPTPIQQL FQQLEKGNQF IIASGQVMDE RIIARIGYQI IEKTGLFDLA SRDWRYKDEA DKTLANFKKH FQKANKDLAV TATSSSAGYH TANQSTVTKG KSYCWTHGIV HNTKHTSATC EKQAPGHKTG ATLHDKQGGS TKTYQYTPPS SVAPNTPPLA SSPPFFPPDA IADTGCTGHF LSTNIAHIHC QPTVPGINVV LPDGRTITSS HITELNIPSL PPAARTAHIF PGLSNGSLIS IGQLCDHGCT ATFTSDTVRI ELNNTVVLRG GRSPYTRLWT LDSPVTPNPP ATELHAPVHD KNFANHLGDH SGTLADRIAF VHASLFSPQL STWCKAIDEG HLTTFPDITS AQVKRHPPQS VPMVKGHLDQ QRSNLRSTKP KVTLSASVDP DDINFDTNPV VQDPPAARTQ FLYADFAEVT GKIFTDPTGR FVTTSSSGNA YMLVVYDYNS NFIHVEAMKN RTGPEILSAY KRAHAMLSSK GLRPQLQRLD NEASTALQQF MSSVDIDFQL APPHVHRRNA AERAIRTFKN HFIAGLCSTD KNFPLHLWDR LLPQAIMTLN LLRGSRINPN LSSWAQLHGL FDYNRTPLAP PGIHVLVHEK PTIRRTWAPH AADGWYVGPA MNHYQCYRVW IKETTSERIS DTLTWFPSQV KMPSTSSRDT IVAAAHDLAH ALAHPSPASP LSPLLVHERE ALSQLSDIFS KAANPVDSSL PVAPTATLSP PTASTSSPRQ VRFRDPVTES LPRVPTTTAA PPQSLPRVPP PNSEAETYKL VTCNPRQARR RAARKLKEKI SASTSVVPTQ ATPAPVVPSP DKFFGTINLV TTDWESISLV TTDWERFSRP NGIINTKFRP LRRPLCNCKK PPQLSLSIGE IMTASCFSFF MLFFLSTVDT VEGEAIVTTR GADRELALVG ATCADVIKEI RSPRFNRCTC SLAGSTGGAT VNATCRGCSE VCILGACGSG YDQVSYEFET DGSATGERRQ CFEYVSGVNG TICRIENRLR TSCAITLDNE ACSSCEMRDC GENANGSQLP RQAFANCTNL LGGDFFDFCE PVTVSDTSSA FIAMDSTFAE DIDECNSGST AHKLYFNALG SLSMVSAWLI LQTQ
|
| |