Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38388 |
Symbol | |
ID | 7203250 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 776472 |
End bp | 781289 |
Gene Length | 4818 bp |
Protein Length | 1277 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182298 |
Protein GI | 219123992 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.553316 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACCA AGTCTACTCC CAAAGACCTC ATTGACTCGT TTCCGCACAG CAAACTCACC CCGATTGCTA CCGCGACAAC CGAACCCGAT TACTTGTCGC TCCATCAGCT TCAGTATGAA ATCAACGACA ATGCGGAGAC CCTTTCCTCC ACTCTTGGAG ATGGCCAACA CGGTCACCTT TTTCTCGTAA TTTCCGAAAC CGAGTACCTC GAAATGACCG ACGGCATTCC ATGCATTCCT CCTGTGCAAC CGCCTTTCGA CCCAGTTCAC GCTGCCAACG CCACAGCCCC TCAAATTGTC GAAGCGAACC GCCAGAACGA CAAACGACAA AAGCTGTTTG ACCTCTATCA CAACGCCATT AAAGCGTTTC GCAATCAACT CCTTGAAGCC ATTCCCATCG AATACATCGA ATCTCTCGGT CATCCTACAC GAGGCTTTAA CAAAGTCTCT CCCCTCGAAA TCCTTTCTCA TCTCTGGGAA ACTTTTGGTA AAATTCAGGC TTCGGATCTC ATCGCCAACG ACGAACGCAT GAAAGCCGCC TGGCATCCAC CAACGCCTAT CCAGCAACTC TTCCAGCAGC TTGAAAAAGG CAATCAGTTT ATCATCGCGT CTGGCCAAGT CATGGACGAA CGTATTATCG CTCGCATCGG CTACCAGATC ATCGAAAAAA CCGGACTCTT TGATCTTGCT TCTCGCGACT GGCGTTATAA AGATGAAGCC GATAAAACTT TGGCAAATTT CAAAAAACAT TTCCAGAAGG CCAACAAGGA TCTCGCCCTC ACCGCCACCA GCAGCTCTGC GGGTTACCAC ACCGCAAATC AGAGTACTGT CACCAAGGGA AAATCGTATT GCTGGACCCA CGGCATCGTT CACAACACAA AGCACACCAG TGCGACATGT GAAAAACAGG CCCCGGGGCA CAAAACCGGC GCTACATTGC ACGACAAACA AGGCGGGTCG ACTAAGACCT ATCAATACAC GCCACCGGTG CCCAAATAGG AAAGAGGGAC GGCCAAACTG TTGAGTGTGC CGCTGAATAC TTATCATAAC AAAAATAAGT CTTCAGTTGC ACCAAACACT CCTCCATTAG CTTCCTCCCC GCCATTTTTC CCTCCCGACG CCATTGCAGA CACTGGCTGT ACCGGACATT TTTTGAGCAC CAACATTGCT CACATACATT GCCAACCGAC GGTCCCCGGC ATCAACGTGG TCCTCCCTGA TGGTCACACA ATCACTTCGA GTCATATCAC CGAACTCAAC ATTCCCTCGC TTCCTCCGGC AGCTCGTACC GCCCATATCT TTCCCGGTCT CTCGAATGGA TCCCTCATTT CCATCGGCCA ACTTTGTGAC CACGGCTGTA CCGCCACGTT CACATCTGAC ACAGTCCGCA TTGAGCTCAA TAACACTGTC GTTCTCCGCG GCGGCCGTTC TCCTTACACC CGATTGTGGA CCCTCGACTC CCCTGTAACG CCCAATCCGC CCGCCACTGA ATTGCATGCG CCTGTGCACG ACAAAAATTT TGCGAATCAC CTCGGAGACC ACTCAGGGAC CCTTGCCGAC CGCATTGCCT TTGTTCATGC ATCCTTATTC TCACCACAAC TTTCAACATG GTGCAAGGCC ATTGACGAAG GCCGCCTCAC AACCTTTCCG GACATCACGT CTGCACAGGT AAAACGGCAC CCCCCACAGT CCGTCCCTAT GGTCAAGGGA CACCTTGACC AGCAACGGTC CAACCTACGC TCAACCAAGC CCAAGGTCAC CCTGTCTGCC TCTGTTGATC CTGATGACAT CAATTTCGAC ACCAATCCCG TCGTACAAGA CCCTCCAGCC GCCAGGACGC AGTTTTTGTA CGCCGATTTC GCCGAAGTCA CCGGAAAAAT TTTTACTGAC CCTACCGGCC GCTTTGTTAC CACTTCAAGC TCCGGCAATG CATACATGCT AGTGGTTTAT GACTACGATA GCAATTTTAT TCATGTCGAA GCCATGAAGA ACCGCACCGG TCCCGAGATT TTGAGCGCCT ACAAGCGTGC TCACGCCATG CTATCCTCCA AAGGTTTGCG CCCCCAACTC CAACGCTTAG ACAACGAAGC CTCCACTGCG TTACAACAAT TCATGTCCTC TGTTGATATT GATTTTCAAT TAGCTCCTCC GCACGTGCAC CGTCGGAACG CCGCCGAACG GGCAATCCGC ACGTTCAAAA ACCACTTCAT TGCAGGTTTG TGCAGCACCG ACAAGAACTT TCCGCTTCAC CTTTGGGATC GCTTACTCCC ACAAGCCATC ATGACTCTCA ACCTTCTTCG AGGGTCTCGT ATCAACCCAA ATCTGTCGTC CTGGGCCCAA CTCCATGGCT CGTTCGACTA CAATCGTACC CCTTTGGCTC CCCCGGGCAT CCGCGTACTT GTACACGAAA AACCGACAAT TCGCAGAACT TGGGCCCCCC ACGCAGCCGA CGGCTGGTAC GTTGGTCCCG CCATGAACCA TTACCGATGT TATCGCGTCT GGATCAAGGA GACCACCAGC GAACGCATTT CTGACACTCT GACATGGTTT CCCAGCCAAG TCAAAATGCC CAGCACCTCG TCTCGCGACA CAATTGTCGC CGCTGCTCAC GATCTTGCCC ATGCTCTGGC ACATCCATCT CCCGCGTCCC CCTTGTCGCC TCTTTCGGTC CACGAACGCG AAGCCCTCTC GCAACTTTCA GATATTTTTT CGAAAGCCGC TAACCCAGTT GACTCGTCCC TCCCAGTCGC TCCCACGGCA ACCCTAAGTC CGCCAACTGC ATCGACTTCT TCACCTCGTC AAGTCCGCTT CCGAGACCCG GTCACTGCAT CACTTCCGAG GGTGCCGACC GCCACAGCCG CCCCTCCGCA GTCACTTCCG AGGGTGCCTC CCCCAAACTC CGAGGCCGAG ACATACAAGC TTGTCACCTG CAACCCTCGC CAAGCACGTC GTAGGGCCGC TCGAAAACTG AAAGAAAAAA TTTCCGCTTC AGCATCCGTT GTTCCTACCC AAGCAACACC TGCACCCGTC GTACCTTCTC CCAAGGTCCC CACACCTCCG CACAGTCACG GCACTCGCTT ACAAGCCGCT CGATACCCAG GACACTCGTT CGACAGCGCC AACGCCGTCG TCGACCCCAA TTCCGGAGCC ACTCTCGAGT ATTCAAAACT CAAAAATTCT GAACAAGGCC CCGAATGGAT TCAAGCCGCC GCCAATGAGA TGGGCCGCCT GTCTCAAGGC GTCAAACCCA ACATGCCCAC CGGCACCGAC ACGATGCATT TTATTCCGCA TACCGCAAAG CCGCACGACC GCAAGGCCAC TTACCTGAAG ATTGTAGCGG CTATCAAGCC ACACAAGGCC GAAAAATACC GCATCCGTTT CACTGTCGGC GGCGACCGTA TCGAGTACAA CGGACCCACA AGTACCCCTA CAGCTGCATT ACCAGCCATC AAGATCCTCG TTAACAGTGT CATTTCCACC AAAGGCGCAC GCTTTATGAC CTGTGACCTC AAGGATTTTT ATTTGGGCAC TCCTCTCCCT GTGTACGAGT ACATGCGCAT TCCTGCAGTC CATATACCAG ACTGCATTAT GGAACAGTAC AAGCTTGCCC CGCTAGTTCA CAAAGGCAAT GTTCTAGTGG AAATTCGAAA AGGAATGTAC GGTCTCCCAC ATGCAGGCCG CATTGCGAAC GACCGCCTCA TTGATCATTT AGCTCTCGAC GGATACCATC AACTGACCCT ACCGGCCGCT TTGTTACCAC TTCAAGCTCC GGCAATGCAT ACATGCTAGT GGTTTATGAC TACGATAGCA ATTTTATTCA TGTCGAAGCC ATGAAGAACC GCACCGGTCC CGAGATTTTG AGCGCCTACA AGCGTGCTCA CGCCATGCTA TCCTCCAAAG GTTTGCGCCC CCAACTCCAA CGCTTAGACA ACGAAGCCTC CACTGCGTTA CAACAATTCA TGTCCTCTGT TGACATTGAT TTTCAATTAG CTCCTCCGCA CGTGCACCGT CGGAACGCCG CCGAACGGGC AATCCGCACG TTCAAAAACC ACTTCATTGC AGGTTTGTGC AGCACCGACA AGAACTTTCC GCTTCACCTT TGGGATTGCT TACTCCCACA AGCCATCATG ACTCTCAACC TTCTTCGAGG GTCTCGTATC AACCCAAATC TGTCGTCCTG GGCCCAACTC CATGGCTCGT TCGACTACAA TCGTACCCCT TTGGCTCCCC CGGGCATCCG CGTACTTGTA CACGAAAAAC CGACAATTCG CAGAACTTGG GCCCCCCACG CAGCCGACGG CTGGTACGTT GGTCCCGCCA TGAACCATTA CCGATGTTAT CGCGTCTGGA TCAAGGAGAC CACCAGCGAA CGCATTTCTG ACACTCTGAC ATGGTTTCCC AGCCAAGTCA AAATGCCCAG CACCTCGTCT CGCGACACAA TTGTCGCCGC TGCTCACGAT CTTGCCCATG CTCTGGCACA TCCATCTCCC GCGTCCCCCT TGTCGCCTCT TTCGGTCCAC GAACGCGAAG CCCTCTCGCA ACTTTCAGAT ATTTTTTCGA AAGCCGCTAA CCCAGTTGAC TCGTCCCTCC CAGTTGCTCC CACGGCAACC CTAAGTCCGC CAACTGCATC GACTTCTTCA CCTCGTCAAG TCCGCTTCCG AGACCCGGTC ACTGAATCAC TTCCGAGGGT GCCGACCGCC ACAGCCGCCC CTCCGCAGTC ACTTCCGAGG GTGCCTCCCC CAAACTCCGA GGCCGAGACA TACAAGCTTG TCACCTGCAA CCCTCGCCAA GCACGTCGTA GGGCCGCTCG AAAACTGA
|
Protein sequence | MTTKSTPKDL IDSFPHSKLT PIATATTEPD YLSLHQLQYE INDNAETLSS TLGDGQHGHL FLVISETEYL EMTDGIPCIP PVQPPFDPVH AANATAPQIV EANRQNDKRQ KLFDLYHNAI KAFRNQLLEA IPIEYIESLG HPTRGFNKVS PLEILSHLWE TFGKIQASDL IANDERMKAA WHPPTPIQQL FQQLEKGNQF IIASGQVMDE RIIARIGYQI IEKTGLFDLA SRDWRYKDEA DKTLANFKKH FQKANKDLAL TATSSSAGYH TANQSTVTKG KSYCWTHGIV HNTKHTSATC EKQAPGHKTG ATLHDKQGGS TKTYQYTPPS SVAPNTPPLA SSPPFFPPDA IADTGCTGHF LSTNIAHIHC QPTVPGINVV LPDGHTITSS HITELNIPSL PPAARTAHIF PGLSNGSLIS IGQLCDHGCT ATFTSDTVRI ELNNTVVLRG GRSPYTRLWT LDSPVTPNPP ATELHAPVHD KNFANHLGDH SGTLADRIAF VHASLFSPQL STWCKAIDEG RLTTFPDITS AQVKRHPPQS VPMVKGHLDQ QRSNLRSTKP KVTLSASVDP DDINFDTNPV VQDPPAARTQ FLYADFAEVT GKIFTDPTGR FVTTSSSGNA YMLVVYDYDS NFIHVEAMKN RTGPEILSAY KRAHAMLSSK GLRPQLQRLD NEASTALQQF MSSVDIDFQL APPHVHRRNA AERAIRTFKN HFIAGLCSTD KNFPLHLWDR LLPQAIMTLN LLRGSRINPN LSSWAQLHGS FDYNRTPLAP PGIRVLVHEK PTIRRTWAPH AADGWYVGPA MNHYRCYRVW IKETTSERIS DTLTWFPSQV KMPSTSSRDT IVAAAHDLAH ALAHPSPASP LSPLSVHERE ALSQLSDIFS KAANPVDSSL PVAPTATLSP PTASTSSPRQ VRFRDPVTAS LPRVPTATAA PPQSLPRVPP PNSEAETYKL VTCNPRQARR RAARKLKEKI SASASVVPTQ ATPAPVVPSP KVPTPPHMVY DYDSNFIHVE AMKNRTGPEI LSAYKRAHAM LSSKGLRPQL QRLDNEASTA LQQFMSSVDI DFQLAPPHVH RRNAAERAIR TFKNHFIAGL CSTDKNFPLH LWDCLLPQAI MTLNLLRGSR INPNLSSWAQ LHGSFDYNRT PLAPPGIRVL VHEKPTIRRT WAPHAADGWY VGPAMNHYRC YRVWIKETTS ERISDTLTWF PSQVKMPSTS SRDTIVAAAH DLAHALAHPS PASPLSPLSV HEREALSQLS DIFSKAANPV DSSLPVAPTA TLRPLEN
|
| |