Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43214 |
Symbol | |
ID | 7196578 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 2338220 |
End bp | 2343557 |
Gene Length | 5338 bp |
Protein Length | 1657 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176961 |
Protein GI | 219110419 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAAGAATTGA ATCTTCCAAT GAATCTTTGA GATCAGCGAG GTCCGCCCAG AAACACGGGA CTTGATGCGA ATGCTTTCCG TTTCGGGTCG TTGAAGACTT GTTTCTTGAT AAGAGTGAAT ACGCAATCTT CTACCACAAA TCGGTTGGCT CACTTTTCGC CAATGGCTGT ACTGAACGAG AACAAGAAAA CAGGTGCGTA TGTGTAAGGC GAATCTATCA CTTTTTCTGC ATAATTGTGG CGTCTTCATC CTAACAGTGC TTTCCTTCTC GGTCTCTTGG TTCTCAAGAG AGTTTGTTTG ACCGAAACTA GAGCGAAATG GCACCGCAGC AACGAATCGA GGCCGTGCTC GACAAGGAAG GTACAACTCC GGTCCTTCCT CTCCTTCAAG CAGCTCAGCA TCTTCTGAAG CCTAACGGTT CGATCGTTGG CCGCGATGTG GCCGACACGA TCCGCCCTTT ATTTCGAACT TTTTGTTCGG CTATTCCGAC TTTCTACGAG GAAGCAAAAA CTAAGCCGGC TCTCGCGAGT ATTGATGGAC GCAGCCCCAT TTCACATTCT CGCATTCACG AATTTATTGT CAACGAATAC GGCCCCACCC TTCATGCTCT CGGGTTCGGA CGAGGCCATC GAATCGCATT GGTCTTGCCG AACGGGCCGG AATTAGCCTT GGCGATCGTC GCTACCGCTC AATACGCATC CTGTGTGCCC TTGTCGGCCA ACGGCGCCGC TAGCGAGTTG AAAGCCGATC TGCTGCGTGC CGGGGTCGAT CTCGTCGTCG GTCCATACTC AGGTCGTATT CATAACGATT TTATCCAAAA CTCGCAAGCA ACCGACGAGC GCTTCCACGT CATGCCATGG ACGGAAACAG ACTGGAGCAT GTTTGCAGAG ATCGAACAAA CCGCTACAGG TTTGGGAATC CCTTTTGCGG GGTTGGTTCC GAGTCCCTAC GAGTCGGGTA TTTTCAAAAT TGTTACCAAC GCAGCGACGG ACACTCTGCG ATTCTCCGAT CGTCACGAGA TTATTTGTCC TATAAATCGA AGCACCAAGG CTTTTTTGAC CAAAGAAAAC ACCTTGAATG ATGAGGTACT CGTCTTGTTC ACATCAGGAA CTACCGGTAA CAAAAAGCTT GTCCCACACC AACTCGGTGA CATCTTAACT GCAGCAACGA CGATTGCTTT GAGCTGGGCT CTAACTCCCG ACGACGTCAA CTGCAACCTC ATGCCCCTCT TTCACGTCGG AGGCATTGTC CGTCAGGTCT TTTCACCCCT CGTCTCCGGC GGTTGTGTAA TTTGCTGTCC TAGCTTTGAT CCAAGCATCT TTTGGCTGCT ACACGTAACA AAACAAGCCT TTACTTGGTA CTACGCCGCG CCCACCATGC ATCAATTGAT TTTGCAAACG GGACAAGCAG ACGGGTTCTT GGTAGAAGGC AAACATTGTC CACCTTTGCG AATGATTGCC AACGCTGCTG GGGGGTTGTT ACCCTCGTTG GCACTGCAGC TACGGGATAC ATTTGTCGGT GCCACCGTGC TGCCATCCTA CGGAATGACC GAGTGTATGC CAATCAGTTC TCCGCCGGCA ACTTATCAGT TGGAGAAACC TGGAACGTCG GGAGTGGCTG TGGGACCGGA AATTGCTATT CTCAATACAA CGACAATGAA ATCGTTACCG ATCGGAGAAG ATGGTCCAAT TTGCGTTCGT GGTGATCCTT GTTTCCGTGG TTACGGAAGG ATTGCCAATG ATTTTTCGGA GGCCGTGAAT GACACATTTA TGGCTGACGG TTGGTTCAAT ACCGGTGATC TAGGACACTT GGACAAGGAC GGCTATCTCT TTATTACTGG CCGTTCGAAG GAAGTCATCA ATCGCGGGGG TGAAATCATA AGTCCCATGG AGGTCGAAGA AGCGGTCCTT AGCCATTCCG ATATTTCTTT GGCAGCTGCC TTTAGCGCTC CTCATGATGT CCTTCAAGAA GTTGTCGGTA TTGTTGTGGT TATGATGGCA GACCGTCCCC GTCTTGATCT AGCCTCTCTT CATGAGTTTC TGGGAGAGCG ATTGGCAGCC CCTAAATGGC CGCAGTGCTT GATCTTTATG GATGGTTTGC CCAAGAGCCA CACTAACAAG CTTCTCCGTG TCAAGCTTGG CAGCCGACTG GGGCTGCCTG AACTCAGGGA TGATATGCTC GCAATAGATC GTACATGGGA AGGCAAGTGT CCACCGCAGG GGACGCCGTT GGATGTTGCC ATACCAGTTT TTCCTGTTTC CGTTTCGGCC GAGGAAATCG AAGAAAAGCT TGCCAGTATG TTGGTGACCA CAAAAAATCA AAATTTGCGG GTAATTCCAC ATAGCACCCG GACCGGGTCC CTCGTTTGCT ATGTCTACAA CCTTGATCGA ATGGATGCAA TTTTGCTAGC CCGCAAGGCT TTACCAAGAT ATGCGGTACC CAGTCACTTT GTTTCGCTTG ACACAATCGA GCTTTTGTCG GGAAAAGTTC TGCCTTCTCC GAGTATGAAG GATGCCGTGG CTTCTCTTCT CCAACGCTCG TCAAGCGCCA ATACAATTGA TCCTGTAGTC GATAACTTGC AGAGCTTGTT TGCAGAGCTT TTGTCTTTGG ATTATCTCCC TGGGCCTGAA GCTAACTTTT TCCACATCGG AGGAAGTTCA ATGTTGGCGT CACAGTTGGC TAGCAAACTT CGAAAGCAAT TTGGGATAGC CTGTAGTGGG GCTGAAATTT TCCACAGTAC AAACTGTAAC GACCTTGCCA AGCTTATTTA CCAACGAAGC GACGACTTTG CGACGATTTC ACCAATAGAT TCGAAATTGA ATGATCAATC AGGCCCAAGC GGACGTGTTG TTGACGACCA TGGCGCACCC TTCCCCTCAA AACGTCTAGC TATGGATGGT TCATTCCTTC GCTCACTATT TCAGCTTGTG CCTATTCTAA TTATATTTCC AATTTGGCAG ATATCGCGCT ATATCCTCTT TTTCTGCTTC CTACTCTGGT CGATTGATGT TGTTCCTGGT ACCCGCGATA TCGGAACCTT TATTGCTGCC TATTTGGCAT TTCATTTGTG CTGGATTACT ATAACGCCGC TCGTGTTTGT TGCCATCAAA TGGAGCGTGA TTGGTCGCTA CAAAGCAGGG CGCTACCCAA TTTGGGGTAG CTACTACTTA CGGTGGTGGT TTGTTGACAT CTGCCGCAAA CTTTTTCTTC GTGGCATCTG GGGTTCCAAC GAGGTGTACC TGAACATCTA CTACCGCCTA CTGGGGGCCA AGATTGGCAA GGGTGCCCGC ATTAGTCTAG AGGCTGACTT GGCCGAATTT GACTTGGTAA ACGTTGGGGA AAACGCCGCT GTAGAAGATT GCACGCTAAG GGCCTTTGGA GTAGACAATG GTGCCATGAT TCTTGGACCG GTGCATGTCG GAAATAACGG CAGTGTTGGA GCGAAGTCAG TGGTCGCTCC CTTTACTTCA GTCCCTGACG ATGGTCACCT CGGACCAGTG ACTTCGAGCT ATGAAGTAGG AAAAGCTCTG GACTTGAAGA ACAGACAGTT TAATCGTCGA TGCCTGGCTG AGCCCAGCAT TTGGCTCCAG GTTTGTCTGG GTTCGCCGAT CACCTTTGCT GTAAACTGCT TTGCTCAAAT TCCCTCGTTG CTGATCCTTA TATGGATGCT GAGATACAAA GGCCAGCGTG GCGAGGAGTT CTTGACTCTG AATGATTTAA TGGAGTGGCT TTGCGATCCT AAGCGTATCC CGTTCTACAT TGGAATTCGT GTTGCTCGCA ATATTGTGTC TCCTTTTTTT TACATGGCGG CTGCCATTGT TGCCAAAAAG ACTGTGATTG GCAAGTTTAC AGCCGGTCCG CGTGACACTT GGTCCAGCTG GTCTCTGTTT CGTCACTGGC TAGCTGCAAC TTTATTCTCT CGTAAAAAAA TTCAGGCCGT CACGGATTTG ATCGGCCGTC ACTACGAACT TGTTAGTGTC TTGTACCGTC TCCTCGGTGC CAAAGTTGGA AAACGTGTTT TTTGGCCGGG CTCACAACCT GTGTTCACCG GTGAATTTGA TTTGTTGGAA ATTGGCGATG ACGTTGTCTT TGGCTCGCGT TCTGGTATTT TCATGACGAC AGACACTTCA TGCGAAAAGG TTGTTTTGTG TGCCGGTGCC AATGTTGCTG ACAATTGTGT TGTCCTTCCT GGAAGCGTTG TCGGCAAAAA CGCTGTGCTG GGGTCAAACT CTGTTTGTCC ACTCGGATGG TACTTGCCCG AGGGTAGCGT CTGGTTTGGA TCTAAGGGCT GCGAGCCCGA TTGCCTCGAC AAGGGTGTAG TCACAGATTT CGATGGTCCG ATTCTAGTCA CTGATATTGA CGTAAAGACG GTTCCGATGG TTGGAGATGC TACGACTCTT CGCCCTTTCG GTAAAGCTTT TTACAGAGGA GAAGCGTCTT ACAGCGTTTG GCCCTTACAT GTGATTATCG CAGCGACTCT CTTGATCAGA AGTATCCTGA GTGTGTTTCA CACGCTTCCG TTGCTAGGTG CAATTCAAGG CGGTGGCGCC ATTCTATATG GTCTTCCTCT TATTGACCGG GTCTACGATA GTCATGAGTA CAACTTTTTC CACGTGTATT TTGCGATTTT GTTTGTCTTT TTCTTCACGC ACGCGCTTCG CGTCGCACTT TGGCTTGTGA TTGAACTGAC GGCCAAATGG ACTTTGATGG GGCGTCGAGA AGAAGGCCGT TACAACTATG ATACGAGCTC GTACGCACAG CGATGGGAGC TCTATCAGCT TATTTCCAAA GTTCGAAAAT TCAATCGTCT CAACTTTTTG GACTTCTTGT CCGGTACCCC TTTTATGGCT GCATACTTTC GGCTGAACGG AGGCAGAATT GGTCGGGATT GTTGCCTATT TCCTGCTGGG GCCGACCCTT TCATGCCCGA ACCCGATTTG GTCACGATGG GCGACCGCTG TGTTGTGGAT TGCGCTTCTA TTGTCTGCCA TTTGAACACA CGCGGCAACT TTGAACTAGC CAGAATCACT CTAGAAAATG AATGTACGCT TCGTACACGT TCTCGTTTGC AGCAAGGCTG TTACATGGAA CATGGCTCGC AACTGTTAGA GAAAAGTTTG GCAATGACAG GAGAAGTTAT CGAAGCCAAC AGCGTTTGGC AAGGTGGCCC AGCGTCATGG TGGTTCCAGT ACTCTCAACG TAGTCTGTAC ATGGCGGATG AAGAGGAAAC TGCCGATGAA AAAACCAATC TTTTGAAAGC TAAAGTGTCT TCGTACAACG TTCAGCTTTA GCGTTGGCGA TGAGCTCGTA ATCATTTACA TTTTGACCCT TTACTGTT
|
Protein sequence | MAPQQRIEAV LDKEGTTPVL PLLQAAQHLL KPNGSIVGRD VADTIRPLFR TFCSAIPTFY EEAKTKPALA SIDGRSPISH SRIHEFIVNE YGPTLHALGF GRGHRIALVL PNGPELALAI VATAQYASCV PLSANGAASE LKADLLRAGV DLVVGPYSGR IHNDFIQNSQ ATDERFHVMP WTETDWSMFA EIEQTATGLG IPFAGLVPSP YESGIFKIVT NAATDTLRFS DRHEIICPIN RSTKAFLTKE NTLNDEVLVL FTSGTTGNKK LVPHQLGDIL TAATTIALSW ALTPDDVNCN LMPLFHVGGI VRQVFSPLVS GGCVICCPSF DPSIFWLLHV TKQAFTWYYA APTMHQLILQ TGQADGFLVE GKHCPPLRMI ANAAGGLLPS LALQLRDTFV GATVLPSYGM TECMPISSPP ATYQLEKPGT SGVAVGPEIA ILNTTTMKSL PIGEDGPICV RGDPCFRGYG RIANDFSEAV NDTFMADGWF NTGDLGHLDK DGYLFITGRS KEVINRGGEI ISPMEVEEAV LSHSDISLAA AFSAPHDVLQ EVVGIVVVMM ADRPRLDLAS LHEFLGERLA APKWPQCLIF MDGLPKSHTN KLLRVKLGSR LGLPELRDDM LAIDRTWEGK CPPQGTPLDV AIPVFPVSVS AEEIEEKLAS MLVTTKNQNL RVIPHSTRTG SLVCYVYNLD RMDAILLARK ALPRYAVPSH FVSLDTIELL SGKVLPSPSM KDAVASLLQR SSSANTIDPV VDNLQSLFAE LLSLDYLPGP EANFFHIGGS SMLASQLASK LRKQFGIACS GAEIFHSTNC NDLAKLIYQR SDDFATISPI DSKLNDQSGP SGRVVDDHGA PFPSKRLAMD GSFLRSLFQL VPILIIFPIW QISRYILFFC FLLWSIDVVP GTRDIGTFIA AYLAFHLCWI TITPLVFVAI KWSVIGRYKA GRYPIWGSYY LRWWFVDICR KLFLRGIWGS NEVYLNIYYR LLGAKIGKGA RISLEADLAE FDLVNVGENA AVEDCTLRAF GVDNGAMILG PVHVGNNGSV GAKSVVAPFT SVPDDGHLGP VTSSYEVGKA LDLKNRQFNR RCLAEPSIWL QVCLGSPITF AVNCFAQIPS LLILIWMLRY KGQRGEEFLT LNDLMEWLCD PKRIPFYIGI RVARNIVSPF FYMAAAIVAK KTVIGKFTAG PRDTWSSWSL FRHWLAATLF SRKKIQAVTD LIGRHYELVS VLYRLLGAKV GKRVFWPGSQ PVFTGEFDLL EIGDDVVFGS RSGIFMTTDT SCEKVVLCAG ANVADNCVVL PGSVVGKNAV LGSNSVCPLG WYLPEGSVWF GSKGCEPDCL DKGVVTDFDG PILVTDIDVK TVPMVGDATT LRPFGKAFYR GEASYSVWPL HVIIAATLLI RSILSVFHTL PLLGAIQGGG AILYGLPLID RVYDSHEYNF FHVYFAILFV FFFTHALRVA LWLVIELTAK WTLMGRREEG RYNYDTSSYA QRWELYQLIS KVRKFNRLNF LDFLSGTPFM AAYFRLNGGR IGRDCCLFPA GADPFMPEPD LVTMGDRCVV DCASIVCHLN TRGNFELARI TLENECTLRT RSRLQQGCYM EHGSQLLEKS LAMTGEVIEA NSVWQGGPAS WWFQYSQRSL YMADEEETAD EKTNLLKAKV SSYNVQL
|
| |