Gene Information |
Locus tag | PHATR_43968 |
Symbol | |
ID | 7204385 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 565557 |
End bp | 570692 |
Gene Length | 5136 bp |
Protein Length | 685 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186373 |
Protein GI | 219113579 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGCTGTGGAG ACAATACATT CCAACAACTG TCCGTCTCAC TTTCTAGTGT GCTGCGACTG TTAACTTCTC AGTCTCGACG CGTGTGCTGC GCTCGCTTGG ACCTTTCTCG TCGTTGAAGG AGATCAGACA CCCGAAGAAA TTCATTTGCG TTCTCTCTCG AACGACAAAG AATCGACGCG GTGGGAGCGC TGGGTTCTAG TTCCGATTTG ACGATAACTT TGTCTTACCA GCATCGATTC CAAGAAGAGT TTTAAATAGG AAGGTTTTCC GTCGTCACCA AGCATGGCTT CGCTGCCACA TCATCAGCAA CAGCAGCCAC CGCTACACGC GCTCAGGGAA AATGGCGGTT CTGGGTCCGA GGGAGGTCTG GCGACTCGAC AAAATGGCAT ACCGACGGAA ATACGGAGCA CCGGATCACA GTCCTCCAAC AGTCGGAATC CGATTGAATT CCAAACACAG CAATACTATC CGCAGCGGCC GACAGCACAA CCTATGAGCA CGCCTTGGAC ATTAGGGTCC ACGGATGGAT CGTACGCTAC CACACTGACA CTCATGCAAC AACAGCAAGG CCAAATGGAT GAAAGCACCG TACCTTTGTC ATCTCCGAAC GACACCGGTA TTGCTCCCGG CCTCGGCCCG TTGCCTAAAG GAAAAGCGGA AGGCGGAGCG GCCATAGCCT CGTCGGGCGC CGCCGCGGTG GACCACAACG CCGCGCTCCA AATTCTGGCG GCTCAAAAGT CGCTGTACGA AACCCGTTTG TTTCGCCAAC CATCCGCATC GGTGGAAGCG GTCCTGGCAG CATCTTGTGA AGTCATGGGA TTTGATATTT CGGAAATGTG GTTACGGACC GGAATTAAAA CCCATCAGCT CACCAACTCG CATTTGCGAC CAACCGCTCT CGAAGACTCC GTCCGCAACG ATCTCGTGGA TGTGTACTAT GGAGACAAGT CTGCGGAGCG GACCCATCGA CTCAGCCCAG CTCTGTGCAA GCGGGCCAAA GAGGCGAATG ATGTGGTATG GGTCACGGCA CATACCCCTC ACGGAGCCGA AGCCTTGCGG TGCTCGATTT CCAACGTGCG CACGGCCGTT GCCGTTCCGG TTTGCCATGA AGCATCCAAT ACCAATATTA CGATTATCTT TTTCAGCATT CGAAGGTAGG AACGCGGGGA GTGGTTTTTG ACGTTTGCAT TCTCTGCCGC TGTCTCTCAC TTTTTTGAAA TCATTTTAGA ATTGTTGTTC GTCCAACCGC AGTTGAGTTT CTCGTCCACA TGTCGCTTGC AGCTGGCGGT GCGTCAGTGA ATTCACTAGC TGAAGATGGT CTTATCGACC GTGAAGCACT CAGCAGGAAG GACGATAACG AAAAAATTGT CAAGAGTATG AGTCGGTCCG AGCACGTCCC ACGCAAAGAA GATATTGCCA TTCGTCATCA GCGTGTAGAA CGATATTCGA TTACCGGTGC ACCGCTGGAT CTCCAGTGGC GGCAGCTGCA CAACGTCGAG TATTTGACCG ACGGTGGGAA CAGCTGGATT CATACAGCTG TTTTCCAAGG TAAACCAGTT GTGGTCAAAA CGCTGAAACC AGAATGCCAA GACGTTGTGC TGGCAATAAA TGAAATCGAA GGTGAACTTG CGGTGCATTC GCGATTGTAC CATACCAACA TTGTAGCGCT CATTGGTGCA GGGACGACGA GCAAGGGCGT ACGCTTTGTT GTATTAGAAC GTTTAGACGG TGGCACATTG ACACAGATGC TAGGGTACGA TACGCGTATT CGGGATCGTA GGAGACGTTT CTGGCGACGC 
AAGCAATTTA GCTACGTCGA CGTGTTGCGA GTTGCACGGT CCATCGCTGA TGCCATGTCT TATTGTCACC AAGAAGCGAT ACCAGGATGC ATGGTGCTAC ATCGAGATTT GAAACCAGAC AATATAGGTA TGTAAAAACT GCATTGCACA CAACCTTTTC ATATGCCAAT GGATTCTTAT TTCTTACACT TTCATTCGTG TTTTAAGGGT TCACTCTGGA TGGAACCGTG AAAATCATCG ACTTTGGTTT GGCAAAAATC GTCGAGAATG CATCCGTAGA CTCTGACGAT ATCTACACCA TGAGTGGAGA AACAGGGTCA CTCCGGTATA TGGCACCAGA GGTGGCCGAT GCTTTGCCTT ACAACGCAGC TGCCGATGTT TATTCCTTTG GCATCATTCT ATGGGAAATG AACGCGACGA AAAAGCCATT CGAGGGATTG AACCGCGAAT TGTTTTATGA GCGTGTTGTA CATGGCGGAG AACGCCCATC ATTGAACCGA AAATGGCCGT CCCAATTGAC GAGTCTGATA TCTGAGTGTT GGGACGCCGA TATGCACAAC CGTCCAAGAT TCAAAGAGAT TGTCGGGCGT TTAGATGCTT TGTTAGCGAA GGAAAAGGGA GGGCCAGCGA GCGCTAAAAA GAAGCTGCTG CCCAAGATTA CCGGGATGAT CGACCGACAT TCCACTTGGT TCTAGATTAC GCAAACCCTA CTCAATACTT GTACGTTGAG CGATGTCGAC TTGTTTCGTC AGAAACTTTA TACTAAACGA AGTCTAGCAT ATGCACCATA TTTGGTTAAG ACAGAATATC ATAAGCCAAA GGGCTCTGCA TCATTAATAC AACGCTTGTG GTACTAGTCG CTGCCTCTGT GCTTCCAAGG TTTCGAGATC TTGATGCAAC GGGGTGCTTC GTCAGTCTTG GGTTGATCCT GTGTCACGGT AGCATTGGAG TATGACTTTG CCCGTCGGTA ACGCGGAATC CATTCGTTCA GTGTCCACTT CGGCCCCATA TCCTGCAAAA CTTCCAGCAC ACCCTCTCTC ACTAGATCGC ACAGTGCCAT TACGACAACG TTGATAGAAC AGTAGTCACG AACGGCTACC CATCCTTTGG GAACCTTGTC CCAGTCCCGG TTCATGGCAT CCGAGAGGAC CTCGTCGCCC GTCAGCGTCT TGCTACTCAG AGCTTGTAAG AGCAAATCCT TAATCGACTC TTTCATACCG CTTGAAAAAC TTTCGTAGGA TTTGAAGTGA CGAAGTCTCT TCTGACCAGC TCCAATTCCT TTCCCAAACC CATTGTAAGA CTCTGCCGCA TCACGTAGTT CCGCCCACAA ATGAGAGTTG TTCATGAGCC CAAGCATTGA TGTAGAGCCA TCTTCTCCTA ATAAACCGGC CTTGGTACGA GAAACGAAGT TGCTAGAGTT CGCATACATC ACTGTGGGAA CATCACCTTC TTCGCACGGT GTCGCAAAGA TGCTTTCGAC TGCTCGAGTC CTTTCGGAAA TGGGCCGCCT TACATTTCGG CCGGTAGGTC CAAATATTAG ATAGCGATAT GACAACTTGT TTCCGAAACC AGAGCTGCCC ATCCCTACAT CTTGCTGTCC ATCTGATTTG TCCTCGGGCG CATCACAGCT CGCCGAACTT GGGTTCCACT TATTCATGTT GCAGTACCAG ACCTCTGGCA ATTCGTCCGC GGAAATATGA GGTGGAAGCT TTCGCCACTT GTCACATTTC TCGCATTGCA CCCACTCTAG ATTATCGGCT TCGTCCTGGT TTGTCGCGTT GGATACCTGA TTTGGCTTCA CATTCAACCC 
GTTCCGACGA GGCCGTCCCC TTTTTCCCTT TATTTCCGCA CTCTCAACCA GAGCATTGAG GGTGTCCTCG GATGCGTTTT GTCCCTTCTT TCCCATGCCT CCTGATTTTT TATCTTTTGG GTTCGTGTCG CTACCAGTAT CAGAATGGGG TGACTGCATC GAAGGAGAAA AGCGTTTGCT CTTGGCCTCG GCATCGGTAT CGTCTCGAGC GCGGGAGCTT TGCTGAAATC CAACGGGTTC TTCTACAATT GCACTATCCG CCCCTTTCGC TTCTCCACCC TCGGCAGATG CCGCAGCTTG GAGAATAAGT CTTTGCCGTT TCTTACGCCG TTTCTCTTTG GCTTTTTCCT TCGCCACTTG TTTTAACGAT TGCTCGACGG CTTCGCACGT AGACCGTTTG ACGTCCCAAA TATTATCCGA GCAGTACCAT TGTTTTGGTA GCGTTGCCAC CGCAGCGGAA GGAATGATTC GCCATTTCCC GCATTTGTCG CACTGCACCC ATTGACTGTC GTCCTCTACA TCGACCGATC CATACTGTTC GCTACCCCTT TTTTTCCTCT TTTTGGAAGC TTGCTCAGCG ACGTTACTTT CTTGCTTAGA CGCGATGCGC TCGTTAGCTT GTTGTGCGGC TCGTCTTCCT GATCGAGCAA CCGTACTCAA ATCTTCCGTG GAAATCAATT CTTCTTTGGT AAAATGCGGC AGACTTCGCC TTTCGGTCTC GGCTGCAAAG TTCGAACCTG GATCTTCATT AGTCTGCGTC TCTTCTATAC CACTTTTTGG TTGGTTCGCG TATTTCATAC TCTCGGACTT CTCAGCGGAC GATGAAGCTT GTATGTGCTT AAGCTTGGAG GATTGTCGGT CTGTCAAACT TTCGACGGAG GCTGATTCGG AGATCTCCAT TGCTTCATGC GTTGAGGGTG AAGCAGCTGA CGTTGTCGTG GAATCGACCA AGTCCTTGTC ATGTGACTTC TCTATCGACT GTGATTTATC CGCAGACTGA GATTTGACCG ATGAAATGTG AATATGCAAT CGCTTCGGAG CGGGTGGATC AGTCACGGGG CGTTCCTCAG GCGGGGCGAG CATCTTGGAA GGCGGGGATC CCATATTGTC CGTCGATTGC TGTGCAATCT CGCTTCTTGA AGCAGTCGCG CTACTGCATC TATCCATATG TCCCCGTCCC TCACGCTTGA CGTTGGATAT ACGAATAGTT AGGGAGGACT TTGGCTTTTC TGTGGTTTCA CTAATGGGCT TTGCATTGTC CGAAACACCG GCTCGACTGA ATGCTTCGAA CGGCAAGACT GTGCCAACAG TAGACTGCTC GGATAAATTT GTCTCCGTCT GCATTGGATT TGCAATCTCG GCCACATCTT CTTCGTCTTC CGCCGTTGTT CGATGCTTAG GAGGACGCCC TCGTTTACGC TTTGGTGGTA CATCTAGTCT CTCATCGTGT AGCAATGCGC GACTCGGTAC CGAGATTTCC TCCGATTCAA GTGTCTCCTC CAATGCCATG GGCTCCACAG CACCTG
|
Protein sequence | MASLPHHQQQ QPPLHALREN GGSGSEGGLA TRQNGIPTEI RSTGSQSSNS RNPIEFQTQQ YYPQRPTAQP MSTPWTLGST DGSYATTLTL MQQQQGQMDE STVPLSSPND TGIAPGLGPL PKGKAEGGAA IASSGAAAVD HNAALQILAA QKSLYETRLF RQPSASVEAV LAASCEVMGF DISEMWLRTG IKTHQLTNSH LRPTALEDSV RNDLVDVYYG DKSAERTHRL SPALCKRAKE ANDVVWVTAH TPHGAEALRC SISNVRTAVA VPVCHEASNT NITIIFFSIR RIVVRPTAVE FLVHMSLAAG GASVNSLAED GLIDREALSR KDDNEKIVKS MSRSEHVPRK EDIAIRHQRV ERYSITGAPL DLQWRQLHNV EYLTDGGNSW IHTAVFQGKP VVVKTLKPEC QDVVLAINEI EGELAVHSRL YHTNIVALIG AGTTSKGVRF VVLERLDGGT LTQMLGYDTR IRDRRRRFWR RKQFSYVDVL RVARSIADAM SYCHQEAIPG CMVLHRDLKP DNIGFTLDGT VKIIDFGLAK IVENASVDSD DIYTMSGETG SLRYMAPEVA DALPYNAAAD VYSFGIILWE MNATKKPFEG LNRELFYERV VHGGERPSLN RKWPSQLTSL ISECWDADMH NRPRFKEIVG RLDALLAKEK GGPASAKKKL LPKITGMIDR HSTWF