Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_43905 |
Symbol | |
ID | 7204311 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 390336 |
End bp | 394065 |
Gene Length | 3730 bp |
Protein Length | 1038 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186051 |
Protein GI | 219112935 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.352902 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGAGT ATACGCTCGA AAATCTGCCC AGGAAGGCGA CCATCGAGGA GCTCTCGGCG TATCTTGCAA AGGTTCAGGT TTCGTTTTCC AGTGTCGGTC CAATCCACAA GGCCGGAAAG GGGTCTGTCG CCAAGGTAGT GTTACCAAAA GACATCGACC TCCGAAAAAC GAGAAAGGAA TTCTTGACAA GCCCTTTGGA AGGCTGCGTT GTGCAACTTC ACTGCAAGTC ATTCATTGCG GAGATGAAAA ACAATAAGTC CAAAAATGGC TTGGCGGACC GGACTTTCCT AAAGAAACCA CCACAAGGTA CTAGCGGTAA CTCACGCCGA TCGAAAGACA GAGTTCCAGG CCCCTCTGGG AAATTCAAAA AGCATTTTCA TGCCAGGAAG GATCACAAGG TGGCTACCCG GGTTGTTCCA GACTTCATCC CAACGCAAGA CAGGGTCGGA CACACAGGAC GCCAGGATGA ACGGTTGACG ATGAATCGGA CGCGTCTCGT TCCCACGAAA CAAGTCAACG ATCAGGTGGC AGTTACTTTC GCTGAACTTG ACCGCAAGAT CCCGTCCCCC AAGGAGATGA AAGAAATCCA AGCTCAAGTT CTCAAAAGTC GAAAGCAGGC AGAAGCTGAC AAAAATGGAA TCTCGATTAG CAAAGCACCG ACCGAAGCGG ATATCGTTCC TTTGCACAGA GGATACTTCT TTTCTTTGAG GATCACCTCG ACCACTGCGC CAGCTATTCT GGAGAAAATC GATATCGGTG GTACTCACGG GAAATCTTTC CGATACGTGG GGCAGAAATT GCCCCACACC GTTGCCGAAA AGACCGAAGC CAACTTCATC TTTCTGCCAA ATTTCCCCGG AGTCTTTCGT GTCGGCCTGA CCTTTCAGTT TTCTACATGC TCCATTTTGC GAACTCTAAC TTTCAAAGGA GGCGACAAGA AAATGCATAA AGAGGTGGAG CCTCAAAGCC TATACGAAAG AAAGATATCT AACATACACA GCCAGAAGCC GATAGAAATA TTCGCCCCGC CACCGGAAGG AAATCACAAC GGTACCAGAA ACCCTTTCGC AAACCTCCCA CTCTACAAAA TTCCAGCAAA CGTGGAAAAA ATGATCGGGA ATCCTGAATT TGAAAATGCG ATTGAACTCC CTCATGAGGA CGGTAGGAAC TATGCAAAAT TCTACCAGAG TATGCTTTGG GCAAGCGAGT TGCAGGCCTA CCGCGATATC AAGCTTTTCG ATCTTGAGGG CGCCAAATTG AAGAGAGAAG GCAAATTCTT TAAACTGACT GTTGAAGGCC TAGCCGAAGG CCGCCCTTCC GTTTTACGAG GCGATATGGT AAATGTCACT TGGAATCGTC GTTTGTACCA GGGCCGTGTC GAACGAACGC TATTGCTTGA AGTAGTTCTG GAATTCCACC AATCCTTTTC TCGCTCATTT AATCCAACGG CTGACTCCGT TGATGTGAGA TTCACATTCA GCCGCATGAC ATTCCGTATT TCTCACGAAG GAGTGATCGA AGCACAGCAC CAGATGAAGG ATCCACTGTT TCCTAATATT GAAAGTGTTG CCTCAATCAA AGGTACACAA GTCGGGCGGG ACAATTCGTC GCGTCTGGAA TGGGGAAATA CTGCTTTGAA CGACGAACAA AGACTTGCTG TTACAAAAGC TGTGGAGGGT GCGCTGCGTC CACTCCCTTA CATTATCTTT GGTCCTCCTG GTACAGGTAA GCGCGAGTCT ATTGATGTCG AAAGCGTCGC AACCGAGTCC GTAGCCGTAA CACAGTGATC TCTTCTTTGA TGCTTCTAGG GAAAACCACC GCGGTTACCG AAACTATCCT CCAGTTAGCA AGGCACAAAA ACGGACTAAA AATTCTTGTC ACTGCACCAT CGAATGATGC TGCCGACGCT CTCGTTGAAC GACTGGTGTC GTACTTTTCT CCATCTGACC TCAAGCGCGT GATAGCGTAC AGTAGAAACG TAGATAGCGT CCCCTCTTTG ATTCGCAAGT ATACTAAGGA AGGGTTGACC AGTGATGGGC AGCTCAATCA GATTTTGTCC GGGCGTATTG TGGTTTGTAC CGTCAATTTA GCGGCGCGGT TCTCTCGGCT CGGCGTTCCT CGTGGCTTTT TTGATGTCCT CTGTGTCGAC GAGGCTGGAC ATGCTAGTGA ACCCGAAGTA GTTTCAGTAG CATCGACGCT ACTAAATTTC AGCCATGCTG ACGAACAACT CGGCGCCGGT CAAATTATTT TGGCAGGCGA TCCGAAACAA CTTGGTCCAA TCGTCACTTC TGATCTCTGT CGGCGCTATG GGATGTCTAC ATCTTACATG GAACGTCTAT CTAAGCGTTC AATCTACTAC AAAGAAGACG GACAATACCC GGCGGAATTA ATTACGAAGC TGGTCCAGAA CTATCGATCC CATCCTGCAA TAATCGAGCT TCCGAACAAG ATGTTTTACG AAAACGATTT ACTGTGTCGT GGGGATACTA AATGTACTCA TAGCCTCGCG AACTGGGAGA AATTACCTGT GAAAGGCTTT CCAGTTATGT TTCATTCAAT GACGGGTGAG AATCTCCGTG AGGGATCCTC TCCATCTTGG TTCAACCCCG AAGAAGCAAT TCAGTCTTTC AACTACGTAG ATATCCTTTT AAACCACTCT CGGCCACCCT TAAAGCAAGA TGATATTGGA GTGATAACTC CATATGCACG CCAGGCCCAG AAGATTCGTT TACTGTTGAA GAATCGTGGC ATCAACGACG TGAAGGTGGG ATCAGTCGAA ACCTTTCAAG GACAAGAGAG GCAGTGCATC ATTCTGTCGA CGGTCCGATC AGAGACTGAA CACATCAAGC ATGACCTGCG ATTCAACCTT GGATTTGTCG CCAGCGCCAA ACGCTTCAAT GTGGCACTCA CCAGGGCCAA AGCGCTTCTC ATAGTGATCG GTTGCCCGAC AGTGCTCAGC CTTGACAAGG ATCATTGGCT TCCATTTCTA AGGTACTGCC GTGAGAACGG CTCTTGGGTA GGTCAGCCTT GGAAGGAACC AGAAACGGAC GTCTCTTCAG ATCCGGGTAT TGCTTCTCAA CTTGAAGACA ACATTCGGGA GTGTATCGAT AGCCCGAGCC AGGTAGTCGA GCAAGAGGGA TTAGCATTTT CTGGTAGGGT TTTAACAGTA AATGCAGGCG ACACAGATAA AACCTGTTAA AAAAGCACTT CCCTCTTTTG GCAGACGATG CAATTTTGCC ACACCATGGA CAGCAAGCGA GTTACAAACA GAAGTACAGG AAAAAACAAT GCAACTGCCC AGCGGAGGAC ACGAACTGGG CCTCTTTGCA CTATAGACAT CAGCAATCGA GCCACTAGGT GGCTTGCCCC CTCGACTTTT GGGTTAACAA GGAGCTGGAG AGCCATTCCA GGCGCAATCC ATCTCTGAAA AAGTGGTTTT GGCTTAAGAA CTCCTGTAAG CTCGTCAAAG TCTCCAGTGA AAAAGGTAAC AAACACATCC ATGAAGCAAA TAAAGCCCAC AAAAGCGAAG AACCTGTGAA TGCCGAATTC GAGCACCTGA ATGTATATAG TTTGTAGCCT CTCGTAGTAC TCCGAACAAT ACCAAGGCAG GGAGTCTTGG GCGGCATGAC GATGCTTTCT CTTCAAGTAT ATGATTTTGC GTTGGATGGC TCGTAAGATG AGTGGTCCCG ACAGGACACT ATTTTCTACA GCACAGACTT CTTGTTCCCG
|
Protein sequence | MPEYTLENLP RKATIEELSA YLAKVQVSFS SVGPIHKAGK GSVAKVVLPK DIDLRKTRKE FLTSPLEGCV VQLHCKSFIA EMKNNKSKNG LADRTFLKKP PQGTSGNSRR SKDRVPGPSG KFKKHFHARK DHKVATRVVP DFIPTQDRVG HTGRQDERLT MNRTRLVPTK QVNDQVAVTF AELDRKIPSP KEMKEIQAQV LKSRKQAEAD KNGISISKAP TEADIVPLHR GYFFSLRITS TTAPAILEKI DIGGTHGKSF RYVGQKLPHT VAEKTEANFI FLPNFPGVFR VGLTFQFSTC SILRTLTFKG GDKKMHKEVE PQSLYERKIS NIHSQKPIEI FAPPPEGNHN GTRNPFANLP LYKIPANVEK MIGNPEFENA IELPHEDGRN YAKFYQSMLW ASELQAYRDI KLFDLEGAKL KREGKFFKLT VEGLAEGRPS VLRGDMVNVT WNRRLYQGRV ERTLLLEVVL EFHQSFSRSF NPTADSVDVR FTFSRMTFRI SHEGVIEAQH QMKDPLFPNI ESVASIKGTQ VGRDNSSRLE WGNTALNDEQ RLAVTKAVEG ALRPLPYIIF GPPGTGKTTA VTETILQLAR HKNGLKILVT APSNDAADAL VERLVSYFSP SDLKRVIAYS RNVDSVPSLI RKYTKEGLTS DGQLNQILSG RIVVCTVNLA ARFSRLGVPR GFFDVLCVDE AGHASEPEVV SVASTLLNFS HADEQLGAGQ IILAGDPKQL GPIVTSDLCR RYGMSTSYME RLSKRSIYYK EDGQYPAELI TKLVQNYRSH PAIIELPNKM FYENDLLCRG DTKCTHSLAN WEKLPVKGFP VMFHSMTGEN LREGSSPSWF NPEEAIQSFN YVDILLNHSR PPLKQDDIGV ITPYARQAQK IRLLLKNRGI NDVKVGSVET FQGQERQCII LSTVRSETEH IKHDLRFNLG FVASAKRFNV ALTRAKALLI VIGCPTVLSL DKDHWLPFLR YCRENGSWVG QPWKEPETDV SSDPGIASQL EDNIRECIDS PSQVVEQEGL AFSGRVLTVN AGDTDKTC
|
| |