Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_23838 |
Symbol | |
ID | 7198891 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | + |
Start bp | 340651 |
End bp | 347029 |
Gene Length | 6379 bp |
Protein Length | 1879 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185018 |
Protein GI | 219129695 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGGCCGGCAG GATCCGGCAA GTCCAGTCTC GTTCGAGAAC TTGCAAGCAT GTTTGCGTTT GACGAACACA CCGCAGCTCA CCATGACGAT CAGCTGCTAG AGATTCATGT CGATGAAGAA ACGGATACAA AAACTTTAGT TGGATCATAC ACTACGACCG ACGTTCCCGG CGAGTTTGAG TGGAGACCAG GCGCCTTGAC GAGAGCGATT CGATCTGGTA AATGGGTACT GTTGGAGGAC TTGGATTCTA TTCCCATTGA GATCCAGGCT TCGTTAGTGC AGTTGTTAAA AAGTAGACTG TTGCCACTAG GGAACGGAAA AGTCGAGCGA TGTCACCCAA ACTTTCGTCT TTTCGGAACG TTGACTATTC TGCCCGATGT GATTCATCCA GGATCAGTCG GTGGCAAACG AATACTGAGC CCCACGATGT GGAGACATAT ACGAATAGAA CCTCTGCCTC TTTCGGAGCT GAAAGAAATC GCTGTTTCGA TGCACAAAGA GGTCCCCGGC TTCGTTGTTG ACTGTGTTTT GAAAGTCTTT ACTGCCCTCG ATCAAAGTGG TCGAACAACC GCAATAGAAG ACGATTTTAT GGACGGCCTC CAACTCTTGA CACGGGCAAT AATTGGAAGA AATCCTTCAG TTCGGGATCT TTTTAAAGTT CTGTCGCGCA TAGCGCACAA CGTTGTGTTT GAATCCGGTG TGAGCTATGC AACAGAAGGC CAAAGAACCC TTTGCTTGGC CGAAACGTAT GATATTTTTG TCGCAGCTTG CCCTGATCGG AATTTTAAAG AAGAATGTCT CAGAAGTATT TTTGCACCAA CTTGGGGGGT GTCGGCTGAC TTGGCTCTAT CTTACATTGA TAGAAGGCGG CCGGAACCTG TAGTTTACCC AGCGTACACT GAAGTTGGTA GAGCAAGAAT ACCTATTCCT GAGCTTTATA TTCGGCCGAA CACTGACAAA GAAACGTTTG CTCAAACAAG CTATGCTCTT CGCTTGATGG AGTCGATCGG AGTCTGTATC GCGGAAAATG AGCCAACTCT TTTGGTCGGT GAAACCGGCT GTGGGAAGAC AACAATCCTA CAGCATCTCG CGAGACTTTC TGGAAGAGAT CTTGTTGTAC AAAATCTCTC TTTGCAAACC GATTCAACAG ATCTTCTCGG TGGATTTCGA CCGCTGGAGA TACAGCAGGT GGCTCGAAGT GTATACCAGA ATTTTGTAGA CCTTTTCACG TCTTCTTTCT CTAGAAAGCA GAACGCTGAT TTCCTGGCAT TTACCTCTGA TGCAATGAAA AGAGGGAACT GGAAAAGATT GTCTCAATGC TTCAGAAAAG CAGCCAAGCT TGGAATAGAT AAGGTCAAAG AACGAAGCTG GGATTCCGAA TCGTCTTCGG TGGCAGCTTC CTGGAGACGT TTTGAGAAGA GCACTGGGCG TTTCGAGCAG CAACGACTTG CTTGCAATTC AGGGTTGGCC TTTGTCTTCG CGGAAGGTGC GCTAATAGAG GCTATTACAA GAGGGAAGTG GGTATTGTTG GACGAGATCA ACTTGGCCAG TTCGGAAACG CTTCAAAGAT TGTGCGGGCT GCTTGATGAT CCGACCAGCA GTATCACTTT GACCGAAAGA GGTGATGGTG AACCCGTCGA ACGGCACCCT GACTTTCGGT TATTTGCAGC AATGAACCCA GCAACCGATT CAGGAAAGAA AGATCTGCAC GCTAGTATCC GGTCTCGATT TACAGAGTTG TACGTAGACG AATTGCTCGA TCCGCTGGAG CTAAGGGTGG TTGCCGAACG CTACATTTGT GCAGTTCTTC CGGTGACTGA CACACCTCCT GAACACACGG AGACTGTGGT CAAGGCCGTT GATGTATATC TCGAATGCCG TTCACTCGCG GAACGTGTCT TGGTCGACGG CAGTGGTCAA AAACCTAGAT ACACTCTTCG CACTCTATCA AGAGCCTTGA CTGCATCCCA AATCTACGTT CTTCAGCAAA AGCTCCCACT CCAGCGAGCA CTTTATGAAG GCTTTGAACT TGCATTCCAG GGCCCGCTAA GTCAGACTTC ACTGAAACCT TTTTGTGATC TTCTCACCCA AGCATTTGCT TCCGGGTTGA AAAAGGAAGA GATGGATCAT CCCGGTCGTC GACCAGGAGG AACCGCTGCA GTAGATCGCT TCGCACTAAT TAAACCTTTT TGGATTGAGA AAGGTCCCCT GGAGTCGATA GATTGGTCGG AGCTCAACAG TAATGCCAGG TCAAAGTTTG TTTTGACACG GAGCACCACT AACAATTTAC GTCGACTCGC CCGGGCAGTA GCATCAGGGC CTTGGCCGAT TCTTTTGGAA GGGCCCACTA GCGCCGGGAA GACATCCTTG GTGGAATACT TAGCGGCACG TTGTGGTCAT CACGTTATTC GTATTAACAA TCACGAACAC ACCGACGTGC AGGAGTATAC GGGGAGCTTT GCCGCCGACT CGAGTGGTCG TCTTGCTTTT CAAGACGGCT TACTCGTTCG GGCATTGAGA GAAGGTCACT GGGTTATTTT GGACGAGTTG AATTTGGCAC CAAGTGAGGT ACTCGAGGCC TTGAATCGTC TTTTGGATGA CAATAGAGAG CTTTATCTCC CTGAAACCAA CGAGACTGTC CGTCCTAAAC ATGGATTCAG GCTTTTTGCG ACACAAAACC CAAGCGGCGT ATATGGAGGC AGAAAACCGC TTTCTAGGGC CTTTCGAAAT CGATTCACTG AACTTCACTT GGACGATATT CCTAGTAGCG AAATGATTAC CATTCTAGAG AAACGGTCCG GATGTCCACC TCAACACGCC AAGATTTTAG TCTCAATCAT GGATGCTCTT CGACTGAAGC GTAGTAAAAG TGGAGTGTTC TTGGGAAAGA ACAGCTTCAT TACTCCACGG GACCTGTTGA GGTGGGCAGA GCGACATGCA ACCGGGAAGA TTGAACTAGC CCACGAAGGT TTCATGCTTT TGGGAGAGCG GTTGCGGACA AGGGAAGAAA AGGATTGTGT CCGAGAAGAA ATTGAGAAAC ATCTAAACGT AAAAATCGAC CTGGAGACAC TCTACTTCAG TGAGTCCTCG GAAGCGAGAT CAGTTCTCCA CCGTCTGTTC ACTAAATCAT CCGATCCGAA TAATCAGAAA CTGTTAGCTA CGATCGCCCC GACGAGGTCT TTGCTGAGGC TAGTCACGTT GGTGCTCCGC TGTGTCAGGC AACGTGAACC AGTTCTTCTC GTTGGAGGTA AGCGGAAGGG AAACCCCTTT ATTGTTCTTT TGCCGTTACG TATGTATCTC AAATACCGAT TCTTGTAGAT ACGGGATGCG GAAAAACAAC AGCGGTTCAG CTTCTCAGTT GTGTATTGGA GTCCGAACTT CATACCATCA ATTGTCACGC CACAACCGAA ACCTCTGACC TTTTAGGTGG GTTGCGGCCT GTTAGAGGTC GTCCAAGGCT GGTTCGAAGA ATGCTGGATG CGACAGTAAA TCTCTGTGGT TGTTGGCCAC AACGCGATGC TATTGCTTTA CAATCAATTC CCGGATTCGT CAATGAAGAA TTGGCAAAGA CCACAAAAGC ATCGGAGGTC TATACCGGAA TATTACCAGA TGATGCAGTC GACCAAATGG TTGAACTAGC GAGGGCCTTG TGGAAATCAC GAAGTGGGAT TATTCAATCG GAGCCTTCCT CTAGACGGGC GAAACGTCAA AAAGTAGACG GAAAGGAATC TCCCAGTAGT AGTTTTGAGC GTGACAATTG GGATCAAGTT TCGTGCCTTG TCAAGGAAGT CGAAGATCTG GCTCGGGCAT ACAATGCTTT GTTTGAATGG TCCGATGGTC CGCTTGTGAA AGCGGTCAAG GCTGGAGATA TGATCCTGCT CGATGAGATA AGTCTTGCCG AAGATGCAGT TTTGGAGAGA CTGAATTCGG TGTTGGAACC ATCGCGCACC CTCGTTCTGG CGGAGAAAGG CTCAGCAGAC GACGTTGAGA GTCGAGTTCT CATCGCGCAC GAATGCTTTC AGCTTTTCGC GACCATGAAT CCTGGGGGGG ATTTTGGAAA GCGTGAGCTC AGCCCGGCAC TTCGAAGTAG ATTCACAGAA ATTTGGGTAC CTGCTGTTAC GGATAGTGCC GATATTGATT TTGTCCTAGA GCGCTCCCTA GCACAAAGTG CTTTCGAGGA AGGGACACGA AATGACGTAA AGCGGAAAAT GATAGAATAC TTCAACTGGT TCAATGAACA CCTCTGTACC GATCCTTCGC GTCCCTATGC CGAACTGGCG CTTTCTCTCC GTGACATCTT GACGTGGGCA GCTTTTATTA CAGCTTGCAC GCAGGCTGGC GACGACCTTG ACATATGGGG AATATACTAC CACGGCGCTT GTCTTATGCA TTTGGATGGG CTGGGACTCG GAACTGCACT TGCCTCGCAC GATTCCGCAT CGGCCAAAAA TAAAGCCATT GAATTCTTGA TGGAACATGT TCCAGTAGAT AAAAGGTCAA CGCTTGGTGT GATGACGAAT GGCTTCAATG TTCTGGATGC AAAATTTGGT TGCAATACTT TCCGTATTCC GGTCGGTCAA GATCATGTTG CGGAAGCTGA CTTCAACTTG GAGGCTCCGA CTACATCTGT AAATATTTTC CGAATTCTCC GGGCAATGCA ACTTTCAAAG CCAGTACTGC TCGAAGGATC CCCGGGGGTG GGAAAGACGA CGCTTGTGGC TGCGCTGGCA GCAGCTTCCG GGCATAAGCT TGTGAGAATC AACCTTTCCG ACCAAACCGA CATTTCCGAT TTGTTCGGCA GCGATCTCCC TGTTCCGGAA AAGGATGCGT CCGGGAATAA CTCAGCCTCC TTTAGTTGGT GTGACGGCGT TCTACTGTCC GCTATCAAAA AGGGAGACTG GGTGCTATTA GATGAATTAA ACCTAGCGTC TCAGGCGGTG CTGGAAGGTC TGAACAGCTG TCTCGACCAT AGAGCAAGTG TTTACATTCC GGAACTAGGT AGTTCTTTCA GATGTCCTGC GTCCTTCCGA ATTTTTGCCG CACAGAACCC GTTGGCACAA GGTGGAGGGA GGAAAGGCCT TCCTAAGTCG TTTTTGAATC GGTTCACCAA GGTTTTTGTT GATTCTTTAA CAGACGACGA TCTTCGCGTT ATTTTGACAG CAAAATTTCC ATCATTGGAT GAAAAGCTCA TCGACCGTGT GATTGCATTT AACAGTCGGA TAAACTGTGA TGTTGTTCAA GATCGCCTGT ATGGTCAGTC CGGTGCACCG TGGGAGTTCA ATTTGAGAGA CGTTTTTCGA TGGTGTGAGC TGCTACGATT TGATCTTTTC GGGCAAATAT CAATTTCCCT TCCTCTGGTG CTTGCCCGAG ACCTGTATTT TCAACGTTTC AGAGCACAAC ATGACCGGGA AATTGCGAAA GAACGTTATC TGTCAATCTT CGGATCCGTG GATCAGAAGG ACCCCAGATG TAAACTAGCA ACGACTCCGC TCGAAGTCCT TGTGGGTAGT GTCAGATTAG CACGGTTGCG AGGTCCGGAT ATCACACGCG GTCCCTATGT CGTAGAGAGT CCGCTTTCGC TCATTCACTT TCAAGCACTC GAAGCGGTAG CTCGATGCAT TTCAAAAAAT TGGCCTTGCC TACTTGTAGG GCCGGCGGTA GTGGGGAAGA GCTCGACAAT TGGTTGCCTC GCCGAATTAA CAGGGCGAAA GCTAGTCGAG ATATGTCTGT CTCCATCCTC TGACGTTTCT GAGCTTGTGG GGTGCTTTGA GCAAGTTGAT GGACTTGAGG ACTACAGGGT AGCGTTCGCG TCTATCTGCC GTGTGGCTGA GGAATATCTG CTGCTTGGCG GATCGGAGAC TGAAAAAGTA TGCGATTTAC TCAATTCGAT CCGATCGGCA GACGACGAAG TAGAGGGTAT GCGTTTTGCG CAGCACATTG CGTCTCTGCT TTTTACAGAG CTCAGATCTA GAGTCGATCT GAACCCCATC GTATGGGATG ATCTGGCGGC TTCTCGCGAT CTTTTGCTCA ATTACAATGG AAAAATGTCA ACGATAGGCG GTCACTTTGT TTGGAAAGAC GGTATTCTTA TCGAAGCGAT GGAAAAGGGT TACTGGCTTC ATTTAAAGCA TGTCAACCTT TGCCCAGCTT CGGTGCTTGA TCGTTTGAAT TCTGTGATGG AACCAAATGG AAGTTTGTTG TTGGCAGAGT GCAGTTCAGC GGGGGAGGAC GGAGGAACAA GGCATCGTCA AGTCCATTGC CACCCTGACT TTCGGATCTT TCTCTCTATG AATCCTGAAT ATGGGGAGGT TTCAAGGGCG ATGCGGAACA GATGCGTGGA GATAGCGTTG ATGGACCACC CGGCTCTGGC ATATGAGGG
|
Protein sequence | MFAFDEHTAA HHDDQLLEIH VDEETDTKTL VGSYTTTDVP GEFEWRPGAL TRAIRSGKWV LLEDLDSIPI EIQASLVQLL KSRLLPLGNG KVERCHPNFR LFGTLTILPD VIHPGSVGGK RILSPTMWRH IRIEPLPLSE LKEIAVSMHK EVPGFVVDCV LKVFTALDQS GRTTAIEDDF MDGLQLLTRA IIGRNPSVRD LFKVLTLCLA ETYDIFVAAC PDRNFKEECL RSIFAPTWGV SADLALSYID RRRPEPVVYP AYTEVGRARI PIPELYIRPN TDKETFAQTS YALRLMESIG VCIAENEPTL LVGETGCGKT TILQHLARLS GRDLVVQNLS LQTDSTDLLG GFRPLEIQQV ARSVYQNFVD LFTSSFSRKQ NADFLAFTSD AMKRGNWKRL SQCFRKAAKL GIDKVKERSW DSESSSVAAS WRRFEKSTGR FEQQRLACNS GLAFVFAEGA LIEAITRGKW VLLDEINLAS SETLQRLCGL LDDPTSSITL TERGDGEPVE RHPDFRLFAA MNPATDSGKK DLHASIRSRF TELYVDELLD PLELRVVAER YICAVLPVTD TPPEHTETVV KAVDVYLECR SLAERVLVDG SGQKPRYTLR TLSRALTASQ IYVLQQKLPL QRALYEGFEL AFQGPLSQTS LKPFCDLLTQ AFASGLKKEE MDHPGRRPGG TAAVDRFALI KPFWIEKGPL ESIDWSELNS NARSKFVLTR STTNNLRRLA RAVASGPWPI LLEGPTSAGK TSLVEYLAAR CGHHVIRINN HEHTDVQEYT GSFAADSSGR LAFQDGLLAL NRLLDDNREL YLPETNETVR PKHGFRLFAT QNPSGVYGGR KPLSRAFRNR FTELHLDDIP SSEMITILEK RSGCPPQHAK ILVSIMDALR LKRSKSGVFL GKNSFITPRD LLRWAERHAT GKIELAHEGF MLLGERLRTR EEKDCVREEI EKHLNVKIDL ETLYFSESSE ARSVLHPTIA PTRSLLRLVT LVLRCVRQRE PVLLVGDTGC GKTTAVQLLS CVLESELHTI NCHATTETSD LLGGLRPVRD LARAYNALFE WSDGPLVKAV KAGDMILLDE ISLAEDAVLE RLNSVLEPSR TLVLAEKGSA DDVESRVLIA HECFQLFATM NPGGDFGKRE LSPALRSRFT EIWVPAVTDS ADIDFVLERS LAQSAFEEGT RNDVKRKMIE YFNWFNEHLC TDPSRPYAEL ALSLRDILTW AAFITACTQA GDDLDIWGIY YHGACLMHLD GLGLGTALAS HDSASAKNKA IEFLMEHVPV DKRSTLGVMT NGFNVLDAKF GCNTFRIPVG QDHVAEADFN LEAPTTSVNI FRILRAMQLS KPVLLEGSPG VGKTTLVAAL AAASGHKLVR INLSDQTDIS DLFGSDLPVP EKDASGNNSA SFSWCDGVLL SAIKKGDWVL LDELNLASQA VLEGLNSCLD HRASVYIPEL GSSFRCPASF RIFAAQNPLA QGGGRKGLPK SFLNRFTKVF VDSLTDDDLR VILTAKFPSL DEKLIDRVIA FNSRINCDVV QDRLYGQSGA PWEFNLRDVF RWCELLRFDL FGQISISLPL VLARDLYFQR FRAQHDREIA KERYLSIFGS VDQKDPRCKL ATTPLEVLVG SVRLARLRGP DITRGPYVVE SPLSLIHFQA LEAVARCISK NWPCLLVGPA VVGKSSTIGC LAELTGRKLV EICLSPSSDV SELVGCFEQV DGLEDYRVAF ASICRVAEEY LLLGGSETEK LRSRVDLNPI VWDDLAASRD LLLNYNGKMS TIGGHFVWKD GILIEAMEKG YWLHLKHVNL CPASVLDRLN SVMEPNGSLL LAECSSAGED GGTRHRQVHC HPDFRIFLSM NPEYGEVSRA MRNRCVEIAL MDHPALAYE
|
| |