Gene Information |
Locus tag | PHATR_43917 |
Symbol | |
ID | 7204358 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 419109 |
End bp | 421842 |
Gene Length | 2734 bp |
Protein Length | 784 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186341 |
Protein GI | 219113515 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.373616 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGAAC CTGCTGGAAT TGTAGAACTA CGGCGGTACG ATTTACGAAC GAGACTCTAG AGTGACCGTG ACTACGGTGG TGGTGACGGC GAAGGATCGG CAGTCCCTAC TATAACGACG AGAGAGACAG GGTTTCTACA AACGAGGGCT TCGGTGTCCA CGATTTACGG TGGCTTCGCG GTAGTTCCGA ACCGGAAACG TTCCCAGCAC GACCCAACGC AACCCCGCAC ACCACCACGA CTACCCGACG AGGCTTCATT GTTTCCAGTC GCTCCGGCCG GATCCCTCCC CCTCGTTGGG TACACACACA AACACGTCCG CAATTCCTTC CGTTGCGTTA CGGTCCGTGC TGGAATGAAG GACACGCAGC AACTACAACT CTTGGTCTGT TCCGCTAATT TGGGCAACGA GCAACCCGAT CCCGACTCCT TGGCTGCTTG GATTCCCACG GATGGCGCTT GCCAAGCCGT TCTGCGTCAA GATGCACCGT ACCCCGTCCA ATCCTACCAC TACAGTAGCA GTACCAACCA CAATAATCAC AACAACAACA ACAACACGAG CACTCGGTCG TTGGAGGCAG ACAATGACGT GCCGTCGGAT GCGTCCCGTT ACACCGACAC GGATCAGTTT GATATGATCG TCATCGGCAT GCAGGAGGCG ACCTTTGACC CACCGGCGGG GGCGACCCGC GTCGGGACCC GTCATTCGTA TACAGGTACC GGTCGTCCAA CCCATTCTCG AGAAGGGCGT CAAGAAAGCC TTGGATACGG TCAACAATCT GACCGCCACG CGTGATCATA CCAAAAAGAC CGCCAGTATC GGTGTTCCCG ACTGGAGTGG AGGAACGGCC GCGCTGCACA AAATGCTCGA ACTCAGTCTC CCGTCCTACC AGCATGTCGT TAGTTACCAA CGAGGGCAAA TGCGACTCGA AGTCTTTACA CATGCCCAGC AGATTGACGT TCGAGTCCTC CGCACGACGG CACAGAATAC GGGACGCGGA GGATTGGCCA ATAAGGGAGG CATCGTCACG GAGCTGCTGG TCAATGAACA CACGCGGTTG GCCTTTATGA CTACTCATTT GGAAGCGCAC GAAGGAGCCA GCAAGTGTGC CATTCGCAGC AGCAGCCTCG GGGACATTCT GGGTGGGACC AAGACCAAAC TACACGACGT TTCGCAGACG GCTCACTACA CGTTCGTCCT CGGAGACTTG AACTTTCGGA CCGAGTTGCC CGGGTCGCAC TTTTTGTCCG AAGAAGATCA CAAGGCCCGA GTGTGGAATT GGGTCGCAAC ACGAAACTGG CAAGCTTTGA ACGATGCGGA TGAATTGCAA CGGGAACTCC GGAATAAAAA CTGTCTGGTG GGATTTCAAA CGTTGCGGTG CAATTTTCCA CCCACCTTCA AAGTAGAACG ACAGCCCGGA TACCAGTACA TTGACAAGAG ACGACCATCG TATACCGATC GGATCTTGTG GAAGACCGGG CATCAAATGG AACAGGGGGT GGTACCGTTG GTGTACGAGC CAATTGATGT CTTCGCGTCC TCGGATCACA AGCCGATTCG GGCCGCTTTT GGCGTCGACT TGAACGACTC GTTCCGGATG CGTCCTAAAC TGCACCGGAA CCACTCGCAC ATCAATTTAT CCACATTGCT GCATCGGAGA AACAGCCAGC CGAACCTCCG TCAGCCCGTT GTCGCCCATC GCGACAAGCT GCAACTCTTC GTTTCCGGCA TGGCGTGCCA GCTTTTTCCA AATCGAGATT TGCCTCCGAA CCCGTACTTG TGTTTGGTTT CTTCCCCGCC 
CGATGCATTG CAAGTGAGTA CCAGTCGTTG GGTGCAAACG AAAAGTCTAC TTTTTTGCCA GACTTTGGGT AAAGTCGGTG GTCAGGCTAC GACGGCGCAA GGCTGGCCGC GATCGAGTTT GCAAATGTCG ACCAATACAC CAAAATGGGA CGGGGAAGAG ATTCACTGCC AAATACAAAC TCACGGAGCA GATGGTTCTT CGTTGGACCT GACTGGTGCG CTCTTACACT TGACGATTAT GAACTATAAT CCGAGCAACC AAGATTCGGT GGTGGGAACC TTCACGTTCA ATCTCGTGAA TCTGCTGCGC TCTTGCCGCC GGACGAGCCC AGAGACTAAT TCGGGTTCCG AACGAGATCT GCAAGGGAAG AGAGGACCCC GAAATCTGTT TCGTCGATCG CAAGCACAAA CTGCTGTTTG GAACGATCCA ATCACCAGCG TTGACCTTGA CGAATCAATC TGCTTGAATG GTATGGAGAC GGGAAGGATT CAGTGTACGA TCGATGCGTG GTGGATGGAT ACAGCCAAAG CCTTGGCGAC CAGACCTGGC GAAGACCGCC GTAGTGCGGT CCATCGATAC ACTATCGCCA ACCAGCAGCG ACGTGGTTCG AACGACATGC AACCGTCGAG TGGCGCAAGT ATGAATGAGA CTTGATCGAT TTCATGTTTC GATTAGTCCT GTTCCACACA GAAAGGCATC GGGGATAAAC GCAAACGAAG CCAACAATCC TATGTGTGTA CACAAATGAA GCCAAAGATC CATACAATCA GCGTTACTGA CAGCAAAGAT ATCTACTATC TTTCTTCATT CACATTGAAA TCCCTTTACT TTCTCGGTGA CAACTGAACA AGAAAAAGAG CCGTAAAAAG CCCTACCGTT CACAGCCGCT CAACCTTGCA AAGCTAATCT TACACATGAA CATCCATTCG ATTC
|
Protein sequence | MPEPAGIVEL RRDRDYGGGD GEGSAVPTIT TRETGFLQTR ASVSTIYGGF AVVPNRKRSQ HDPTQPRTPP RLPDEASLFP VAPAGSLPLV GYTHKHVRNS FRCVTVRAGM KDTQQLQLLV CSANLGNEQP DPDSLAAWIP TDGACQAVLR QDAPYPVQSY HYSSSTNHNN HNNNNNTSTR SLEADNDVPS DASRYTDTDQ FDMIVIGMQE ATFDPPAGAT RVPVVQPILE KGVKKALDTV NNLTATRDHT KKTASIGVPD WSGGTAALHK MLELSLPSYQ HVVSYQRGQM RLEVFTHAQQ IDVRVLRTTA QNTGRGGLAN KGGIVTELLV NEHTRLAFMT THLEAHEGAS KCAIRSSSLG DILGGTKTKL HDVSQTAHYT FVLGDLNFRT ELPGSHFLSE EDHKARVWNW VATRNWQALN DADELQRELR NKNCLVGFQT LRCNFPPTFK VERQPGYQYI DKRRPSYTDR ILWKTGHQME QGVVPLVYEP IDVFASSDHK PIRAAFGVDL NDSFRMRPKL HRNHSHINLS TLLHRRNSQP NLRQPVVAHR DKLQLFVSGM ACQLFPNRDL PPNPYLCLVS SPPDALQTLG KVGGQATTAQ GWPRSSLQMS TNTPKWDGEE IHCQIQTHGA DGSSLDLTGA LLHLTIMNYN PSNQDSVVGT FTFNLVNLLR SCRRTSPETN SGSERDLQGK RGPRNLFRRS QAQTAVWNDP ITSVDLDESI CLNGMETGRI QCTIDAWWMD TAKALATRPG EDRRSAVHRY TIANQQRRGS NDMQPSSGAS MNET
|
| |
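The Start bp, End bp, Gene Length and GC content fields above are related by simple arithmetic. The following is a minimal sketch of that arithmetic, not the database's own method. It assumes the coordinates are 1-based and inclusive (which is consistent with the listed length of 2734 bp) and that GC content is the plain fraction of G and C bases. The function names `gene_length` and `gc_percent` are hypothetical and introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature from its coordinates.

    Assumption: coordinates are 1-based and inclusive, so both
    endpoints count toward the length.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace is removed first, because the record prints the
    sequence in space-separated blocks of 10 bases.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the record above.
print(gene_length(419109, 421842))

# First two 10-base blocks of the gene sequence, as a short illustration.
print(gc_percent("ATGCCCGAAC CTGCTGGAAT"))
```

Applying `gc_percent` to the full gene sequence should give a value near the listed 53%, provided the database uses the same definition. The strand field does not affect either calculation, since a sequence and its reverse complement have the same length and the same G+C count.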