Gene Information |
Locus tag | PHATRDRAFT_55114 |
Symbol | |
ID | 7198589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | - |
Start bp | 146920 |
End bp | 149505 |
Gene Length | 2586 bp |
Protein Length | 617 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184743 |
Protein GI | 219129117 |
COG category | |
COG ID | |
TIGRFAM ID | |
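The Start bp, End bp and Gene Length fields above agree with each other if the coordinates are read as 1-based and inclusive. That reading is an assumption, since the record does not state its coordinate convention. A minimal sketch of the arithmetic follows; `gene_length` and `coding_length` are hypothetical helper names for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values taken from the record above.
print(gene_length(146920, 149505))  # 2586
print(coding_length(617))           # 1854
```

The first value matches the listed Gene Length of 2586 bp. The second is the number of nucleotides needed for the listed 617 aa protein plus a stop codon; the record does not say how the remainder of the gene span is annotated.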
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.950731 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCACG CCAGCAGCAG TAGCAGCATT GCTCCTGTCG AATCCCGCGC GGCTCGTTGG CGACAGCAGT TGAGAATCCG GCCCCAGCCG CATACCAACG TAGTTAGGTC ATCATTCGCA CAACTCGTTG CCCAATTCCC CTACCCCTCC TACGACGACA ACGACGGCGA CCCCACCGCC AACAAAGCTT CGCCCAACGC GACCGCATCG AAGGTTTCAG AGTTGGCACA CGGTGGCGTA GTACCAGATC TGGATCCGCT CACGGCGTTG GTCCGGGAAA CATCGGAACA ACAACAACGT CTGGAGTCGT TGGAATTAAA GTATCGCAAG GAAAAGGCGT TGCGCAATCG AGCCGGAAAA GGTGGGGCTA GCGATCAAGG ACGGACTTTA GTCGAATCGG AAGACTATGA TGAAAATGCT GTCACCTTGC AAATGATTGA CAAGGACCTG GCGAGATTGC CGCCCCCAAA AGGCTCCGGA CAAAATGGAT CCCAAAATCT TGCTGGTGTT GTTGTTTCGA AGGACGAGGA TACGGCAGGC ATACCCACCA GTAGCGGTAC TAGCGATGAG CGCATAAAAA CGTTGCGCCG TGTCTTGTAC ATTTACGCCT GTGCTCATGC CGAGGCAATT GGCTATCGAC AAGGCATGCA CGAAATTGCT TCCTACATTT TGTTCGCTTT GGAGTTGGAC CAGCAAGCAG AGGAGAGTCT CGTTGCCGTC GCCACCAGCC AAGAGCAAAT TGCTTCCGAT GCGTACGAAC TACTCGAAAC TATTTTAACA TCGATTGAGT GCGTCTACGA CGCAACGCCT CTACCGGGTC AACACGAAAA ACCACTCGAA GCCAGCGCCC GACGTGTACT GCAAGGCGTG CAAACGTACG ATGCTGCCCT GGCGTTACGT CTGTCTCAAT TAGGCGTACC TCCTCAGTTG TATCTGACCA AATGGATGCG ATTGATGTAC AGTCGCGAAG TCACGGATGT TTTGTCTCTG TGGGATGAAC TTTTTGCTTA CGTAGGCGAA GGCAGCACGC TGGTGACCGT TTTGGAAGCC GTGGCTGTGG GTCGTTTGTT GTCATGGCGT GATCGTATTT GCACCGATCC AGATGCGTTA CACTTTCTCA TGAATTTGCC CATCGAGACA AACGTGCAAC GGTGGCTGGA TTTATCTCGA AAGGTTATTC ATAAACAAGG CATACCGTTG CCACCCATCA AGGCCACGAC ACCGGTCGCG CCAGCTACAT CAATACCCGC GTACGCCGTG CCGCACTCCG CGCCAACCGG GAGCGTCAAT AGTAATTGGA GCCAGCCCAA TGCTTCTCTG ATGTCGACCC CCCAACGAAC GTTTCCGTCA GAGGCTGGCA ACGAATCGGG AGTCTTTTCC GTGGGTCGCT TTTCTTTATC AGCGGTGAAG GAAAAGTTTG AACAGGCGAA ACACACGACG CAGTCGTTGA GCAAACGCTT GTACGACGAA TGGGAGCAGC AACAACATCA CCGCGCCACG GATGCTTTCG AACGCCCTTA CTCCGACGCC TTTCCGGACA GTGAGCACGA CACTCCAACA GCGATCAATG ACCCACTGAC TCAAGTTCCC TACCGCAACG ACGACACGGC GGTTCCCGCC AATGTGTACA ACGGTCAGTC GACGCCGCAA CGGCAATCAC AAGCTTCCCC ACCTACTTTG GAATCGCAGT GGGCTAGCCG TGTGCAATCC GACGTACACG TGTTGCAGAA CTATTGCATG ACCATGGAAC GGAGTCAAGC GCACGTCCCG GGCACCGTGT GGGAAGCCTT GGCCGACTTG 
GAAATGCTGC GACAGGATCT GTCACGTCGA GCAGCCGGGA CCAGGCGAAC GTGAGCGCGA GCGAAAACCT CACGGTCTCC TGTATGGGTT GGTCGGCGGA GCCGGGCCGG CGATAAGGCC GCCAGTGACG CAGTCATTGC TATTTTGGGA TGGTCTACAA CAGAGGGCGG ACTGGAAAAA GTGCGTGCCT TGAAGAGGGG GTCGTACCGG AGAGCCATGG TGTGGTGCTT GGAGGGTGTG TTTCGTTGGC AGCGACTATA CCGGTGCCGA GCCGGAAAGG GAATGGTAAG AGTGATTGTG GTAGTAGTCG AGAAAACGTA ATGATTGCGA TGGAGAGCTG TTTGCCACCT TCTTTGTGAA CGCCAAGGGA TTGCTGACAG TGAGATGCTG ATTGTACCGT ACAGGCGCTT CCCGAACTAA CTCCAGGGCG AAGCCTACTA CGTATGTTCC ACTGGAAACG AGACCAAAAT TATGGAACGA TGTTCTGGTC GAGCATTGAT CAATGAGTTC CCTTGTTGTG ATTAGGGGAA GTTTGGAGGA GAGGAAACAA AATACGATTA TTGTGGACGC GCCGTCACCC TCGAGAATTG GACGGATGAT CCGTTCGTGG ATCACGAAGT GTGCGTCTCG GTCCAACAAT CGATGTGCCA CAGATACAAA TGTACCAACG AACAACGTTC CCGTGGGGAA GAAGAAATCG ATGGAGCGCC CTGTTGCCTA CGGTTTCCGT ACCAGCTACT GTAGTATGAA TGCAATTCCG TAAAGTAGAA TGCTGCTCAC AAGTCC
|
Protein sequence | MMHASSSSSI APVESRAARW RQQLRIRPQP HTNVVRSSFA QLVAQFPYPS YDDNDGDPTA NKASPNATAS KVSELAHGGV VPDLDPLTAL VRETSEQQQR LESLELKYRK EKALRNRAGK GGASDQGRTL VESEDYDENA VTLQMIDKDL ARLPPPKGSG QNGSQNLAGV VVSKDEDTAG IPTSSGTSDE RIKTLRRVLY IYACAHAEAI GYRQGMHEIA SYILFALELD QQAEESLVAV ATSQEQIASD AYELLETILT SIECVYDATP LPGQHEKPLE ASARRVLQGV QTYDAALALR LSQLGVPPQL YLTKWMRLMY SREVTDVLSL WDELFAYVGE GSTLVTVLEA VAVGRLLSWR DRICTDPDAL HFLMNLPIET NVQRWLDLSR KVIHKQGIPL PPIKATTPVA PATSIPAYAV PHSAPTGSVN SNWSQPNASL MSTPQRTFPS EAGNESGVFS VGRFSLSAVK EKFEQAKHTT QSLSKRLYDE WEQQQHHRAT DAFERPYSDA FPDSEHDTPT AINDPLTQVP YRNDDTAVPA NVYNGQSTPQ RQSQASPPTL ESQWASRVQS DVHVLQNYCM TMERSQAHVP GTVWEALADL EMLRQDLSRR AAGTRRT
|
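The GC content field can be checked against the gene sequence with a short calculation. The sketch below assumes GC content means the percentage of G and C among the A, C, G and T bases, and that the spaces in the sequence are only display grouping. `gc_percent` is a hypothetical helper name, and how the record rounds its value is not stated.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G/C among A/C/G/T bases; spaces and other characters are ignored."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# Usage on the first two 10-base groups of the gene sequence above.
print(gc_percent("ATGATGCACG CCAGCAGCAG"))
```

Passing the full gene sequence string to the same function gives a figure that can be compared with the listed GC content.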