Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42638 |
Symbol | |
ID | 7196306 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 665743 |
End bp | 670244 |
Gene Length | 4502 bp |
Protein Length | 494 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176626 |
Protein GI | 219109745 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACCGACGTG TGCCGAGCTA GCCCTTCGTC GCGCACCACT CGACTCGCGA GACATCTTGG AACTATAGCT TGTGCGTCAT CTCTCTAGCG ATAGTGTGTG TGTGTGGGAG AGAGAGTGCC TGTTCCGGCG TCCAGGGGGG GGATCCACGG TCATCTAACT GGAAAGAACC ATTGTCGCAA TGCCGGAGAT GCCGTCGCCA CCACCGCACG TGTCCGTGTT GACTGCGGGA GGTGCGGGTC AACGTCGGAG TACGGGGTCC CGCGACCTCG CCACTACCCT GTCGTCTCCC CCCACAGTCC GGACGTTGAC GAAATCCATC CCGGCAGTAG CCAACGACGT GACCAGCGCC CACGCGCACT CACACCCACC TCCCACACAA GTCCAGAATC CCACCACACG CGATCCCCCA CCACCGGAAT CGCTCATTCT CGAAATGGAA GAGACGGAGG ACGACGCGGT CGCCTCCACC GACGTCCACG ACAATCCCTA CGCCAAAGCC CAGTCTGTCT CGCCGTTTCG AGATATTCCG CTCTCGGCCT CCACCGCGGC ACGCGCCGAC GAATCGTCCT TCCACGCCGG TCACCTGCTC GAACATTCCA AATCACACGA ACTCTTTTGG TTGACCGTAT GTTTCCTCGG TATCATGGCC AGCTTCGTTT GCTACGGATT GCTGTTGGAG TACACCACGT CGGGAGACCG AGAATTGCAC GAACTCAGCT TTCTCTTTGT CACCTCCGGC CTCTACACCC TCACGGCAGC CGCCGGACGA TACGTACGGG ACGAAACACC CTCCACCATT CCGCCAGCAC GCTTCGCCAT ACTCGGACTC ACCTCCATGG GATCCACCTT TTGCTCCGTA CGATCCCTCC GGTACGTGGC ACGGGAACAA TATTCACTAT TTGTGGCTTG CGCGCACACA CACTTCCACT CACACACACA ACCTTTCTAC TTTTGGTACG CAGATACGTC ATTTATCCCA TCCAAGTGCT CGCCAAGAGC TGCAAACCGG TCCCCGTCAT GATTATGGGA GCGTTCATGG GCAAACATTG TACGTGACCG CCAGAGCGAA CGAACCGCTC ACAAAAAGTC TACTGCCGAG TCGCCGGATG CTCATTCACA ATACCCCCAT TTATTTTTAA CAATTTGCTT GTCACTGACA CACAGATCCC CTCCGCAAGT ACATTAACGT GGTCATGATT GTCGCCGGCG TTGCCCTCTT TATGGGAGGT GGCGACGGCG ACAACAAGAA GAAGAGTGCC AACCAGAGCG AGGACGAAGG CTCGACCGCC CAACTTATTG GTATTCTCCT ACTGTTCGTC AGTCTCTGTT TCGACGGTGG TACCGGGGCC TACGAAGACA AACTCATGAG CGTCCATTCC GTACAGCCCT TCGATCTCAT GTACAACATA CAACTCGGCA AGACCATACT CGCCGGCGTG GCACTTCTCG TACTTAATCA ACTCCACATA TTCCTACAAA TGGTCCAAGA CATGGGCTTT CTCCTCGTGG CACTCGGTCT CAGCGGCGCG CTCGGACAAG TCTTTATCTT TGTCACGATT GCCAAATTCG GAGCCCTAAC CTGCAGTATC ATTGGACTCG CCCGCAAGGT TACCACCCTC GTGGCGAGTA TATACTTTTA CGGACACGTC CTCAACGGCG TCCAATTCCT CGGATTGTGC ATCTCCGTTA CCGCCATGGT ACTCAACTTT TGGGGTAAAA AGGGTGGGGG CGGGCATCAC GCCGGTGGGC CGTCCGGAGC CGGCGCACCC TTGCACGCGG ACACCGAAAC GGAAAAGCTG GTCCGGGAAG ACTCGGGCGA CGCGCACGTG GAACTGACAG TGGCACCCCA GACTTCCAAA GAGTTGGTGT AGCAAAAAAT CAGTGGCACG TTGTTTCCAC GCTCCGTCGA ATCACCGCAC AACATACAGC ACTGTCGACA ATAATAATAC ATATATTTCT TTCCTTGTTT TGCGCATAGT ATCCTCTGTT ACGGTGATCA GAACGAATGC TGAATGCTTG ATTTCATTTT AGAAGGTTAC TCTGCCGTAC TAATTTGTTC GGAGGGCGTG CGTCGAGTAA CAAGAAACGG CTGGCTGCGC GTACCAAAGT TCCTTGGGAA GATGTATTTT GTGTAGCTCC TGTCGGTCCC CGCCTGCTGT CGCAACTGCG GGTGTTGATA AGGCGCGCAC AAGTTCCGTA TGCCTGTTGT TGGTTTAGTG GTGTGTAGAA ATAGCTTGGT CATACCGGAA CGATTCGATC GAAACTTAAA GTGGTCGTAT TCCGCCACAG CAAGTTCTTG TCTATCCTTA ATCTTCTTCA TCCAGACCCG AGCAATGCAG GATTCTGGCG AAGGAGTTCA AATGTAAGGG AAAGTGGTAA TTCATCCTGC ACATGAGAGT TTCGCTTGCA TTCCGTTGCA ACTGCCGTCG CTTGGAGGAA GGGGTACAGC TTGGAAACAC CATCGCGCCG TTGAAGTGAT TCAGGGTTCA AATGTACAAG TCGCTGCAAA AGTTCAAGCA TGCCTTGTAG TAAAGACTCT TTGGTGGATC GGGCTGGTCG GGAACACGCG CGAGCAAACG TGTCAACGAC GTGATGGATA ACCAGCCGAC CACCGTTGTC GGCGACTCGA GCAGTTTCTG GACGTGACAA ATCCAAGGCT GTCCGTAGGA CGCCCAACGA CTCCCGCAGC AGAATTCTAG CTGCGTGCGG CTCCCCTACG TTGTCGTCGC GAGGCCAGTC CCACGCATGC CAAACGCGGC TCGCGGCATA GTGGACCGGC AATCGACCAC GATTGTCAAC AACATCCAGT TCTTCCGGAA ACATCGATGC AACAATATAG ACAATGGCAG CGAGGCAACT TGGAGTGGCA AACGCTATGT GCACCAGAGT CGATTCGCCA ATCGGAGCAT TTCTGCGTCC CATTGTGGCG GCTTGCAACA TGGAAACAAC CTTAGTCCAG AACAAACTAA CAACAACTTC CCGGCGATTC CAAATCGTTG TTGCTGGCGG TTCCTCCCCA TAATTTACCA TCCCCTGCCG AGAGTCGTAT CTTGCACGAA TAATGCGCAA TACTTCGACA GACCGTTTTG CCCACTGCAA AGACTCCAAA TCCCCCAAGA CTTCTAGTGG TATGTGTCTC ATCCGCAAAA AGTCCACCGG AGGGTCCATC CTCTTGATCA GTTGCAGATC CGAATCAAAA TCGCCCTGCT CCAACGCAGT AAAGGCATCG TAGCGCGAAA GCATTTGTCC GTTGTATTCG TTAGACGGCA AGGCAATTTC TGTAGTAGTA TCTGCACCCC GAACATCGAT GACGAGCAAG GAGCTGACAA ACCGCACCCA TAGCCAGTGC AGAGGTGTCA AGCCGGACGC ATCCCGCTTG AGCACGGCTT CCGGTGCGGC TTTGGCAATG AGTGCGACGG AGCGCGGTCG TTCGCCCCGC ATGGCGCAAA AGTGGAGGGG ATTCTCTCCC TGTGTATTGC TTAGGAAGGC AGCCTTGGTT GCCGGTGACG TCGCTTGGGC ACGCTCCGTG TCGAGCAATA ACCCAATGGT GTTGGTGTGT CGTCCGTGAA AGACAGCGGA ATGCAAGGCC GTATACTGCC CTCGGTAACC GGTGAAAACA CAACTAGCCG CTTGAGGACA ATAGCGAAGC ATACAGGCAA CCATCTCGTA AATTTGACGC TCCACTAAAA AATCATCTTC GGCACCGAGT GAAGGAGCAT AGATGTTGGC CTTGATAGCA TACACGATAG GGGGCTCTTC GTATTCGCCC CGGTCCGGTA TGATAACGGC CTCGGGACAA GATCGAACGA GCATATCGAG GATGGGTAAA AGATCACCTG AGTCGGACTG TAAGATATTT GATTGAAATC GTCGTCGGAT CAAGAGGTGG AGAGGCGTAA ATCCGTCAAC GTCTTGTGAC AAGGCTACTC TGTTACTTGG CGTCGCTGTT GCGGTATCAC TACCTTGTTC CTCTTCTAGG GCATCGTCAG CCCTCAGCAA GGCGGCGATC TCGTTAGCCC CCATGCGTTC ACAGACAATG GCTTCATGTA ACGGAGATCC CCGTGAGTCC AACACGTGAC GCAACTTGGA GGGTGCCGCG TGCAAAATCG CCTCCAAGCA TTTCAAGGGC GCTCCCAGAC GACACGCCAA GGCCAAGCAT GAGGGGTCTA CGTCCGCACG AGCCGCTTCC TCGGGGTAAA GCTGTACACG CTCCGTTGCT TCATCCCATA ATGAAGCTTC CAATAACGCG CTCAGGTGAA TGGGCTTATC TATCGTTGCT TTCATTGGGC AACGCCTTGC AGGAGCTGTG CCGCTGCTAG TGCTTTCCAC TCGAGGACGT TTCGACGATC GAGATGTTGT AGGGAAGGAT GGGATCTCCA TCGCCCCTTT GGATCCTGAA CTATCCACTT GGTATCAAAG GCGTGGATTG TACAACAAAC ACCCCGTGGA TGGATGGATT GACTCGCGCT TCAATCCACT GTTGTGGTAG GGCAACGCAG AG
|
Protein sequence | MPEMPSPPPH VSVLTAGGAG QRRSTGSRDL ATTLSSPPTV RTLTKSIPAV ANDVTSAHAH SHPPPTQVQN PTTRDPPPPE SLILEMEETE DDAVASTDVH DNPYAKAQSV SPFRDIPLSA STAARADESS FHAGHLLEHS KSHELFWLTV CFLGIMASFV CYGLLLEYTT SGDRELHELS FLFVTSGLYT LTAAAGRYVR DETPSTIPPA RFAILGLTSM GSTFCSVRSL RYVIYPIQVL AKSCKPVPVM IMGAFMGKHY PLRKYINVVM IVAGVALFMG GGDGDNKKKS ANQSEDEGST AQLIGILLLF VSLCFDGGTG AYEDKLMSVH SVQPFDLMYN IQLGKTILAG VALLVLNQLH IFLQMVQDMG FLLVALGLSG ALGQVFIFVT IAKFGALTCS IIGLARKVTT LVASIYFYGH VLNGVQFLGL CISVTAMVLN FWGKKGGGGH HAGGPSGAGA PLHADTETEK LVREDSGDAH VELTVAPQTS KELV
|
| |