Gene Information |
Locus tag | PHATRDRAFT_22353 |
Symbol | |
ID | 7203736 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | + |
Start bp | 88247 |
End bp | 92631 |
Gene Length | 4385 bp |
Protein Length | 464 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182769 |
Protein GI | 219124982 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
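The coordinate fields above can be cross-checked against the reported gene length. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon; the record does not state this, but the reported length of 4385 bp is consistent with it. The function name `gene_length` is illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Return the feature length in bp for 1-based, inclusive coordinates.

    Assumption: both coordinates are inclusive, so the length is
    end - start + 1 rather than end - start.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information fields of this record.
length = gene_length(88247, 92631)
print(f"Gene length: {length} bp")
```

A 464 aa protein needs only 464 × 3 + 3 = 1395 coding bases including the stop codon, so most of the 4385 bp span lies outside the coding sequence, which is in line with the gene being flagged as spliced.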
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACTTGTTTC ATTTTCCGCT ATTAGGTTTT TAAGAGGCAC ATATTGAGCA GCCTTGCGTA ATCTTTTTCT TGCGCTACTT GGCACGGCAA CGATTGCCGA CACAGGTGGA AGCATTACTA CATTGGCAAT CCTTGGAGCA AAAATCGTCA TCGGCAGCAA AGGACAAGCC AGATGGTGTG TAGGAGACAT CGTCAAATAC TACTTCCACA ACGTCATTTG ATGCCAGACC TGCAGGCGCA ATTGGTCCGA GAGAGTTCCA CAAGTTGAGC CGAATTTCCA CATTACCACT TGGCATACAC TGGATGTAAT CCGGCGTGCC AGCATCCAAG GCAAGTTGTG TTGTCAAAGA ATAGCTCTGT CCTCCACCGG CCGTCGAACT CCAATCAATC CGATTTGGAT TCCAGGTGAA GTCGAACTTG TGACCACCTT GATCGTAGCT CCCCGACGAC CCCGAAAAAA AGCGGTACAT TTGTGGGGAC CCCGGAGGCT GGACAAGAAA TTGAGTGTCT GCGTTGGATT CTTCGTTCCA CCGGGAGATC TCCACATCCA CTTCGTGATT GTAATTTTCG TGAGTTGCGT AGTCTTCGGT GTCATCCCAA GTAAACATAC CCAAAACCAG CTCTTTCGGC AAAATATCAG ACAAGACGAC CCCCGATGCA TCCTTGACGG CAATTGACTT GACGGAAAAT TGGTACTTTC CATAGCCGTA GACTTCACTA TTGGGAAGAA GTATGCGGAC TTCGCTTGCT TGGCCGTTTT TGAACTGAAG CGAAAGATCG TCGCCATTCA CGACGATATT CTCTTCCACA AAACGATTGT TACCTGGTCC ACACAAGCCG TTTGAACGAT CTGTTTTGAC CTGTACAGTC AAATTATTCC ACATTGTAAA GGAAACGCGG TCGTTCTCAT CCGGAAAGCA ATAAAGGACG TGCGGCAAGT TGCTGGTAGG TGCGCCCGCG GGCGGTTCTG CCGCTGTGTT GCATTGGGAG GGATCACAGT CGCGGCATTC GCGCGGAAAT TCAAATCCAA CACGGCGGCA CGCATCCGCT TGCGTTGGGT AGACTCTGGC CATTGCAGTT GAGAGCCATT GGATGCGGGC ACCACACGTG TACTGCCCCG CGAAAGTGTT CCACGCTGTT TCTGAGCAAG ATTGGCAACC ACAAGCGATC GTCTGATCCG GATCATCAGC GGCCTCCTTG GAAGGGATCG AGGTGAGAAG ACCTGTAGGT ACTTCGGTGG GAAGACTCGC TATTGGACTT GTGGGGGGTG TGGTAGGACT TTCTGTCGGA AACAGAGTTG GCCGTTTGGT TGGGATCGGA GTGGGCCATT CTGTAGAAAC CGGAGTAGGC CATTCGGTAG GAACAGGAGT GGGGCGCTGG GTAGGAACGC TAGTGGGAAG CTTTGTTGGA CGAGGTGTTG GGCTGCGGGT GGGGGCAGGT TGCTGCATGG GAATCGGGAC GGAGGGATCG CAAGCGCCAC ACTCAATCGG AAACTCAAAA ACCACTCGTC GACACGCATC CACTTCAGTG GGATATGCAT TCGCCCGAGC TGTCTTCAGC CAGGTGATTC GTTCTCCGCA GGTATGCGCC CCTGCCTTGG TGTTCCAAAT CGTGTCTGTA CATGTCGACG TCGCACAACG AGCGTCGGGC TGTTCCGGAT CGCCAGGAGT TATGTTGTCG AGGCTGCAAA CCGAGGATAC GTCACACCTT CCACATTCTG TAGGAAACTC AAAGGCTACG CGCCGACACG CTTCGACTAG ATTTGGATAC TGATTGGCGC GAGCCGTCGC CAGCCAATGC 
ATTCGATCTC CACAAGAGTG TGCTCCGGCT TTGTTCTTCC AAACCGAAGT GCGGCAACCT AGGCATTCAC AATCGATGGA TTCGGCTACA TGCCTATGCA CAAGCGAAGG TGACAGTAAA GAATTACCGG TAACTTTTAG GCCAGGATCT TTCACGCCGT TGGACCAGGA CACTACCACG ACTGCAAATC CGACCAATAA CGCGGTCGTC GAAAGGAGTA ATGGTAGCCT TGCTTGCCCC GTGCGAGACT CGGATGTGAA AGGAACTTTC ATCTTGCGTG CTTGTTCTAA ACAATACACC TCTTCAACAT TCCCCATTGT CACCACAAGA TGCGACTGAC TTATTTAGAT TCACAACAGG CTGAAAAGTA TTGACTGATT GTGAGAGTGA ATCGGCTGGT CGTTTTGACT ATGACCTCGT TCACCGATGT GTTTCGCTTT TCGGAAAATC AAGAAAGCTA TGCCCCGCCC AATCCGTAGT CTTGTCAAAT ATGTTACCCG TTTAATTTTG TACTGGCTCT ACGTGTTCCT CCCACAATTG CCCTTAAACT ACAAAATATA GAGAGAAAGA AAAAGGTTGC GTTTCAAACG TCCAGAAGAG AATACGGTTT TTAGCTTACA GTTCGAATGG TCGGACATTG GAAGCACGGT GTGGGATGCG TTGAGACCTT ATTCGTGCAG TCCCGTATGT GTTGACGAAA CTCGGAGTAA TGCCAGTGGT TTCCGCAGGC AGGTACATGT AATTTGTTAA GTGTAACTGT AAATGAGACT CTAATCAAGC GTGCATGAGC TGGGGAGAGG TAAGCTCACT TTGTGGTACG GTTTGGGACG CCGTCGCTAC GAGAACTTTA ATACCATTGT TTGGTTCAGT GTTCATAGTA ATCACAATTC TGACCCTAAT GAAGCTTAAG TGAGACATCT CAAGGAAGAC TCTCTTCTAC AGAAGGTCCA ACGAGAGGTC AATCGATGTC TTCAAACGTG CCAACTACGA GTAATCGACA GTGAGCTCTG ACTATGGGGA AAACAGTCCT GGATATTGTC AAACACGAAC GGGCAAGATC TGTTTTCACT GTCAAATTCT TTTTCCTGAC GAACACAATG GACTGCGAGG AGAAAGATAA GCATGGAACA TTCCGCGGTT GGCTCAGGGA TTCTTCCAAG AAGATTCTAT CACCTCAATT GCTCCTTCCT GTACAGTAAC GTGCGCTTCC GAACGTGATA GTTATGTCGG TCACAGTTCC AGATTCAACT GTCAGTCATG ACAATTCTAC TATGAACAAT ACCGACGCTA ATCGAGACTA CGAAAAGGCT GAAGAAGAAC ACGAGTACGA TGCTGTCACG GCCCTCTTAT TGAACGTGAC GATCATCGGA TGCCTCTTGC TCGCGTATTA CGTCAAAAAG TTTCGAATTT ATACATTGCC CGAGTCCGCT GGCGCTTTGC TCGTCGGTAT TGCGGTGGGA GGCATCGCGC GTTTGTCCAC CGACAATCTA CAACTGTGGG AATTCAGTCC CGAAGTGTTT TTCTTTGTAC TGCTACCGCC GATTATTTTT GAGGCGGGCT ACTCGCTCAA GCGCAAAGAC TTTTTTGACA ACATTGGCGC CATAACTTTG TATGCTATGT TTGGAACAAT CATAAGCACC TTCGTTGTAG GAGGGCTTTC CTTCTATGCT GCCCGCCTTG GTCTCATACA AAATGTGGAT CACGAGAATC CGATGGAAAG CCTGCTTTTT GGTAGTCTCA TTTCCGCCGT TGATCCAGTC GCCACGTTAT CCATATTGGG TAGCGACGAG ATTGGTTGTT CACCTCAATT 
ATACTATCTG TGTTTTGGAG AAGCTGTCTT GAACGACGCC GTCGCCATTT CCTTATTCCA CGTCTTTGCT CGTTACTACA AGCAGGACGG CCCCGAATGG AGCGAGGCCG AAATCCCTTC AGCGCTGCTC TACTTTATGA CAACATCCAT TCTTTCCATT GGAGTTGGGG TGGGGCTTGG ACTACTGGCC AGCTTTTTGT TCAAGCACAC GGAGCTCTCG AGGTATCCGA ATCTTGAAAC ATCCTTGCTC TTTAGTTTCT GCTATCTCTG TTACGCAACC GGCGAGGCGG TAGGCCTATC CGGAATCATG GCCTTGTTTT TCCAAGGTGT GGTACTCTCA CACTATAATG CCTACAATCT CTCTCCGATG GCACACGTGG CGTCCGAACA AATTTTTGCA ACTCTAGCGA CCATTAGTGA GACGGCCGTA TTTTTGTACA TGGGCATGGG AGTTTTCACG GGCCGCTTTG AGAATTACGA TTTTCGCTTT GCTGTACTGG CCTTGATATT CTGCTGGCTA GGGAGAATGT TGAATATTTT CCCGTTGTCA TGGTTGGCGA ATCGCTGTCG AAATCGGTCC AACAAGATCC CGATCGAAAT GCAGTGTGTG TTGTGGTTTG CCGGTCTGCG AGGGGCAATC GCGTTCGCGC TGGCCATGAA CATGCCGGGG CCTAACAAAG ATGCGTATGC CTCTGCTACA TTATCGATAT GTATCTTTAC CACTGTTGTC TGCGGTGGTC TAACGGAGGG TATGCTCACT GCGTTTGATA TGAAA
|
Protein sequence | MDSATCLCTS EGDSKELPVT FRPGSFTPLD QDTTTTANPT NNAVYDAVTA LLLNVTIIGC LLLAYYVKKF RIYTLPESAG ALLVGIAVGG IARLSTDNLQ LWEFSPEVFF FVLLPPIIFE AGYSLKRKDF FDNIGAITLY AMFGTIISTF VVGGLSFYAA RLGLIQNVDH ENPMESLLFG SLISAVDPVA TLSILGSDEI GCSPQLYYLC FGEAVLNDAV AISLFHVFAR YYKQDGPEWS EAEIPSALLY FMTTSILSIG VGVGLGLLAS FLFKHTELSR YPNLETSLLF SFCYLCYATG EAVGLSGIMA LFFQGVVLSH YNAYNLSPMA HVASEQIFAT LATISETAVF LYMGMGVFTG RFENYDFRFA VLALIFCWLG RMLNIFPLSW LANRCRNRSN KIPIEMQCVL WFAGLRGAIA FALAMNMPGP NKDAYASATL SICIFTTVVC GGLTEGMLTA FDMK
|
| |
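The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before lengths or GC content can be compared with the reported fields. The following is a minimal sketch under that assumption; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence, used only as a small example.
fragment = clean_sequence("CACTTGTTTC ATTTTCCGCT")
print(len(fragment), gc_fraction(fragment))
```

Running the same two helpers on the complete gene sequence would give values to compare with the Gene Length and GC content fields, and `len(clean_sequence(...))` on the protein sequence would give a value to compare with the Protein Length field.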