Gene Information |
Locus tag | PHATRDRAFT_45034 |
Symbol | |
ID | 7199539 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 1068429 |
End bp | 1070197 |
Gene Length | 1769 bp |
Protein Length | 510 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178906 |
Protein GI | 219116222 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
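The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes Start bp and End bp are 1-based and inclusive (the record does not say so, but that assumption reproduces the listed Gene Length), and the function names `gene_length` and `min_coding_bp` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the Gene Information table above.
START_BP = 1068429
END_BP = 1070197
PROTEIN_LENGTH_AA = 510


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def min_coding_bp(protein_length_aa: int) -> int:
    """Bases needed to encode the protein: three per residue, stop codon excluded."""
    return 3 * protein_length_aa


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    coding = min_coding_bp(PROTEIN_LENGTH_AA)
    print(f"gene length: {length} bp")
    print(f"coding bases for {PROTEIN_LENGTH_AA} aa: {coding} bp")
    print(f"remainder not accounted for by codons: {length - coding} bp")
```

Under these assumptions the computed span matches the listed 1769 bp, and the 510 aa protein needs fewer bases than the gene spans. That is consistent with the "Is gene spliced | Yes" entry, although the record itself does not say what the remaining bases consist of.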
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000346108 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAACCG GCAACCCCGC TTCTGTGAGA TTCCATTCGC CAGCCCTGCG GGATCCAACG CCTAGTAGTT TCCACACGAA CTGTAAGTAC CGGTCCATAC CATACTGTTC TGTGTCTAAC GAACCCGGAC TTAAATACGA CCGTTGGCAG TCCGTACAGC CGGCGATGCG ATGAGCACCA GCAAGGGTGC GGTAAATGTC CGTGACGCTG GGGGACGATT ACGCCAAAGA CTTTCTTACT GTTAACAGGA AACAACCAAC GCCTCGAAGG CTTTTGGTTG CTTCCAACAA TCGATCGAAG AACAAAGTTA CCCTTCCACA CTCTCTCTCA CGAGCACTCT CTCTCGTCCT CAATCAAAGG GCTCCCATAG TATGGTGCGA ACGAGACCTC TTGTATCGTT GTCGTTGTCG TTGTTACCGA CGCTAGTAAG GCTCGGAACG ACGAGCGTGA CCGCCGACGC TTCCGCAGAC TGCACCGTGC AGCTCGACGG CACGTGTGTG TCTTCGCTGC CACGACTCGA ATGCGGTGTG TACATGGCGC CGAGTACTTT GGGTGAAGAC ACGAACCTGG GAATCTACAC GGGCAAGGCA CTCCGGACGG ACGACGTTGT CAACTTTCCA GAAATCGCCA TTCCGCTGCT CTTCCGCGAA TGGGGCGAGC ACGTCGAAGG GATTGCCGAC GGTGCACTCT GGGATCGCTA TATTTGGGAA GGCGAAGTCG CCGATTTGGA AACCTACACC GATCGCAACC GCATCGACAA CCGCGCCGTC TTTGTGCCGG GCGTTGGCTG CACCGTCAAC TCGGTACTCG ACATGAACAA CATTGAGAGC ACGCACGGAT CCACCTACGA TACCGCTGGC TTGCACCGAT CGCGGGATCC CGGTACCGGT GCTTTTTCGC CCTACCACAG CGCGCTTACT ACCGCCGTCA CCGATATTGC CCCCGGCGCC GAGCTGTTCG CCTCCTACGG GGACTACTGG GTACCGTCCA TTCCGGGAGT GCAAGTCACA CTCGATGAAG TCCTCGATGT GACCGAAGAC TTCTTGCAGA ACGATTTGTA CGAATTTGTG CAGTCGCACC GTGACGCCGA TGCGCTCACT CCCGACGTCA AGGAAGCGCT CTTTGCTTTT GTCAAGGACT TTCCCCTGCC CAATCAGCCT TTTTCTAATC TGCCCCGCAA CGTTCCCTGG GCCGACGTCG AACGGGCTAT CCACGAAGCC GGCACACACC GCAAACACAC TGATACGACA ACTGACAATG TTAGCGGCGA GTCCTCCGTT GCACGACAGT TCATTCGGGA ACAGTCGATC CGTGATCTAG CCTGGTTAGA CGAACACGGC TACTGTCAGG ATCACCTGCG ACCAGGTCGC TCCACACTAC CCCAAGCCGG ACGCGGGGCT TTTGCTACCC GTAATCTCCC GGCCGGAAGC GTAGTCGGCT ACGCCCCGCT CATTCATATT GGACTACACG GGCGTGAAGT TTTACGTATC ACTTATCCTC CCAGCGAACA CGATCATCAC GTTGGGAACG ATGACAACGA CGGTGAGAAC GGCACGACAC CGCGCACCAG TTACGATCTT GTCCTCAACT ACAGCTTTGC CCATCGCAAT TCAACCGTCA TACTTACACC CTACGGCGGA ATGGTAAACT ATATCAATCA CGGTAGTACA ACTAGCGGGC GTGCCAACGT ACAGGTGCGG TGGCCGGACA AATCCTTGAT CGCTCACGTC CCATCCTGGC TGGAACAAGA CCCCATATTC TTGTCGGAA
|
Protein sequence | MRTGNPASVR FHSPALRDPT PSSFHTNFRT AGDAMSTSKG AVNETTNASK AFGCFQQSIE EQSYPSTLSL TSTLSRPQSK GSHSMVRTRP LVSLSLSLLP TLVRLGTTSV TADASADCTV QLDGTCVSSL PRLECGVYMA PSTLGEDTNL GIYTGKALRT DDVVNFPEIA IPLLFREWGE HVEGIADGAL WDRYIWEGEV ADLETYTDRN RIDNRAVFVP GVGCTVNSVL DMNNIESTHG STYDTAGLHR SRDPGTGAFS PYHSALTTAV TDIAPGAELF ASYGDYWVPS IPGVQVTLDE VLDVTEDFLQ NDLYEFVQSH RDADALTPDV KEALFAFVKD FPLPNQPFSN LPRNVPWADV ERAIHEAGTH RKHTDTTTDN VSGESSVARQ FIREQSIRDL AWLDEHGYCQ DHLRPGRSTL PQAGRGAFAT RNLPAGSVVG YAPLIHIGLH GREVLRITYP PSEHDHHVGN DDNDGENGTT PRTSYDLVLN YSFAHRNSTV ILTPYGGMYN
|
| |