Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44688 |
Symbol | |
ID | 7197894 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 1266336 |
End bp | 1268236 |
Gene Length | 1901 bp |
Protein Length | 539 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178398 |
Protein GI | 219115205 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACCCC CGATCCCCAC AACAACAACT CGTGGTCGTC GTGAATCCGC ACCGGATTGG GTCTTGACGA ACTACGACGT TGACACTTCC GTTCCCCCGG ACGTCGAGAC GGAACTCCGG CGATTGCGCG TCCTCCAATC CTACGATATT CTCGATCGCG AGTGGGATAC AGCCTACGAT CGTTTGGCAC AAATCGCGGC ACGAGCCTTG CAAACCCCCA CGGCCTTGGT CGGTGTGGTC GATCTCGGTC GGTACTGGCG TGTGGCCCTG CACAACTGCA ACAGCAACAT TCACCATAGT AACAATCATC ATCACCGTGA TGCTCCACGG GAACTGCCAC GAAAGCAGGC CATAGCGTCA CACGTGATAC TCCAACGTCA GGGATACTTG GTAGTCATGA ATGTCGGGAA GGATTCGCGT TTTGTCGATC ATCCCGCCGT CCGCAACGGC ACTTTCCAAT TCTACGCCGG AGTCGTACTC CGCTCGCCGG AAGGGTACCC TCTCGGTGTG CTCGCAGTTA CCGACACGCA ACCCCGATTG CACGGCGTCT CCCACGAACA ATTGCAAACC CTGCGAGATT TGGCGGACGC CCTTGTCGAT TTGATGCACA CGCGGCGGCG ACCGAAAGGA GATACGGGGC CTGCCGCCAT CACCCGTCCT ACGGCTCCTG CTTCGACTAC TACTACGACT CCTCCGTCCA ATCCACCATC CCCCACCAAC CACACCAACA CCAAGGATAC CAACAACAGC TCATCCTGGG GAGTGCAACG GTCCGCACGC TTTTTGCGAC AGTACTTGGC CAAACTCAAT GACGACTCGA CGCTCGAAAA CGTCCTCACC AAGGACCAAC GCGAACTACT CCGCTCATCC TACGATGCCG CCGCCTTTCT CCACGGCTCC CTCTTACCTC CCGAACAACG ACAAGAGCTA GTCCGAGAAA ATCGTCCCGA TCAAGTCACC AGCGTACAGA TCGCTAGTCT GGTACAGAGT GTCGAACTCG CCATGGATGC CTTTCCCAAG ACCGTGCCCG TCCGCTACCA AATCGACGAC GAACAAATTC CACCCTTCGT CCTTCTCGTC GAACTGAAAA TATTTCGATC CTGCATCGCC CTCTTGACGA GTGCCTGTGA ACGCACCCGC CAGGGCGTCG TCTGCTTGCG GGTCTTTGTC CAACAACGCA CCGTCACGCA GAAAGAACTC GTCTTTGAGT GCGAAGATAC CGGACCCGAT GTGGAACTGG AACAGTACGA CGATTTATTC GACGCTCCCC TGGACCATAC CGCCGACGTT GGTGAAGAAG ACTGTATCCG GGCAGATCCC CATACGGGAA AAATTCGCAA GGCACTCCGG TGCGCCACCG TCCCCAACAG TCGCAGGGGA CATGGCGTAC ACGCTCTGGC CGACTTTATT GGTTCCATCG AGGGCGGTGA CTATGGATTC AGGCCCCGAG AAACCGAAGA CTTTGAACCA CATGGCACTG GAACGGGGTC CGTCTTTTGG TTCAGTATCG CCTTGCACAC CCCACCAGCG ACACGACACG GTACGGACGC AGTTGTGCCG CGACGACCCC AGCCTTGGAT CATCTCCAGT GCACGGGGGC ATGGATCCTT GGCAAAATAG TAGATTTTCC TATAGTTGCT ACCTAGGTAT TTAACATGTA AGTATTCCGG CTGGTCTTGT CGTCTGCAGT GGACTTATTC GTCATTTCAT ACCGTCACAG TCGCGTGCTC GGTAACAAAG CAGCTTGGTG TTTCAGAGCC CGTAGCAATG TTGCGTGCCG ATGCTTTTCT CCAATGTGGG CTTCGATGCG TTGACGAAGC TGGCGAAGTG ACAACGGGCG AGACCCGATC GCTGCTCTAC TTCCCTCTTC CAGATTGAGC AGAGTCCCAA AGAAGTCGGT C
|
Protein sequence | MAPPIPTTTT RGRRESAPDW VLTNYDVDTS VPPDVETELR RLRVLQSYDI LDREWDTAYD RLAQIAARAL QTPTALVGVV DLGRYWRVAL HNCNSNIHHS NNHHHRDAPR ELPRKQAIAS HVILQRQGYL VVMNVGKDSR FVDHPAVRNG TFQFYAGVVL RSPEGYPLGV LAVTDTQPRL HGVSHEQLQT LRDLADALVD LMHTRRRPKG DTGPAAITRP TAPASTTTTT PPSNPPSPTN HTNTKDTNNS SSWGVQRSAR FLRQYLAKLN DDSTLENVLT KDQRELLRSS YDAAAFLHGS LLPPEQRQEL VRENRPDQVT SVQIASLVQS VELAMDAFPK TVPVRYQIDD EQIPPFVLLV ELKIFRSCIA LLTSACERTR QGVVCLRVFV QQRTVTQKEL VFECEDTGPD VELEQYDDLF DAPLDHTADV GEEDCIRADP HTGKIRKALR CATVPNSRRG HGVHALADFI GSIEGGDYGF RPRETEDFEP HGTGTGSVFW FSIALHTPPA TRHGTDAVVP RRPQPWIISS ARGHGSLAK
|
| |