Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44942 |
Symbol | |
ID | 7199841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 707669 |
End bp | 709485 |
Gene Length | 1817 bp |
Protein Length | 556 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178831 |
Protein GI | 219116072 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGACATTTTC GTTAATTCCC AAAAAAGCAG CATTGACGAT ACTATACTAG CGTTGTCTAT TGTCTCTGGA AATTTTATTC CAACACCAGA ATATCACCCT CCTCAAGAAC GGAGTCATAC ACGTCACGAC AGTGCGACTA GGAAATCATG GCTACGATCA ACGCATCTAC CAAAATCAGC GTTCCGAAAA AGAAGCCCTC CATCGCGGTG GGCGCGGCCA AATCGGTCAT CGGGGCCAAG AAACCCAGCG GCACCGCGGC GCGCCGCGGC TCCGTGGCCC CGCCACTGAA ACCCGCGACA CTCTCGAACG CGTCCAATTT GGCGCGTTTG CAGGAGTCGG ACTCGGCGGG ACAAATTGCT CACGTGCGAG ACTTGTTCCG ACAAGCGCTG GGGGAACACA CCACGTTGAA TTTGGGGTCC CCCAAAGCCG CCAGTACTAC TCGCGATCGG GCGGGAGCTG CCGCCGATGT TGCCGTTGCT GCTAAGACTC TGGGCGTGCG CTGGATACTC AAAAAGTGCG GCATTGTTGA TGAAATGCAG CGTATGCTGT TTCCCGGCGG AATTGAAGAG TTCTTGGCGC GCAATGCGGA CAACGAAAGT GAAATGAACG GGAATGGTAG CACCCCCATG ACGGGAGGTC TCAAAGCCAG TGCTTCGGCG GTAAGTCTCG CGAGTATGGA CGAAGTGACG ACCGTCACCT CGGCTAATTC GCTAGGGACG GATACGAAAC GTGGAAAAAC GACACCGGCC AACGCCCGTG AAGGGTGTTT ATTGGTGATT CGAGCCTTGT GCCAAATTGT CGGTAAGGCG GCGGAATCGT TCGTGGTAGG GGCCTTTTTG GCCGCCGCTT TGGATGAATG CGCGAGTTCG TCCGGTGCCA TTCGGGAAGC GGCCGAGGAC GCGTCGACGG CGATTGTAGC CTTGGCCAAT CCATGGGCCT TTCGCACCGT CCTGTGTCCG TTGCTGCTGC AATCGCTTAA GTCAACCGAA TGGCGGACCA AGGCGTGCAC GTTGGAACGA TTGGAGCAGT GTGCCTCGAC CGCATCCGCA CAAGTGTACA AAATGATTCC TACCCTGATT CCTGCCGTGG GGAACCAAGT GTGGGATACC AAGGCTCAAG TTTCGAAAGG TTCCCGCGCG GCACTGTTGG CCATTTGTAA CACAAACAAC AACAGGGACA TCAAAAAGAC CATTCCTGCA ATTGTTAACG CCATGTGCAA GCCTTCTGAA ACCAACAAGG CCGTGTCGGA GCTCATGGGC ACGACCTTTG TTGTCCCCGT GGACGCTTCC ACGTTGGCCA TGTTGTGTCC GATTCTAGCC CGAGCATTGA AGGAAAAGCT CGCCATACAC AAGCGTGCCG CTTGCATTGT CATTTCCAAC ATGAGCAAGC TGGTGGAAAC GCCCGATGCG GTGGCTCCCT TCGGCTCCTT GCTCGTGCCG GAATTGCAAA AAGTGTCGCA CAATGTTCAG TTTGAAGAAA TTCGGGACGA AGCACTCAAA GCGTTGGCCA ATCTGACCAA GGCTTTGGGA GACGCATACA AACTGACCGA TGAAGATGAC CAAGCGGCGG AAATGGCCAA CGAAAAGGCC GAAGTAGAGG CGGAACAGAA ACGTATCGAA GACGTGCGGG AAGCAGAACG ACTGAAAGAA GAAGCCGTAC AGAAAAGGGA GGAAGAGGAG CGCAAAAAGT TCAAGGAAGC CATGGATGCA CAGCGGGAGC TGACTCGCTT GGAGGCGGAA GAAGCGGAAC GTCAACGCTC GGAAGAAGAA ACCAAGCGCG AAGCCGCAAG ATTGAGTACG AAGGGCGGTA CTGGGAA
|
Protein sequence | MATINASTKI SVPKKKPSIA VGAAKSVIGA KKPSGTAARR GSVAPPLKPA TLSNASNLAR LQESDSAGQI AHVRDLFRQA LGEHTTLNLG SPKAASTTRD RAGAAADVAV AAKTLGVRWI LKKCGIVDEM QRMLFPGGIE EFLARNADNE SEMNGNGSTP MTGGLKASAS AVSLASMDEV TTVTSANSLG TDTKRGKTTP ANAREGCLLV IRALCQIVGK AAESFVVGAF LAAALDECAS SSGAIREAAE DASTAIVALA NPWAFRTVLC PLLLQSLKST EWRTKACTLE RLEQCASTAS AQVYKMIPTL IPAVGNQVWD TKAQVSKGSR AALLAICNTN NNRDIKKTIP AIVNAMCKPS ETNKAVSELM GTTFVVPVDA STLAMLCPIL ARALKEKLAI HKRAACIVIS NMSKLVETPD AVAPFGSLLV PELQKVSHNV QFEEIRDEAL KALANLTKAL GDAYKLTDED DQAAEMANEK AEVEAEQKRI EDVREAERLK EEAVQKREEE ERKKFKEAMD AQRELTRLEA EEAERQRSEE ETKREAARLS TKGGTG
|
| |