Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_17779 |
Symbol | NAP |
ID | 7196847 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1901024 |
End bp | 1902364 |
Gene Length | 1341 bp |
Protein Length | 318 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | nucleosome assembly protein |
Protein accession | XP_002176865 |
Protein GI | 219110227 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.184472 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAGGAACGTG CCCCCCTGTA CTACATCGCG ACTAATTCGT GGCTAACACG ATGGTAACGA ACGCGTTCAA AGGAATTGGT AAATCCCTCT TTGTCCTTTA CAGTTAGTCA CCACACAGAG ATTTATATGA GCTTTTAAAA TGACGGACAT TAACAACAAC GAGCAAGATC CTCTGCAAAA TATCGGTGTT GGGGACGAAG ACTCAGACGC AGAGGACGAT GATACCGCTG AAGACAACCC GATGGCCGAT CTCCCCGATT ACGTTGCACA TCGCGTGGAA AAACTGCGTG GTCTGAATGA GAAGCGAGAA GAGATCATGA AAGATTACCT CACGGAGCGA GCCGACTTGG AACGCAAATA CGCGGTTATT TTAAACCCGC TCTATGAAGA ACGTGCGACG ATTGTGAACG GTGAGAAGGA TGATGAAATA AGCGCCGAAG TCACTCGCCG CGGAGATAGC TCTAGCGCAC ACCATAATGA TGCGGAACCG TACGTAAAAG GCATTCCACA ATTTTGGCTG AGTACTATGA GCCAAGAAGA GACGATCAGT GAGAGTCTGA CAGAAGAAGA CGTTGACTGT TTGGAACACC TCGAAAACAT CACATGTGAA GACTTTGCTG ATGGAAAAGG ATTCGTTCTT CGTTTTCATT TTGCTCCTAA CGACTATTTT CATGATGCTG TATTGGTGAA GACATACGAT GTTCCGAATC TTTTGCTTTC CGATGAACCC ATCTTGAAGA ACGTCCACGG ATGCAAGATT CAGTGGAAAG AAGGGAAATC TCTGACACAT CGCCAGATCA AGAAGAAACA GCGCGGAAAG GGCAAGAATG CTGGTCAGGT ACGCACCATC TCCAAAATGG AGAAGAAGGA ATCGTTCTTC CATTGGTTCG AGCCGCCAGC AATGCCAAAG ATGGATGAAG TTGACGAGGA ACAGGCGGAC GAGCTAGAAG AATTTTTCGA TTCAGATTAC GAAATAGCGC AGGCGTTTCG GTCACATGTT ATTCCTTCAG CCGTTCTTTG GTTCACCGGA GAAGTAAGTT CTACGAACAC GAACTTTTCG TCCAGATTGC TGCACCTCTA ATATTTTTTT TCCCTCTAGA TTATGGCTCA GGAAATGATA CACGCAATCG AAGATCTTAG AGAATCAGAG GAAACAGATT GATGAGGATT GAATGTGTAA TTGGAATTGA CGGATCGGCA AGATTACAAT GATTAATCGA CAAAGACTGC ATAGTTTTTG ATCGAAGGAA GAAAAATATA TATGCAACGC TTGCTACTCA TATGTTAGAT ACTGGTATCT CACTGTGAAT TCGATAGATT CCTGTCTTGG TGGGCCCAGT C
|
Protein sequence | MTDINNNEQD PLQNIGVGDE DSDAEDDDTA EDNPMADLPD YVAHRVEKLR GLNEKREEIM KDYLTERADL ERKYAVILNP LYEERATIVN GEKDDEISAE VTRRGDSSSA HHNDAEPYVK GIPQFWLSTM SQEETISESL TEEDVDCLEH LENITCEDFA DGKGFVLRFH FAPNDYFHDA VLVKTYDVPN LLLSDEPILK NVHGCKIQWK EGKSLTHRQI KKKQRGKGKN AGQVRTISKM EKKESFFHWF EPPAMPKMDE VDEEQADELE EFFDSDYEIA QAFRSHVIPS AVLWFTGEIM AQEMIHAIED LRESEETD
|
| |