Gene Information |
Locus tag | PHATRDRAFT_54381 |
Symbol | H1 |
ID | 7200154 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 762091 |
End bp | 763262 |
Gene Length | 1172 bp |
Protein Length | 229 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | histone linker H1 |
Protein accession | XP_002179285 |
Protein GI | 219116981 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGTCG GGAAGCTGTT GTTACGTTGT CTGACTGTGA GTATGACAAG CCCGGTCTGC GCAGTTCCAG AAACAGTGTC AATTATGCTG TCGTCTGATT TCTGTCCTAC GACGACGCCG CCCCGGCGTG CGTAGTGCTC ACATTCAAAA GGAATCCATT TTGTGGAACC AGTACTGTAC TTTTCTGTAT GTAGCATTTA CGAAGCGTTT TGTACTAGAC TGAAATCCGG CGCACCCTGT GTTTCGAACT GGGCCCGGAC GCTGGACCCC GACAGAGCGC ACACATCTCT CTCTCAACAG GTACGAGACG CTTTCCTCAC CATGACGGCG ATTCCGCACC GATTCATCGG CGTTCCCTTT CTCTCTGTTA GTGATCTAGG TTATCCTTCC TCTCCGCATG TCTTTCTAAC GATCGTCTTT CTATTATCTG CCAATTCATA GCCGAGAATC GTATAATCGT AGTTTCCATC ATGTCGTACA AAGCCGGTAT CGAAGAAGCC ATTACGGACC TTAAGGACCG CACGGGTTCG AGTATGATTG CGATTCGCAA GTACATGCAA TCCAAACTTC CCGCCGATAA GAAGTGGCAG AACGCTGTCT TTCTGTCCAG TCTCAAGAGC GGAGTCGCCG CTGGTGACTT TGTTCAGGTC AAAAACTCGT ACAAGATCTC GGCCGACTAC AAGAAAAAGA AGGCTGCCGC GGTCAAGAAA GCTGCTGCTC CCAAGAAGGT CGCCCCGAAG AAGAAGGCGC CTACCGCGAA AAAGAGTACG GCCGCGAAGA AGAAGACCAC GGCACCCAAA AAGACCACCG CGCCGAAGAA AAAGGCTCCG ACCGCCAAGA AGGCCACTAC GGCACCCAAG AAGACTGCCA CAAAGAAGGC CACCGCGCCG AAGAAAAAGG CAGCCACCGC CAAGAAGCCG GCGGCGCCCA AGGCGACAAA GCCGAAGGCT GCTCCCAAAA AGAAGGCAGC CTCTAAGAAG GACGCTGCTC CCAAGCCAGC TGAAACCAAA TAAATATTTG GCTTGACTGA GAGCTTGCAC TCTAGCTGTG CCTGACGCAT CCAGTGTCAG TATAGCAACC AAAACCGGAT TTTCTAAAAT CCACCCAACC CACAAAGGCA GTCAATTGTA TCCTCTTATT GTACAATCAA GCAGTGAACT TCTTTTTGCG TA
|
Protein sequence | MTVGKLLLRC LTTEIRRTLC FELGPDAGPR QSAHISLSTA ENRIIVVSIM SYKAGIEEAI TDLKDRTGSS MIAIRKYMQS KLPADKKWQN AVFLSSLKSG VAAGDFVQVK NSYKISADYK KKKAAAVKKA AAPKKVAPKK KAPTAKKSTA AKKKTTAPKK TTAPKKKAPT AKKATTAPKK TATKKATAPK KKAATAKKPA APKATKPKAA PKKKAASKKD AAPKPAETK
|
| |
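The numeric fields in the Gene Information table can be cross-checked against each other. The sketch below is a minimal illustration, not part of the original record. It assumes that Start bp and End bp are 1-based, inclusive coordinates, and that the protein is followed by a single stop codon. The function names (`span_length`, `min_coding_bp`, `gc_fraction`) are hypothetical helpers introduced here.

```python
# Values copied from the Gene Information table above.
START_BP = 762091
END_BP = 763262
GENE_LENGTH_BP = 1172
PROTEIN_LENGTH_AA = 229


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def min_coding_bp(protein_length: int) -> int:
    """Bases needed to encode the protein plus one stop codon (assumption)."""
    return 3 * (protein_length + 1)


def gc_fraction(seq: str) -> float:
    """Fraction of G/C bases, ignoring the spaces used for 10-base grouping."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # The inclusive span should match the reported gene length.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # Bases of the gene not accounted for by coding sequence.
    print(GENE_LENGTH_BP - min_coding_bp(PROTEIN_LENGTH_AA))
```

Under these assumptions the inclusive span equals the reported 1172 bp, and the coding requirement (690 bp) is shorter than the gene, which is consistent with the record marking the gene as spliced. To compare against the reported GC content, pass the Gene sequence value to `gc_fraction`; the spaces between 10-base groups are stripped automatically.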