Gene Information |
Locus tag | PHATRDRAFT_49046 |
Symbol | |
ID | 7195297 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 424921 |
End bp | 426461 |
Gene Length | 1541 bp |
Protein Length | 454 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183606 |
Protein GI | 219126736 |
COG category | |
COG ID | |
TIGRFAM ID | |
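The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration and not part of the record: the helper names `gene_length` and `coding_length` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence carries one stop codon in addition to the 454 residues.

```python
# Consistency check for the record's coordinate and length fields.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and one stop codon follows the last amino acid.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Number of coding bases needed for a protein of the given length."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the Gene Information table.
START_BP = 424921
END_BP = 426461
LISTED_GENE_LENGTH_BP = 1541
PROTEIN_LENGTH_AA = 454

span_bp = gene_length(START_BP, END_BP)
cds_bp = coding_length(PROTEIN_LENGTH_AA)
non_coding_bp = span_bp - cds_bp  # bases in the span that are not codons

print(f"span from coordinates: {span_bp} bp (listed: {LISTED_GENE_LENGTH_BP} bp)")
print(f"coding bases incl. stop: {cds_bp} bp")
print(f"remaining non-coding bases: {non_coding_bp} bp")
```

Under these assumptions the span computed from the coordinates matches the listed Gene Length, and the span is longer than the coding bases alone. That is consistent with the record marking the gene as spliced, although the record does not say how the remaining bases divide between introns and untranslated regions.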
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.208062 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ACGACCGAAC GTGTGTGAAC GGAATGTAAC TGCTAGTACA TGCCTCCACT CTTTCTCGCC AGGTGTTGTA GCTAGGAGTA GAAGATAGCA AATGACAGCT TTGCATGGCT GGAAAACTAG CTGGCGGGGG GTAATACTTC ACGTAGCTCT GCTGTTGGAA AGCAACGCCT GGTTGGCAAC GTACTCGACC CGTTGTCAAA ACCGAGCCAT TGCCTGCTTC AACCGCGACT GTAAGTCTCA ATCGCGTATT TGCACTCAAT CTGTTGGCTC CTGGAGTATT TTAGGGGAAA AGTCAACCTC CATAGACCAA GTTTCTTGGG TAACGAACGA GCACGTCTCT AGCAGCGGAA CTGCTCCTGG TACTCCTCTC GAATGGTACA ATGATCTTAT TGATCGCCGG TTGGAAGAGG CTGATGAATT TCCGATTGAG CCGCAAGAGT CGCTCGAAGA TGGAGAAAAT GTTCCTTTAA TTGGATTTCT TGATCTCTTA CACATGGCCT TTGCGGCGGC GACATCGGCC ACAATGCGGG CTAAACGCCG AGATTTTGTT AGCACGGTAC AGAACGCGGG TGGATACTGC ATTGTTAAAT TGGAGGATCC GGAACTGTCA GTTGTCGAAG GAATGTGGGA TGGCATTGAC GAAATTTTTG CGAGGCCACG GCACAAAGAT ACGGCAGTGA CAGGAGCTAC GCAACTCGAA TTACGCCATC AAACACTTAC TCGCGAGGAC ACGACTGAAT TGCACCAAAA CAGTGGGTAC AAATTCGTTC AAATTTCCCT GATAGACAAC AGTATTCCCT ACTTGGCGGA CAGCGTTGGA AAGCAATCCG CCGAGCAAGC TGGACGGGTA TATCAGCTTT TTTCGCTGCT GGCCAAGGCA TTTGCGTCGG TATCCTACGC TGGATCATCC ACGGAATCAG AGCACGTTGC AAATAAGGGC GATCCCAAGC AGGCGTCCAA TTTGCTTACA AAAATGCTAG ACGACCCAGG CAAGCCTTTC AGCGGGACTT TCCATCGTTT GGCTAAGTAC GTACCGGTCC TGGAAGAGGA AGAATGGAGC GAATCCCTCC GATCCCATTG CGATTGGACG CTCGCGACTC CCATTCCAGT GTCGGCGACG GCTGGACTGG AAATTTTCAA TCCGACTAGT CAAACTTGGA TTCGACCCGA GCAAACAGCT AAATCTCTAT GGGAGCACGA AAACGGAGAG CATAGACCCA CAGACGATAA ACGATGGCAC AGTCGCTACG TGATTGTAAT GACCGGGAAG TGGCTTGAGT TGATCAGCAA GGGAGAGATC TCGTCTTGTA TTCACAGAGT AGTGTCGGTA CGCGGAGAAA ATTCTCGTTT GAGTGCTCCT TTTTTCATGA GACCAAGGCC ACAAGTTTTT TCGGATGCCG AGGCCGTTCA ATTCAAAAAT AGGACATCTG GTAATATCGA GTCGATGCAA GCCATTGGTG AGTATTTGTT GCAAAAGTAC GGCACTGACA GTGAGTATAT TGAAAGATGA AAGCAAGTAT TCAAACGAAG GTTTTTCTAC G
Protein sequence | MTALHGWKTS WRGVILHVAL LLESNAWLAT YSTRCQNRAI ACFNRDWEKS TSIDQVSWVT NEHVSSSGTA PGTPLEWYND LIDRRLEEAD EFPIEPQESL EDGENVPLIG FLDLLHMAFA AATSATMRAK RRDFVSTVQN AGGYCIVKLE DPELSVVEGM WDGIDEIFAR PRHKDTAVTG ATQLELRHQT LTREDTTELH QNSGYKFVQI SLIDNSIPYL ADSVGKQSAE QAGRVYQLFS LLAKAFASVS YAGSSTESEH VANKGDPKQA SNLLTKMLDD PGKPFSGTFH RLAKYVPVLE EEEWSESLRS HCDWTLATPI PVSATAGLEI FNPTSQTWIR PEQTAKSLWE HENGEHRPTD DKRWHSRYVI VMTGKWLELI SKGEISSCIH RVVSVRGENS RLSAPFFMRP RPQVFSDAEA VQFKNRTSGN IESMQAIGEY LLQKYGTDSE YIER