Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_2648 |
Symbol | CDH1 |
ID | 7200617 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 507721 |
End bp | 508698 |
Gene Length | 978 bp |
Protein Length | 326 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179858 |
Protein GI | 219118155 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAGCGTAAGC GACGCATTTC CAAAGTTCCT TTCAAGGTCC TCGATGCTCC AGCTCTCAAG GATGATTACT ACCTCAATCT CGTCGACTGG TCTTCGCAAA ATGTTTTGGC GGTTGCTCTC GGGTCATGCG TATACCTGTG GAGTGCTTGC AATTCAAAAG TGACCAAATT GTGCGATCTG TCATTGTCCA ATTCTTCTTC CTCAGCATCG GAAGATTCTG TTACGTCGGT TTCTTGGGCA CAGCGAGGTA CGCATTTGGC CGTTGGGACT AATCGCGGAG ACGTAGAGCT GTGGGATACA ACAAAGGGCA AGCGCATACG CTCCATGCCC GGTCATACGG CACGCGTCGG AACTTTGGCC TGGCACGGTC CGACGTTAGC GAGCGGCAGT CGCGATCGTC TTATTTTCTT GCGTGACGTG CGCGTACAAT CGGCCTACAC AGACCAGTTG GATTTTCACA AACAAGAAGT CTGCGGTCTT AAATGGTCCT TTGACGATCC AGGTTTGCTT GCATCGGGCG GCAACGACAA CGACTTGCAC GTGATCGATA GCCGTAATCC ATCGTCTCCC GTCCACAAGT TTTCCGAACA CCGTGCTGCG GTCAAGGCTA TTGCCTGGTC GCCGCATCAG CACGGGCTTT TGGCTAGCGG AGGAGGGACA TCCGATCGCT GCATTCGCTT TTGGAATACG CAAAGTGGTG TCGCTCTCCA CAAAATCGAT ACCGGTAGCC AAGTTTGCAA CATTGCGTGG TCTCGTAATT GCAACGAGAT TGTGAGCACG CATGGATATT CGCTGAACCA GATCATCGTT TGGAGGTATC CTAGTATGAG CAAGGTCGCA ACGTTGACGG GACATTCTTA TAGAGTTTTG TATTTGGCTA TGTCACCCGA CGGATCTACG GTAGTCACTG GCGCAGGTGA TGAGACTCTA CGCTTCTGGC AAATCTTCCC TGGTCCGCAA TCTGACAACA AGGACAAT
|
Protein sequence | KRKRRISKVP FKVLDAPALK DDYYLNLVDW SSQNVLAVAL GSCVYLWSAC NSKVTKLCDL SLSNSSSSAS EDSVTSVSWA QRGTHLAVGT NRGDVELWDT TKGKRIRSMP GHTARVGTLA WHGPTLASGS RDRLIFLRDV RVQSAYTDQL DFHKQEVCGL KWSFDDPGLL ASGGNDNDLH VIDSRNPSSP VHKFSEHRAA VKAIAWSPHQ HGLLASGGGT SDRCIRFWNT QSGVALHKID TGSQVCNIAW SRNCNEIVST HGYSLNQIIV WRYPSMSKVA TLTGHSYRVL YLAMSPDGST VVTGAGDETL RFWQIFPGPQ SDNKDN
|
| |