Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45381 |
Symbol | |
ID | 7200004 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 1009981 |
End bp | 1011395 |
Gene Length | 1415 bp |
Protein Length | 451 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179335 |
Protein GI | 219117081 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.22287 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAAACTCACT GTCCATGCCG CCAGCCTTGC TGGATCGACA CATGGCAATG AATCTCTTCG GAAAAAGATA TTCCTTGGTT TGGTCACTGT CGATGGCAGC CGTTGATACG AAAGCTTACG TGTCCGAGAA CGTATGGACA CACCATCATC GATGTGCTCG GCTCTCTTCC AAGAAGGGCA GTCGAAACAT TATGAACGAG GCAGCCTCAC ACAAAATCGG ACACAGAAAT TCTGTTCCCG CCTACTCATT GGCAGGTGCG TTTGTCACCA GATGGAAGCG AGGATGCAAT ACTCATGCAG ACTGTTGTAG TCTAGAACTC ATTCCCATTA CCAAGAATCG ACAGACATCG TATCTCGATC CAGATTATAC GCATTCAGCG GGATCCCAAG CCTGGGCGGC TTTGCACGAA CGTTTTACAG ATAAGCAGGC CCGTGATGCG CTTTTTCAGC CCGAAATTGT TCTACAGTGT CGCGAGGTTG GGCACAGGAG CCTCCGTGTA TCACTTTCCA ACAAAACCGT TGACTGTGAA ATGGATATTG CTTTGGTTGG TGTTTTGGCG CGTGTACTTG TCCAATGGAC ATTGGAACCA GACGGTGGCA AACCTACTGG CCAATCATGG ACCATTACTC TTACTATGGA AGAATCCAGT CTGACTTTGG TAAACGGAGT GGACGACGAA AATATGAAGG AATTGTTCGC TAAGTACCTG GACCTATCCA GCTCAAGTGT CGAAATTGTA GAAATGATGG ACCGGGACGG CAAGATACTT GGAAAAGTCC CACGAAATCT TGTGCATGAA TATAATTTGC TGCATCGAGG CGTCGGCGTT TTCATTACAC GTGACGTGCC CCTCCAGCTC CCGATCCACG GTGCGAGAAA CTCCAGTCCA AGTCCAACTC ACCAACCCGA CTTGTATTGC CATCGCCGAA CCTCTACCAA ACGGATTTTT CCTGATCTCT ACGATATGTT TGTCGGTGGC GTTGCCCTGG CTGGAGAAGA TTCCCGCAGG ACTGCGTTGC GCGAAGTCGG CGAGGAATTG GGGCTCGCGC AAGGGAACAT TGGCGATGAG GCCATCTTAA CGTGCGTCGT TTGTACGGGA TACAACCGAT GCGTCGTGGA TCTATATTGC TATGTAATGA ACACAATGGC CGAAAGGGTA TCGTGGCAAG CGGAAGAAGT GGCGTGGGGG GATTTTGTTC CTTTCAACGC TGTTCAAGCG TCAGTCGATT TATCGATTCA ACGATTAGTA TCCGACGGAT CTTGGCCCGG ACGCTATCCA CCAATCCAAT CGAGCTATAA TGGTGTCTTT CCCAAAGACG AATTTTCATC GATCAGAAAT TGGGACTCCT GGGACTACGT TCCCGACGGC TTGCTCGTTT GGGAAGCGTG GTTGCGCTAC CTGAAAGAGG ATTGA
|
Protein sequence | MPPALLDRHM AMNLFGKRYS LVWSLSMAAV DTKAYVSENV WTHHHRCARL SSKKGSRNIM NEAASHKIGH RNSVPAYSLA DCCSLELIPI TKNRQTSYLD PDYTHSAGSQ AWAALHERFT DKQARDALFQ PEIVLQCREV GHRSLRVSLS NKTVDCEMDI ALVGVLARVL VQWTLEPDGG KPTGQSWTIT LTMEESSLTL VNGVDDENMK ELFAKYLDLS SSSVEIVEMM DRDGKILGKV PRNLVHEYNL LHRGVGVFIT RDVPLQLPIH GARNSSPSPT HQPDLYCHRR TSTKRIFPDL YDMFVGGVAL AGEDSRRTAL REVGEELGLA QGNIGDEAIL TCVVCTGYNR CVVDLYCYVM NTMAERVSWQ AEEVAWGDFV PFNAVQASVD LSIQRLVSDG SWPGRYPPIQ SSYNGVFPKD EFSSIRNWDS WDYVPDGLLV WEAWLRYLKE D
|
| |