Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46461 |
Symbol | |
ID | 7201558 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 407373 |
End bp | 409153 |
Gene Length | 1781 bp |
Protein Length | 426 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181002 |
Protein GI | 219120531 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.5833 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAGTTCACC GATTTTTACA GCCCGTTCCA TAGCAAAGCA GTTGCGGTTT TTTAGTAGAG ACGCAGTGCC TTACTGTGTC TGTACCTTTA TTGATAGGTA GCCGGTGGTC TAGTAGTGTT CGGCAGTAGA GACTCACTCG AGCACTGAAC GCCAGGGTTT TTTCGAGGAC CGACATACAC ACGCATACCA GTCTGACTGC ACTACCGTTT GCGTTTGGAA CTCTTTACCG CCCGACGGTC ATGTCCTCCA ACGAGTCCAC GAAACTGCTC TGGACGGCTG TAGGTGCGGC AATCGCCGGA GCTGGTGTGG CCGTCGCCGT CCTGAAGTAC TCCCCAGCCC GATATTCTGC TCCGAAGGAC GAGGAACCCT CCTTTCCTCC CCTGCATCAG AAACGCACTT CGTTCATTTA CGAAAACAAC GCGTCCAGCA GCAACAGCGT GATCTTCCCG CACAATCACG AAGAACGGAT GCGTCGACAG ATTACCGCCC GTGCCGCCGT CGAAGAAGAC AACTTCATGC CGCGTGACTC GGTCACGGTA AGGGTACCTG CGACTTCGGC CAATATGGGA CCGGGATGTA AGTACTATTT CTTGGTGTGT AGGAATGTGA ATGTGTGTGT ATAGTGTATC GTGTTGCCGC AGTGAAACTG ACTGCAGCAT TCTCATTCCG CCAGACGATA CGATTGGCAT GGCCGTGGAT TTGTGGTCTG AAGTCACGGT GGAGCGTGCC GACACGTTCG AAATTACGGC CGAAGGCGAA GGAGCCACCG AAATGCCCAA GGACGAAACG AATTACATGG TGATTGGTTG TAAGGCTGCG TTTGAAGCCG CTAACAAACC CTTGCCAATT CTCAAATATC ACGTCTTTAG CCGCGTCCCC TTTGCACGTG GACTCGGTTC GTCTAGTGCC GCCATTGTTG CCGGGATTAT TGCCGGACTC ATTCTGGCCG GTCACCGATT GCCGTGCTGG GGGTCGGAGG CCTTGCTACA AATTGCGGCA GGGATCGAGG TACGTGCACG TTTGTCGGCG GCACCCGATA CTACAAGGAG AATCTTTTCG TTTTTTGCGG CTGGTATCTC ATCGTCTCTC TTATGTTGCT TTTCTGTTAG GGACATCCCG ATAACGTTGC TCCCGTCATC TACGGTGGCA TTCAGGTGGG CATCCACAAT GGTACTCGGT GGGTGACGGA GCGAGTTCCC TGTCCTGCCG GCTTACAGCT CGTCATGTTC ATTCCCGACT TTATCGGTAA AACCTCCGAC GCCCGTAGCG TATTGCAGCC AACCATCAGC CGAGAAGACG CCGCCTACAA CATATCCCGG GTTGCCTTTC TAGTCCACGC CTTGTGTGTA GGCAATTTGG ACAATTTGAA GTGGGGTGTC GAAGATCGAC TCCACCAGCC CCAGCGCGGC GGCAAACTGT ACAAGTACCT GTATCCCATG ATTGAAGCCG CGGAAAATGC CGGGGCTGCT TGTGCGTACT TGAGTGGGGC CGGTCCCACC GTCATGGCTA TTACCGCCGG AGCCAGCGGC GACATTTTTG CCCAACGTGA AAAGGAACGG TCAGATCTGG CCGTCGCCAA GGCCATGCGA CAAGCGGCCA AAGAATTCGG TGTACAGGGT CAAGTGGTTG TCACCAACGT CTCGAACGAA GGGGCGCGCG TGGTCAAGGT TGTCCCGCCG TTTAGTACGG GAGGCATTAC GTACCGTGAC AACATTTAAT GTAGGGATTT CCGATACGAC CCAGCACTAC AAATCGATAA TTCTAGAAAA ACGCACAAAA C
|
Protein sequence | MSSNESTKLL WTAVGAAIAG AGVAVAVLKY SPARYSAPKD EEPSFPPLHQ KRTSFIYENN ASSSNSVIFP HNHEERMRRQ ITARAAVEED NFMPRDSVTV RVPATSANMG PGYDTIGMAV DLWSEVTVER ADTFEITAEG EGATEMPKDE TNYMVIGCKA AFEAANKPLP ILKYHVFSRV PFARGLGSSS AAIVAGIIAG LILAGHRLPC WGSEALLQIA AGIEGHPDNV APVIYGGIQV GIHNGTRWVT ERVPCPAGLQ LVMFIPDFIG KTSDARSVLQ PTISREDAAY NISRVAFLVH ALCVGNLDNL KWGVEDRLHQ PQRGGKLYKY LYPMIEAAEN AGAACAYLSG AGPTVMAITA GASGDIFAQR EKERSDLAVA KAMRQAAKEF GVQGQVVVTN VSNEGARVVK VVPPFSTGGI TYRDNI
|
| |