Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50353 |
Symbol | |
ID | 7199136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011697 |
Strand | + |
Start bp | 54423 |
End bp | 56393 |
Gene Length | 1971 bp |
Protein Length | 553 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185272 |
Protein GI | 219130229 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAGTGCCGAC GCAAAGTATT AGTCAATAGG TAGTCAACAC TTCTCAGTTC GCTTTCGTCA GCGACGTGGG GTTCGCTTTT GTAGCCGATC TCATAGCTAC GCCGATGAGA GTAGCATAAA CCACCTTATT GCTTGCTACC TCCTGAGCGC AGCCAATATG CTGCTGGGTT CAAGCACATA AAATTCTTTG ATCTCCGTCC CCCTTTCGAA ACCAAAGCAA AATGAAACAA TTTGGAATAT TCTCGCTTGC TTGCGCATTG GCTTTCTCTT CGTTGGCACC GGCATATGCG TATTCCGTTG CCCCTTCAGT ACCAAGCATT ATTCAAGGCG GAATGGGAAT CCGGATTTCC CAATGGAAGC TTGCGCGTGA GGTAGCCCTG AAAGGTGAGC TAGGTGTGGT TTCAGGCACT GCGATGGATA ATGTCATGGT GCGGGAGCTT CAGAAAGGTG ATGCGGAAGG ATCCTTTTTT CGCGCACTCA AGACTTTTCC TGATCAAGAT ATGGCAAACC GGATTGTCGC GCGATTTTAT ATAGAGGGCG GAAAAGACCC TTCGGAGCCT TACAAATCAA TTCCAATGTG GACGCTCACC CCTAACCAAC TACTCTTGGA AGCCAACGTT CTTGCAAACT ACTGTGAAGT TTGGTTGGCC AAGCACAACG ACGATGGATC AATTATTGAT GGGCTCGTCG GAATGAATCT ACTGACAAAG GTCCAACTTC CGACGATCGC ATCCCTCTAC GGTGCTATGA TGGCTGGCGT CGACTACATC ATAATGGGAG CTGGAATACC GATCTCGGTT CCTGGATTTC TCGACAATCT ATCCGAATGC AAAGACTGTG AGCAGAAGAT TGATGTGGAC GGAGTTGCAG AGAAAGAAGC CCCGGTTTAC AAGTTTTCAC CGATTGCGTT CTGGGAAGCA GCGGGGAAGC CAGAATTGGC TGCTCCGTTA AAGCGCCCAT CGTTCCTCCC TATAGTTTCT TCTACAATAC TCGCTCAGTC CCTTCTGAAA AAAGCCTCCG GAAAAGGGCC GACGAGAGGA ATTCAAGGAT TCGTGGTGGA GCTCAGTTCC GCAGGAGGCC ACAACGCCCC TCCCCGTGGT TTTAAATTTG ATCCTGTCTT CAGTACACAC GCAGGCGGTT TAAATGAGCG TGGCGAACCT GTATACGGCC CCAAGGATGA AGTCGACCTG GCTAAGTTTT GCAAGGCTTG CCAGGGTTTG CCGTTTTGGC TAGCCGGTTC TTACGCTCGA CCTGAACGAT TTGCCGAGGT CCGAGCATTG GGAGGTGCAG GTGTCCAGTG TGGTACAATT TTTGCGCTTG CTGAAGAATC TGGCCTCGAC GATTGGATCA AACAGGACAT TCTTCGCAAA CTGTCAGAAA CCCGTTTGGA TGTGCTGACA GATCCCGCTG CCTCTCCCAC AGGATTTCCG TTCAAAGTAC TCGATTTACC CCAGAGTCTT TCCCAAAGAG AAGTCTACGA AGCTCGTCCA CGTGCATGCA ACCTGGGCTA CTTGCGACAA CCTTACAAAC GGCCTGACGG CAAAATTGGT TATCGTTGCC CTGCGGAGCC GGAGGTAGCA TTTGCTAGAA AAGGTGGGGA TGCCAAGGCC ACTGTCGGTC GTAAATGCCT CTGCAACGCC CTCTGTTCGA ATGCAGGGTT TCCGCAAGTT GGAGAGGTCA AAGCCGTCAA CGGAGAAAAA ATGAAGTACG TTGAGCTACC CCTGATCACG ACTGGAGACG ATATTAGTAG TTGCCGAGAC TTCATCAAGG AAGATGCTGA TGGTCATTTA GGCTTTCCTG CTGGCGAGAT TGTGGATTAT CTGCTCTCTG AATGGAAAAG GAAGCCGGTC GGATCCGCAG CCGAGGGATC GATGTCAATT TAAAATTGGA CAGCAGGAAT CTTAAATGTT TTATTTGACT AGCATCAGAA TGTGACACTG ACAGTAAATT AAATTTTGAA CAAAGCGGTT G
|
Protein sequence | MKQFGIFSLA CALAFSSLAP AYAYSVAPSV PSIIQGGMGI RISQWKLARE VALKGELGVV SGTAMDNVMV RELQKGDAEG SFFRALKTFP DQDMANRIVA RFYIEGGKDP SEPYKSIPMW TLTPNQLLLE ANVLANYCEV WLAKHNDDGS IIDGLVGMNL LTKVQLPTIA SLYGAMMAGV DYIIMGAGIP ISVPGFLDNL SECKDCEQKI DVDGVAEKEA PVYKFSPIAF WEAAGKPELA APLKRPSFLP IVSSTILAQS LLKKASGKGP TRGIQGFVVE LSSAGGHNAP PRGFKFDPVF STHAGGLNER GEPVYGPKDE VDLAKFCKAC QGLPFWLAGS YARPERFAEV RALGGAGVQC GTIFALAEES GLDDWIKQDI LRKLSETRLD VLTDPAASPT GFPFKVLDLP QSLSQREVYE ARPRACNLGY LRQPYKRPDG KIGYRCPAEP EVAFARKGGD AKATVGRKCL CNALCSNAGF PQVGEVKAVN GEKMKYVELP LITTGDDISS CRDFIKEDAD GHLGFPAGEI VDYLLSEWKR KPVGSAAEGS MSI
|
| |