Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47039 |
Symbol | |
ID | 7202122 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 250613 |
End bp | 252596 |
Gene Length | 1984 bp |
Protein Length | 413 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181337 |
Protein GI | 219121987 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.915264 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAACCACTGA CTGACTGTGA ACTCGCAGCA ACAGGCACGG CCCGCAGAAT CTGTGGGAAC CTCACGATCG TATCAGTCAC AAACAAACGC GCACCACCGA TAAAAACGTT CATCGCCTTG CAGTAGACGC TCATTCTTAT CTTGAAACAA ACCACTTTGC GAAATCAAGA AACGACATTC AGAACCATCA TCGAGACACA CTGTTCATGG TTGATACCTG TTTCGAAAGC TTTCTTAAGA ATTCGCGTAC CGCCCGACTC CACGGATTTC ATTCGCAGGC GCCTATAACA GTCTTGATAT CGAAAGACTG CCATTCAGAC TCTCAAATAT CGTCTCAAGA TTCTTCCTTG CCGACACAGA TATCGTGTCG GAGACGTGCA AGATTCCATC TTCAACCAGA AAACAACGCG CTATTTGGCA TTTGGCCTTT GTCGATACCA GACATATAAC AAGAGTTTAG GTATAGCTTC ATTTCACGAC TTTAACTACA ACCATGGTGG TCCGTGGAGG AACTATAATT AACATTTCCG GTGCTTCGCC GGTAGATGAT CCCGAGTATC GCTACAAGAT GCCCGCGGTT TTTGGAAAGA TTGAAGGGTC TGGGAATGGT ATCAAGACAG TGATTCCCAA CATCACTGAA GTGGCTCTTT CTTTGCACCG GCAGCCCGGC GAGGTCAACA AGTTTTTCGG AACGGAACTT GGCGCGCAGA CGCGTTACAG TGCCGAAACG GATCGCGCTG TTGTAAATGG CGCGCATATG GACGCCGTTC TTCAGGATTT GATGCATCGT TACATTGAAC GGTTTGTGCT ATGCCCCAAC TGCAACCTGC CTGAGACGGA CTACAAAATT AAGAATGATG CTATTTGGCA CAAGTGCGCA GCTTGTGGAG CTAAAGAAAT GGTGGACATG AGCCACAAGC TCTGCAACTA CATTCTTGCG GAGGACAAGA AAGCCAAGAA AGACAGCAAG AAAAGTAAAA AGGGTGATAA GGTACGTGCA GTGGAACAGG AAATAGCACC GTTGAAAATG TCCGTTGTGT TGTTTAACCC TTTTCTCTTA AAATTTACAG GATGACAAGA AGAAGAAAGA CAAAAAGAAG GACAAAGACA GCGATGACGA GAAAAAGAAG AAAAAGGACA AAAAGAAGAG CAAGGATAAG AAAAGCAAAG ACAAGAAAGA GAAGAACGGC GACGACGAAG ACGAAAAGGA TTACATCAAG GAGGCTCTCG AGGGTGGGAA AACGGAAAAT AATGGTCTTT TGAACAGCGA CGACGAAGAC AGCGTGTCTC TTGCCTCTGA AGCTGGCGTC GATGATCAGG GTGCCTTGCT GCTTGCTGTC GAAGCTACAA AAAAGTATAT TGCGGAGAAT TCCGATGTTA GTGATAAAGA GCTTTCTGAG GTCGTGACTA ATCAACAAAT GGCTTCAGCC CTCAAGTCGC ACGACAAGGT CCATATTATC GTGCGTGCGG CGCTCTCCGC TCAATTCTTC AAAAACAAGG AAATCGAGAA GTATTCTTCG GCCATCTATA GCATCACGAG TGGCAACAAG ATCATGGAAC GTCATTTGAT TGCGTCGCTC GAGGCCCTGT GCATCGATAA GCCCAAGAAC TTTCCCGTCA TGATCAAACA GTTTTATGAC GAGGATGCCC TTGCTGAGGA AACAATTCTG GAATGGGCCG ACGAAGGTCG CTCAGAGTTT ACCCTACCAG AAGTGGACGA GGATGTTCGA GCTACACTTC GTGGAGAAGC TGAGCCTGTG GTTGTCTGGT TGCAGGAAGC CGATAGCGAA GACGATTCCA GTGACGAGGA TTAGGTCTTG CGCTCAGCTT TACGAAGTTG ACGAGTAGTC ATACTTTATT CTTACCGAAT TTTATCGATG TTGACAGGCA AGACAAGTAG ACCTCATTAC CCAGTGAATT GCTGCAACGT ATCGCCAGTG AATAATCTAC GAAGACTGAT AAAAATCTAA GAAATTTACT ATAT
|
Protein sequence | MVVRGGTIIN ISGASPVDDP EYRYKMPAVF GKIEGSGNGI KTVIPNITEV ALSLHRQPGE VNKFFGTELG AQTRYSAETD RAVVNGAHMD AVLQDLMHRY IERFVLCPNC NLPETDYKIK NDAIWHKCAA CGAKEMVDMS HKLCNYILAE DKKAKKDSKK SKKGDKDDKK KKDKKKDKDS DDEKKKKKDK KKSKDKKSKD KKEKNGDDED EKDYIKEALE GGKTENNGLL NSDDEDSVSL ASEAGVDDQG ALLLAVEATK KYIAENSDVS DKELSEVVTN QQMASALKSH DKVHIIVRAA LSAQFFKNKE IEKYSSAIYS ITSGNKIMER HLIASLEALC IDKPKNFPVM IKQFYDEDAL AEETILEWAD EGRSEFTLPE VDEDVRATLR GEAEPVVVWL QEADSEDDSS DED
|
| |