Gene Information |
Locus tag | PHATRDRAFT_50348 |
Symbol | |
ID | 7199176 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011697 |
Strand | - |
Start bp | 8298 |
End bp | 10351 |
Gene Length | 2054 bp |
Protein Length | 317 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185314 |
Protein GI | 219130317 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00515891 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGTTT CTCGATCTGA CAGTCGTAGC TTTTCACCAA GATCTGGATG GTTCGCTCTG CCAAACCGGT ATAGTGCGTT GTTTCCGTGG GCCCTCTTTC TGATAGGTTG CTTTATCGGC CATCTACACG GGAACTTCAT AGCTCGTAGT GAAAATAGCC AAATCATCAA CGAGTGTCTC CGACATGACA ACCCTCTCTG TGTCACTTCG CACGTCCATA CCGATGGCGG AGTCGATCGC ATTGGCGACG GCTGGCACTC GATTGACGTG TTTTACGGCC ATACAGACTT GCTTAAGAAG ACACTACCGT CCAACCGCAC GTGGTTTTCG CAAGCTTCGC AAGATGAACT CGTAGCAAGC TTATTCAAAG GCAAACGCGA TGGTTATTTT ATTGATCTTG CTGCCAATGA TGCGACCGAC TTGTCAAACA CCTACGCTTT GGAACAAGAG TACGGTTGGA CCGGATTGTG CGTCGAGCCC AATCCAATGT ATTGGCACAA TTTGAGCTAC AGGGATTGCC AGATTGTCGG TGCTGTAGTT GGACAAGCTC GTCTGGAACA AGTGCATTTT CGGTTTGAAG CAGGGGACCA TGGTGGTATT GCAGGCGATG GCTTCGACAA TGGAAAAAGA TGGCAACGAT ACAGCGAACT CAAATACACC GTCACTCTAC TAGAAATATT GGAGCGCTAC AATGCTCCCA CCCAAATTGA TTATCTTTCG TTGGACGTGG AAGGCGCAGA GTCTTTCATT ATGATGAATT TCCCTCTCGA CAAATATCAG ATCAAGGTAA TTACAGCAGA GAGGCTTCGA GGACCAATCA GAGAGTTTTT GAAAGGCCAC GGATATGTGT TTGTAAAAAA GCTTACAAGA TGGGGCGAAT CCTTGTGGAT CCACAACAGT GCCAAGGACG AACTTGATCT CCAATTGATA CAACAGTTCA ACTTCCCAAT ATAAAAAGCC GTAGAGGACT GGCCAACGAA GCGGCAATCT CAATATTCTG CTATCCAAAC CTTCACTTCT GTTTGTTAGA TTTCTTTTGA ATGTTCCCAC TGGCTGCAAA AGCAGAAAAG GACATGTAAC TCAGAACGGA AGCAATCGTC GAGCCGCCAG TAATCCAGCA TAGGGCGGTG AGCGTTTCAT CCAATCCAGC ATGGAGCGGA AGAGTAATGC CCACGCCAAG AGTAATGAAT TGAAACATTG TGCTAACTTT ACCTGTAAAA GTTGGTTGCA CTTTTAAAGG TGTTCTTGTG GGATCAACAA CTGCTATACC TTTCTGTGTT TGTTTTGCAA CGTAAAAGTA GGTTGCCGAC ATCAGAACGA GGTCTTTCGC TAGCCATAAT ACCACAAGAG GCGTTGGAAG CGTACCGTTG CACCACAGGG AGACCGAGAG CGCGTTAATC AGAAGCTTGT CCGCAAATGG GTCTAAGTAG GTTCCGAGCA CAGTTGCCAT GTTGTGGTGC TTAGCTAGGT AACCGTCCAG TCCGTCCGAC AGTCCAGCTA AGAAACACCC TGCTAAAGCA GCCTCGTGTT GATTTGTAAT AATCCAGTAC GAAAGGAGCG GCGTGCATGC AATTCGAGAA AGTGTAATGA CGTTAGGTGG GGATTGTAAC TGCCGAAACC AATTTCCCGT TGCATTTGTT TTCCAGGCGT TGTCTTCGTC TGTGCTGTTC GTGCCAAAGC ATAGCTTTTG CTTCTCGGAA AAAATATCAT TCACTGTAAC TCTGATTCGC CGACACCGTG AATCAACTGT AAAGCGAGAT TTCTCGTTGC CACGAACTGC CCCACACGGC GCAAAGGAGA 
CTCTCCTTTC TGCTCTGTAG CGCAGTCTCT GTTGGACACG AAGCATTCTC CACATATTTT AATTGATGTT AGCAAACACA GTCAACGGGG CTCAAACGAA AGCGGTGATG GCACGAGGAG CAAATACAGA AGTTGCAATA AGCTTTTTTC ATTCATTCGT CTACAATGTA AAGCTTTTGC CCTTGGTTTG TCCGTGACAG TGTAGAGCAA ACGCACTGCA CAACCATGCG TCTCATCTCG AACTAGCAGG AGTC
|
Protein sequence | MKVSRSDSRS FSPRSGWFAL PNRYSALFPW ALFLIGCFIG HLHGNFIARS ENSQIINECL RHDNPLCVTS HVHTDGGVDR IGDGWHSIDV FYGHTDLLKK TLPSNRTWFS QASQDELVAS LFKGKRDGYF IDLAANDATD LSNTYALEQE YGWTGLCVEP NPMYWHNLSY RDCQIVGAVV GQARLEQVHF RFEAGDHGGI AGDGFDNGKR WQRYSELKYT VTLLEILERY NAPTQIDYLS LDVEGAESFI MMNFPLDKYQ IKVITAERLR GPIREFLKGH GYVFVKKLTR WGESLWIHNS AKDELDLQLI QQFNFPI
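The length fields in this record can be cross-checked against each other with a little arithmetic. The sketch below is a minimal consistency check, not part of the database's own tooling. It assumes that Start bp and End bp are 1-based, inclusive coordinates (an assumption that happens to reproduce the listed Gene Length), and that the protein is terminated by a single stop codon. The function names `span_length` and `min_cds_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the record above.
START_BP = 8298
END_BP = 10351
GENE_LENGTH_BP = 2054
PROTEIN_LENGTH_AA = 317


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end], assuming 1-based inclusive coordinates."""
    return end - start + 1


def min_cds_length(protein_aa: int) -> int:
    """Nucleotides needed to encode protein_aa residues plus one stop codon."""
    return 3 * (protein_aa + 1)


if __name__ == "__main__":
    # Does the coordinate span match the listed gene length?
    print("span matches Gene Length:", span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # How many bases of the listed gene sequence the protein itself accounts for.
    print("coding bases incl. stop:", min_cds_length(PROTEIN_LENGTH_AA))
```

Under these assumptions the coding portion accounts for fewer bases than the listed gene sequence, so the Gene sequence field evidently contains more than the translated region shown in Protein sequence.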