Gene Information |
Locus tag | PHATRDRAFT_23920 |
Symbol | |
ID | 7199126 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | - |
Start bp | 192889 |
End bp | 194561 |
Gene Length | 1673 bp |
Protein Length | 503 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185231 |
Protein GI | 219130142 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.350117 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGACAACTGC GAAGGGGGAG AAGCACCATG GTAGAGCTGT TGCACACCAG CGGCAAAAAG ACTACGTGTA CACAACCGCG CCCTCTGTCG ATTACTGTTT GCCGTCGTAC GTTTTGCTAC GACGTGTCAA GCTCAGAAAC CGCGCTACGT AACGACAAAG CATGAACCGA ATCGTTGTCT TTCTGTTCGC GATGATTTCC TCTCGTCGGC TTGTGGCCGC GCTGGCGCCT ACCGCCGGCG GCAACCGGGT GTTTTCTCGA ACGTCCTTTG CTCTCCAAGC TTCGAAACCC AGCGGTGGAG GCCGCGTATC CGGTGCCGTG CAAGACCAAA ATCTACTAGA CAAAACTACC AAACAGGATT CTCCTCAACA GAAGGGGCGG CCCCAGCTCG ACACCAACCC TCCCAAGGGA ACCCGCGACT TTTATCCCCA AGATATGCGT CTTCGGACAT GGTTGTTCGA CCATTGGCGG GCCGTCGCTA AATCCTTTGG ATTTAGCGAA TACGACGCTC CCGTTCTGGA ATCGGAAGCC CTGTACGTGC GCAAAGCTGG TGAGGAAGTC ACCACACAGT TGTACAATTT TGTGGACAAG GGTGATCGCG CCGTAGCGCT GCGGCCGGAA ATGACACCAT CCCTTGCGCG TATGGTCATG GCCCAAAAAG GTGGACTTCC CATGCCACTC AAGTGGTACA GCATCCCACA GTGCTGGAGG TATGAGCGCA TGACAAGGGG ACGTCGACGA GAGCACTTTC AATGGAATAT GGATGTTTGG GGTGTAGCTG GTCCAGAAGC GGAAGCAGAG CTGATGGCAG CTATGGTGAC GTTCTTTGAA AATGTGGGAC TGACTGCCGA AGATGTAGGT ATCAAAGTCA ATTCGCGATT GGTGATTGGT GAAGTTTTGG ACTCGTTGGG AATTCCGGAA GAGAAGTTTG CGGTAACATG TGTGTTGGTA GACAAGCTAG AAAAAGTCCC GATAGAAGCT ATTCAGGGAG ATTTGGAGGA ACTGGGCTTG GATCGATCTG TTGTAGAAAA ACTGTTGGAT GTTTTGACGA ATAAATCATT GGATGCTCTA AAAGGAACAC TCGGAGAAGA TTCGCAAGCT GTCAAGGAGC TTTCCCAGTT CATGACATTG TGCGAAGCCT ACAACATTCA AAATTGGATT TTATTTGATG CTTCCGTTGT GCGTGGATTG GCCTACTACA CCGGTATTGT TTTTGAAGCC TTTGATCGAA GGGGTGAGCT CCGGGCTATT GCTGGCGGCG GACGGTACGA TAAGCTACTG GAAACCTTCG GTGGTGTCCA AACTCCCGCC GCTGGATTCG GATTTGGCGA TGCTGTAATT GTGGAGCTGC TCAAGGAACG TGACGTGTTG CCTTCCTTCG AAAGCACCGG AGTCGATACT GTGGTGTTTG CAATGAATCA AGATCTGTAC GCCCCGGCCG TCGGTGTAGC GTCGGTTTTG CGTAAGGCAG GCCAGAGCGT GGATGTTGTC TTAGAGGGAA AGAAACTCAA GTGGGTTTTC AAACATGCTG ACCGTATTGG TGCCAAATAC TGCGTCGTCG TTGGAGCAGA TGAATACCTG AACGGCGAGG TGGCGATCAA GGATCTTTCC GGGGGTGCTC AAAAATCAGT CAAGATTGAT GATCTTTCTA ATTGGGTCTC GGATGAGCAA TAA
|
Protein sequence | MNRIVVFLFA MISSRRLVAA LAPTAGGNRV FSRTSFALQA SKPSGGGRVS GAVQDQNLLD KTTKQDSPQQ KGRPQLDTNP PKGTRDFYPQ DMRLRTWLFD HWRAVAKSFG FSEYDAPVLE SEALYVRKAG EEVTTQLYNF VDKGDRAVAL RPEMTPSLAR MVMAQKGGLP MPLKWYSIPQ CWRYERMTRG RRREHFQWNM DVWGVAGPEA EAELMAAMVT FFENVGLTAE DVGIKVNSRL VIGEVLDSLG IPEEKFAVTC VLVDKLEKVP IEAIQGDLEE LGLDRSVVEK LLDVLTNKSL DALKGTLGED SQAVKELSQF MTLCEAYNIQ NWILFDASVV RGLAYYTGIV FEAFDRRGEL RAIAGGGRYD KLLETFGGVQ TPAAGFGFGD AVIVELLKER DVLPSFESTG VDTVVFAMNQ DLYAPAVGVA SVLRKAGQSV DVVLEGKKLK WVFKHADRIG AKYCVVVGAD EYLNGEVAIK DLSGGAQKSV KIDDLSNWVS DEQ