Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_19586 |
Symbol | |
ID | 7200173 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 173149 |
End bp | 175521 |
Gene Length | 2373 bp |
Protein Length | 694 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179148 |
Protein GI | 219116707 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGAACT TTGGCAATAA GTTGGTGGCT CTGGAGCACA AACCGTGGTC CGACTACTAC CTAGATTATG CCAGACTCAA GGATATTTTG GAGAGCATCC CAGATGAAGA GCAAGTGCGT CGTCAGCACA TTCTGGAACT AGATAATAAG TCTATTGAAG AACAGGGGAC GGCAACGCAA CTAGATGGAT CCTTTGAGTT CATTCACGTT TTGAACCGCG AAGTGGAAAA GATACTCTTA TTCTTTCTCC AAGAACAAGG AGAGATTGCC GCTAGCCTTG CCGACTGTCG GAGGAAGCAT CTCGGATTAA TCTCCTCCTC TACTTCTTCG GCAGTATCTC AAGATTCCAC TTCCTTGATG ATCAGCAACA ACATCGCCGC GCCTTTTAAT GAGGATTTAG ATTCCCTCCA AACGCTTTAT CATGAAATTG CTCTGCACCT GTTACATCTA ATCCAGTACG TCGATCTCAA CGTGACCGGT ATCCGCAAGA TATTAAAAAA GCACGATAAG CAACTTCCCA ATCAGCAGCT CGCGCCTATC TACCTGGGGC GTCGCGGCAA ACCCAGCATT CTGCTTCAAC CCTTGTTGGC GGACAACGAT GATTCCCTCA ACGGACTTGT GCTCGTGCTG GAAACGTCGC TCCGGGAACT CAAAGTGACG CAACAAGGAG ATTTACCGAT ACCACCACCG GCACCACCTG TGGAAGGCAA TCCGGTCACC TCCACGGGCC GTCACTTTCG TCGCCCCTCG GCACCACTTC TGGCCGATTT TTCGGTCGCC TCCTTGCTCG AATCTTCATC CCCTCGTCGC CTTAATCTGC ACCAGTATCA CAGCATCAAG CAAACAAATC ATAAGTCCAT GGTTCAGCTC AATCGGGAGA ACCCGTACGC CAATTTGCCA ACGGACGCTA TCGTGCTTCA AATCCGCGCC GCGCGCAGGC GTTTGCAGCA GACTAGCGAC TTTGTCCAGT TTCTAGCGGC TTCCCTCATG ATGTCATCGG AGGTATCGCT GATGGGAGAG GAAGAGGAGG AAGATGATAC CACCGACGAG AAAGCGGGAC AAACCAAACC GTCCGAATTT TCCAATTTGC TCAACTTACT GTCGACGTTT TTGTACATGA CCAACTACTA CATTGTAGCA CCGAGTTCCG GGACGTATGC CGAAAAGCTA GGCGGATCGG CCGCTCTGAG TGGAATCATT ATCGGAATGA CCCCCGTGGC GGCCTTGGTA TCAACAGTTC TTTACAGTTG GTGGACGTCG TATTCGTACA AGGCCGCCTT GATCTTTGCT TCGTCTTGCA GCTTGATCGG AAACGTCTTA TACGCTACTG GCCTGCCGTA CAACTCTCTG GCTTTGGTAA TGTTAGGACG ACTACTGAAT GGATTCGGGT CGGCCCGGTC CATTAATCGT CGGTACATTG CTGATACGTT TTCCAGGTCC CAAAGAACAG CCGCAAGTGC GGCATTTGTC ACAGCTGGGG CCTTGGGAAT GGCAGCTGGG CCAGCGGTAG CCAGTCTCTT GCACTTGACG GTATCAGATA GTAGTCTGAA CTTGTATTGG CAAGTCGAGA ATAGTGTGGG ATGGTTCATG GCTGTCGCGT GGGCAGTTTA TTTGGTCTGT TTGATAATGT ACTTTTCCGA TCCACATAAG AAAGTGCACT TGGCCTCTCC AAAATCAGAA TCGGGTGAAA AGAAACCATT GCTGACTAAT GGCGAGTCGG ACAACAACAG GTTGCGGGAT AGTGGTGGTA TGCAACAGCA AGACAATCCC ATGTGGAGAA ACATTCCGGT CATGACGACC TTTTGGCTAT ACTTTGTACT CAAACTAGTC CTCGAATGCC TTTTGAGTTC GACTTCGACC TTGACTTTGT TCTACTTTGG ATGGAGAGGC GATATCTCGG GAGTGTATAT GGCAGCGCTG GGATTACTCA TGCTGCCGGC CAACTTTGTT GTGGCCTACT TCTCTAAATC ATACTTTGAC CGCGAATTGA TCATGGGATT ACAAGTCACA ATGTTACTTG GATGCTTGAT CATTCTCCAG TACAGTTACA ACTACAGTAT TGCGCAGTAC ATAGTGGGTT CGGTTATTAT ATTTGTGAGT ACCAACGCTC TGGAAGGACC CAACATGAGC CTTCTGTCCA AAACAATTCC AAAGTCTTGG GCCAAAGGAA TCTTCAATAT CGGACTGCTA GCTACTGAGG CTGGTACGCT TGGTCGTGCC GTTGGTGATG TCTTATTGAC TGCATGTGGC GAAGAGGGCC TCCAGTATTT GCTGAATCGC TCTATGGGGA CAATGTCGCT GTTAAGCTTC GTCACGCTGC TCGTATCGTA CATGGTTTAC GATCACCTTG AGCCTTTAGA CAATGATGAC TAG
|
Protein sequence | MVNFGNKLVA LEHKPWSDYY LDYARLKDIL ESIPDEEQVQ QGTATQLDGS FEFIHVLNRE VEKILLFFLQ EQGEIAASLA DCRRKHLGLI SSSTSSADLD SLQTLYHEIA LHLLHLIQYV DLNVTGIRKI LKKHDKQLPN QQLAPIYLGR RGKPSILLQP LLADNDDSLN GLVLVLETSL RELKQTNHKS MVQLNRENPY ANLPTDAIVL QIRAARRRLQ QTSDFVQFLA ASLMMSSEVS LMGEEEEEDD TTDEKAGQTK PSEFSNLLNL LSTFLYMTNY YIVAPSSGTY AEKLGGSAAL SGIIIGMTPV AALVSTVLYS WWTSYSYKAA LIFASSCSLI GNVLYATGLP YNSLALVMLG RLLNGFGSAR SINRRYIADT FSRSQRTAAS AAFVTAGALG MAAGPAVASL LHLTVSDSSL NLYWQVENSV GWFMAVAWAV YLVCLIMYFS DPHKKVHLAS PKSESGEKKP LLTNGESDNN RLRDSGGMQQ QDNPMWRNIP VMTTFWLYFV LKLVLECLLS STSTLTLFYF GWRGDISGVY MAALGLLMLP ANFVVAYFSK SYFDRELIMG LQVTMLLGCL IILQYSYNYS IAQYIVGSVI IFVSTNALEG PNMSLLSKTI PKSWAKGIFN IGLLATEAGT LGRAVGDVLL TACGEEGLQY LLNRSMGTMS LLSFVTLLVS YMVYDHLEPL DNDD
|
| |