Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46994 |
Symbol | |
ID | 7202231 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 126464 |
End bp | 128773 |
Gene Length | 2310 bp |
Protein Length | 730 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181306 |
Protein GI | 219121923 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00183734 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTGATAGTAA GGCGTTCAAC TTCACGAGCT GACCATAGCG TGCATCGCTT GTGGCCTACA GATAGTTTTA ATACAATGAA TCTGCGAGAT GATTCAATAT CAGATTTTCT GGAATTTTTG TCTCGTCCAG AATCTTCAGC TGCAAGCCTG CTTTGGAAAG ATGAACCCAA AGCGACCGAT ATGCTCCGTC GCACCTGCAA GGTTCTTTTC CAACGCACTG AACATCTCGC AAAGAAGCAC CCCACCATAA TGAAAGATAA CAGCATAAGT TCGGGGCTTT CCGCTCTACC TGAGCTTTAC ATGGGCAGCG TAGACGACTC GGTTGATGCC GAAACTCTTT GGGGTCAGGT TGAACTGCAA AATGAAGCTT TGCAGAAGCT GCTGAAGAGA TCAGTGGCAC AGCTCGCTAA GATCGCAGAG GAAGGAGGAT CGTCTATCAA ACTTCTTGAT GACGTTTTGA GTGGAGATAA TGACGGCGAA ATTTCTGTTG AGGATCACAT CGAAAGGCAA GACCATGAAG CATTGAACGT GAATGTTGAT GACGCAACCC GACGGGTTTG GGAACGTATG GAGCGTGCTA TGGACGACGT GGACGAAGAG GATCCATCGG ATAACAGTGT AAACAACATA GTGGTCGGAG AAGAAGACGA GCGTAGATCT GTTGATGCAA GTTCGATCGA AGACCCTGCA GCTGACGAGC TCAATGACGG GTTCTTTGAC ATAAATGATA TGGAGGCTTT CGCCGACGAA GAGGAAGAGT ATCTTCCCGA CGAAGCTTTT GGTTCTATAC CACCTGATTC ATCAGAGACA GCTGATAATC GATCGTTCCA CCAGAAGCAG CGCGACGGAT ATTTTGACAA CAACTCCGAG GAAGTTCTCG ACGATGAGCT TCGTCGTGGA AAGGAATCTC AAGGAAATCG AAAGAAATAT CGAGAAGATG ACGAGATTGG GGCACTTTAT AAACTTTACG ATACTCCTCG AGACGACGAT ACAAATGGCG AAGATGCTGA TCCCGTGAAC GTGAAAGCTG TTGATGTTTT TGGAAAGCCA AAGGAGAAGG ATTTTAAGAA ATGGAATTCT CGAGTGAGAA ATAAAGCAAA CGACAAAGGC AACGACGGCG ACGACGATGC ATGGAACGAA GATGGTGTTG ATCAAGCTAT AGCTAGCAAA ACGACAGGTT GGAATAATGA CGAAGAAGAG TCCGATGCGG TTATTGAATT TAGCCGTGGT GATACATCAT CATTGTACAA AGAACATGGC GAAAGCAAAA TCGAAATCGA TCAGAGATTA AAAGCGGGAA GTTCTACTTT TACGAAACAA CAAGAGAGGC TTCGTCGTCA GACAGAAGAA CTAGAAAGGG AAATGATAGC TGAAAAGCCT TGGCAAATGA CTGGTGAGTC TACATCGACA TCTCGTCCTG TGAATTCATT GTTAGAGTCG ACGCCGGAAT TTGAGCGCGC TGCCAAATTG GCACCGGTTA TCACAATAGA GCACACAGCG GATCTCGAGG AAATTATCAA AAATCGCATC ATTGCTGACG ATTGGGATGA CTTGGTGCCT CGAGAGCTGC CTGATATTGG CTTTGGTCAA AAGAAAGGCG AGTTACCGGA AGTTAGCCAA GAAAAGTCCA AGCTGGGCCT AGGTGAGCTT TACGAGCGCG AATATCTCAA GAAGGCTATA GGCTACGATG TGTCTGCTGC TGAAAAAGAA TCAGAAGAAG AGAAAGCAAA AAGTGAAATG AAGACACTCT TCGCAAACCT TTGTAGCAAG CTTGATGCTT TATCCAACTA TCACTTCGCG CCACGCCCTA TTGCAGAGGA GGCTGAGGTG CGACCTGTCA CTAAGCCAGC GATTGCGATG GAGGAAGTTT TGCCTTTGCA TGTAAGTAAT GCTCGTGGTG TCGCGGCCGA AGAGGTGTAC GGCGCGAAAC GTGGTAGGGA AGCCATTCTC CGAAACGAAA CCGAGCTTGA CCAAAAGGAT CGGAAGCGCG CCAGAAGCTT AAAAAAGACG GCTAGGCGGA AAGCAAGAAA GGAGAAACAA GCGGACGAGA AACTCATCTC TCGGCTTCAA CCCGGGCTTG GGCTTAATAA CCCTTACGAA CGCAGGAAAA TGCGTGAAGA ACTATCTGAG GCAAGGGCAC GAGGCAAGGT CACAACTGGC GAAACTGACA TGAACGAGTA CGGCGGTAGT GGGACTTTCT TTAAGCGCAT GCAGGAAGAA GCTGAGCAGT CTATTAATGA TCGTAAGACT GATGGGTCAG GGAAGAAAAA TGTGCGCTTG CACCCCAAGT CAAGTTCACT GAAGCTTTAA
|
Protein sequence | MNLRDDSISD FLEFLSRPES SAASLLWKDE PKATDMLRRT CKVLFQRTEH LAKKHPTIMK DNSISSGLSA LPELYMGSVD DSVDAETLWG QVELQNEALQ KLLKRSVAQL AKIAEEGGSS IKLLDDVLSG DNDGEISVED HIERQDHEAL NVNVDDATRR VWERMERAMD DVDEEDPSDN SVNNIVVGEE DERRSVDASS IEDPAADELN DGFFDINDME AFADEEEEYL PDEAFGSIPP DSSETADNRS FHQKQRDGYF DNNSEEVLDD ELRRGKESQG NRKKYREDDE IGALYKLYDT PRDDDTNGED ADPVNVKAVD VFGKPKEKDF KKWNSRVRNK ANDKGNDGDD DAWNEDGVDQ AIASKTTGWN NDEEESDAVI EFSRGDTSSL YKEHGESKIE IDQRLKAGSS TFTKQQERLR RQTEELEREM IAEKPWQMTE STPEFERAAK LAPVITIEHT ADLEEIIKNR IIADDWDDLV PRELPDIGFG QKKGELPEVS QEKSKLGLGE LYEREYLKKA IGYDVSAAEK ESEEEKAKSE MKTLFANLCS KLDALSNYHF APRPIAEEAE VRPVTKPAIA MEEVLPLHVS NARGVAAEEV YGAKRGREAI LRNETELDQK DRKRARSLKK TARRKARKEK QADEKLISRL QPGLGLNNPY ERRKMREELS EARARGKVTT GETDMNEYGG SGTFFKRMQE EAEQSINDRK TDGSGKKNVR LHPKSSSLKL
|
| |