Gene Information |
Locus tag | PHATR_43994 |
Symbol | |
ID | 7204400 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 641490 |
End bp | 644482 |
Gene Length | 2993 bp |
Protein Length | 942 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186388 |
Protein GI | 219113609 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACAGCAAACA CTCCGGTACC ACCGATCCTT GTTTCTGTGA CCATTTTGAA AGCATCAGCA CATATTTTTT CACGATGCCG ATGGATATTC GTCAATTTTT CAAAGGCGGA GGATCCAGCA AAAAAAACAC CGTAAAGCCG GTATCCAATT TGATGGATCA GGTCAAACTG GTGAACTCTG GCTCCAAAAA GCGCAAAGAA TCCCCCGTAC ACGAGGAAGA ATCCACAAAT ACTTTTCTTG AATCCACTGG AAATGTTCCT ATTCAGGAAC GAGAGAAAGA GCCATCAGGT CGTAGGAGAT CGCCCCGTAA GCTGTCGAAA AATAGTCCGG CAAACGTAGT GCGAGCCGAG ATTGGATTTG TCGACGGAAC GAAAGAAAAG CCGATCGCTA GTCCCAACAA AGCTCTGGAT ATGAGTTTAT CGAAAAAATC TTCCAAATTA GCAACGTCAC CCCAAAAAAA ACCGTCGGGA GTTGTCAGCA TTGCCAACTC TCTTCCCCCG ACCGCCGCTA TCGGGAATCG CAATGTCCCT TCCGATTTCA GCCCCTCGCC GCAGACCCGC AAATCTCCAC CCACGAGTAC TGTGAGCAAC ACGAAACGCC TAAAGCGTGA TCCGCCTCTG GAGCCCAAGC TAACACAATC ATCGTTCAAT GTCGACAAGG CTGCTCCAGA ATGTTTGAGA GGCTGCACGT TTGTCTTTTC CGGTGTCTTA CCGAACCTCA GTCGCGAGGA CGGCCAGGAA ATGGTCAAAA CACTTGGCGG ACGGATCACT GGAGCTGTAT CAAGTTTGAC AAATTATCTT GTTGTTGGCG AAGAGTTGGA AGATGGACGT GTCTACACAG AGGGCAGCAA GTACAAACGT GCGGTCCAAG AAGGTACGCA CATTGTTCAG GGCGAGGAGG CCTTTTACGG GTTGCTACAG CAGTACAATG ACAAGGAAAT CGCAGCAGGA AATGCTTTAT TGAACACAGC TCCCAAACTG AGCCAATCGG AAGCACCTCT TGCTGCGAAT CCGTATGCCA AGAAGGCGCC AAATAATCCT TACGCCAAAC CTGCCCTTTC AAATCCTTAC GTTAAGGCAA AACCATACAC TTCTGGCAAG CCTTCGCCTG CAGAAATTAG CTCACCAGTC GACATCAAAG CAGATCGCTC TTCCGGTGCT AACCTTCTTT GGGTGGACAA GTACAAACCG ACTCGCTCGG GGGAAATTTT GGGAAATGCC GAGTCGGTGA AGAAGCTTGG CCTTTGGCTG TCATCTTGGG AACAGAAGTT TAACAACTCC AAAGCTGTTG GAAAAGGTGT TGCTAATCCA AACGATCGCT TCAAGGCCGC ACTTTTGTCT GGGCCACCTG GCATTGGTAG TAAGTACAGA TTCTTCCTGT GTTTGTTTTC TCTCGCTTTC GGATACGTTT CTGACACTCA ACCCATCATA TATTCAGAAA CGACTACAGC AACTATTGTT GCAAAAGAAT CAGGTCGCGA TGTGATTGAA TTCAATGCTT CCGACGTGCG ATCCAAGAAA GCGATCAAAG ACGACATGGG TGATATCACT GGTTCATACA CACTCGAGTT TGGCAAACCC GCCATCAATG AAAAGCGCCA AAGTAGTCGG ATTAAGCGTT GTATAATTAT GGACGAAGTT GATGGCATGG GTGCTGGGGA TCGCAGTGGG ATGTCAGAAC TTATTCAAAT GATTAAAAAG AGCCGAGTTC CGATTATCTG CATTTGTAAC GATCGGCAGT CCCAAAAGAT GAAAAGCCTG CTTCCCTACT GTATGGATCT TAGGTACCGG CGACCGACAA 
AATCTGTAAT CGCGAATCGC GCTGTAAGAA TTGCGGCACA AGAAGGATTT ACCGTCGAAC AAAACGCAGC TGAAGCGATT GCTGAGTCAT GCGGAAACGA CGTTCGGCAG GTTTTGAATT GCATGCAAAT GTGGGCCAGT GACAGCAGTA GTGAATCGCG CATGACTTAC AAGGATTTGA AACAACGCGA GAGCTCCATT AACAAAGACG AGATCCTCCG CGTCAGTCTT TTCGATGCAG CGCGAAATAT TTTGGAAGGT CGTCGAGGGC TACAAGGAGC TGATGCATCG ACCGAGCGAC AGCACTTTTT CAGAAGAAAC GATGCCTTCT TCGTAGACTA CAACTTTGTT GGTCTGTTGG TACAGCAGAA CTACATCAAA GTGATGCAAG GTCAGTTCAA TGATGCAAAA CGTTCAAATG ACCAGTCCAA TATTTTAGGT GTTTTGGAGC GAATGAGCCA GGCCTCGGAT GCCATGTCCG ATTTTGCTGA GGCCGAGAAC GGACTGAGGG GAGGCCAGAA CTGGAGCCTT TTGCCCTTTT GTGCAATGCT AGCGGTAAAA ACTGGCTTCC ATGCTGGTGG TCCCAATGGG GGCGGTCTTC CTGGCTTCCC AGACTTTACT TCTTGGCTTG GACGAAATTC TAGCAAAGGC AAGAAAGCTC GTCTGTTACA CGAACTACAG CATCACATGA ATTATAAGAT TAGTGGTGGA GCTCAAGAAA TGCGTTTATC CTACCTACCA GTTTTACGTG ACCGGTTCTT GTCGCTCCTA CTGGGCAGAG AAGAAGGACT CACTGAAAAA GCCATTGACC TCATGGATGA ATATGGCCTG GACCGAGACG ACGTCTTCGA AAAGCTTGAT GAGTTTCGAA TGGATCACAA GGCGGACACC TTCGCTAAGC TGGATAGCAA GAAAAAGGCC GCCTTCACAA GGTTTTATAA TCAAGGTACT CATAGAAGCC AAGCACTAGT GGCTGAACAA GGCGGTAGCA AGACGGTTAA GCGTGGTGCT AACGCGGTTG CGGAGGAAAC GATTGATCCA GATGCCATCG ACGATGATGT CGCAAAGGCT GAAGAAAATG AAGGTGATGA TGCGGACGAG GACATGGAAA AGATTAAAGC CATGTTCAAA AAGAAAGGGC GAAACACGAC GCAGAAAGCT GCTACCAAGG GTAAAGCCAA GAAAAAGAAA TAG
|
Protein sequence | MPMDIRQFFK GGGSSKKNTV KPVSNLMDQV KLVNSGSKKR KESPVHEEES TNTFLESTGN VPIQEREKEP SGRRRSPRKL SKNSPANVVR AEIGFVDGTK EKPIASPNKA LDMSLSKKSS KLATSPQKKP SGVVSIANSL PPTAAIGNRN VPSDFSPSPQ TRKSPPTSTV SNTKRLKRDP PLEPKLTQSS FNVDKAAPEC LRGCTFVFSG VLPNLSREDG QEMVKTLGGR ITGAVSSLTN YLVVGEELED GRVYTEGSKY KRAVQEGTHI VQGEEAFYGL LQQYNDKEIA AGNALLNTAP KLSQSEAPLA ANPYAKKAPN NPYAKPALSN PYVKAKPYTS GKPSPAEISS PVDIKADRSS GANLLWVDKY KPTRSGEILG NAESVKKLGL WLSSWEQKFN NSKAVGKGVA NPNDRFKAAL LSGPPGIGTT IVAKESGRDV IEFNASDVRS KKAIKDDMGD ITGSYTLEFG KPAINEKRQS SRIKRCIIMD EVDGMGAGDR SGMSELIQMI KKSRVPIICI CNDRQSQKMK SLLPYCMDLR YRRPTKSVIA NRAVRIAAQE GFTVEQNAAE AIAESCGNDV RQVLNCMQMW ASDSSSESRM TYKDLKQRES SINKDEILRV SLFDAARNIL EGRRGLQGAD ASTERQHFFR RNDAFFVDYN FVGLLVQQNY IKVMQGQFND AKRSNDQSNI LGVLERMSQA SDAMSDFAEA ENGLRGGQNW SLLPFCAMLA VKTGFHAGGP NGGGLPGFPD FTSWLGRNSS KGKKARLLHE LQHHMNYKIS GGAQEMRLSY LPVLRDRFLS LLLGREEGLT EKAIDLMDEY GLDRDDVFEK LDEFRMDHKA DTFAKLDSKK KAAFTRFYNQ GTHRSQALVA EQGGSKTVKR GANAVAEETI DPDAIDDDVA KAEENEGDDA DEDMEKIKAM FKKKGRNTTQ KAATKGKAKK KK