Gene Information |
Locus tag | PHATR_44021 |
Symbol | |
ID | 7204216 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 716332 |
End bp | 718703 |
Gene Length | 2372 bp |
Protein Length | 728 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186403 |
Protein GI | 219113639 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | CGAAAGCCAA TCGTCTCCCT GCCGGTGAGC GAAAAATCTC AACTATAGAT TCGGTGTACT GGTGCAGCTT GCATTGAGAA ATCTAGTTTA CTTCGGACGA GGGTTCTCTC AGATTTTGGA CAGAATGAGC GGCCGTGCCG TGGAAGGACG CGACGTCAAT TCGATTCGCG AGGATGACGA TGAACATAGC CAGTGTTTTG CCCTAGATCT GCGCAATCAA TCCAGAGGAC CAATGGCCAT GGACCAATGC ACCGAAACGA GCAACAACTG CGTAAGCGAG CCTCCCCTAT CTCAAGCCGA AATGTCGGGA AGTGCGCCTT TCGCCAATCA ATTGTCCTGT CTGAGCAAGC AAGCAAGCCC AATGAGGTCC GAAGGGTGGA ATGAGCTCCA GCCCGCCGGA GAGGAGGCCG CAATGGAAAG GCGACTGCAA AGAACGAAGG TGGAAAAGCC ATCAGAACCT TTTGAGTTCA GGGATACCAA TCAATATCCT GCACAAAAAA TTATCTCGCC AATCGATTCC AAGCAAGCAA TCGTGGTGGA CAAGGGCAGT TTGGCTCAGG GAAGACCCGG TACATCGGAC ACAAAAACAA TATCATTTCC TCCAAGAGAG AGTGTCGATA AATCAAACCG CTTGCTGCAC GTTGATGTAA GCTGCAAACG AGAGACTATA CAGACTCAAT CCAACGAAGA TTCGGAAAGT GAAGGAGCCT CCGCAGATAT TGGCGACTAT TCCGAATTTA GCTATAGTGA CGAAGAAAGG CCTTTGACGG TTCAAGAAAA CCAGCTTCTC TACGGATCAA CAGGTGAATA CAGTTTATAC AGCTTGGAAC AGATGGAAAA CCGTTTCGAG GATCGCGATT TCCATGGTGC TCACCAAAAA GTAGTTCGAC ACAGCCATAC ACATTCGCCG TTCGCCAACT TGGGCAAGAC CTCAACTCTC AAGGCCGCGC GCGCCACCTT CTTACCTCAG ACTATTGTTC TGGAAACTGA CGCTAAAAAC TATCCCACTT ATCTGTGTCC AATATGTGGC ACACGACAAA GGGAGTTCTT CAGTGTTTCC GACGCACCGC GACAGCTTGA AGGTCCCTCC GGCTATTTGG CTCTGTACTT TTCAATTTAT GTCATCTCGT CGCTTTTCAT CTTTGGTTTG GAAGAGGGAT GGCGACCATT GGATTGCATT TATTTTGCAG TTGTTACACT CACAACCGCT GGACTGGGAG ATTTCGTCCC CACGTCAGAT GTGAACAAAA TTATTTGTTC TATCTTCATT TACTTCGGTG TGGCCTGCAT TGGATTATTG CTCGGATCTT ACATCGCTGG CATGCTAGAT GATAGTGCAT CACGGGAAGC CAAAAGAAAT CAGCTGAGTT CGTGTCCAAA TTGTGCCCGT ATAAAGACTC TCCAGGATGC AACGTCCAAT ACGCCAGAGC ATACGGTGCC TGACACAACA CCAGCTATTC CTAGGCGAAG TAATCGTCGA AGTTGTGCAT CCGAGCGTAT ACTTGACAAA GCTCAAAACC AGGATCACAG TGCTGTGTAT ACGTCACATC ACTGCCATCG AACGCACACA GGTCAGCGGC AAAGCTCGTT TGAAAGTCAC TCATCCCCTT TCCACAACAA AGGGGTCGAG GTTACAACTT CTTCTTCGCA AGGTGCATCG ACGGTTGAAA ATATAGCGTC TTTGGGTGAG TCGCAGGTTT CAAAAGGTAG CGAGGCAGGA GCAGCATCTA GGAAACTAGG AAACATGCGC TCCGCAGCTA TTCTCCACTA TCTTCCTCAT CCCCCCCCCC CCCCCCCCAA 
CAAATCTTTT AGGATCGCCA GTCACAAGAG GGATCCTTGG CCGACAAAGG CATACCCGTC ACGATTCGTT TGACATAACA GGAAATGCCA GAATGTACAG CGCAGCAGCT GGAATGGGAC GAACACGCAA ATTTAGCGAA GATGTAGGAG TAATGATGAT TCCATCGATC AGACAAGCTC CCCCCACTAT TCAGGAAGGT GCCCAGCTCG AGACTCCGCC GTTGGGTACT GATGCAAACG GGTGTATGGA GCCACCATAC ACAAGGTCTC GCGGGTTCAC TTCTGAATCG GATAGTTCCA AAGAAGATGA CATGAGCGAT AGCGACGACG ACGACTCTTT CGCAGCCTCC ACTCATACCT CGTCAGGTTC TTCCGAAGCT GTAGACGATG GAATGTTCAA ATTGCAAGTG GCCAAGTATG TGTTTCTGAC GTTGAAGCAG GCGCTGGTCA ATTCAATGGT CATCATAGGG GTTGGCTGCC TCGGGTTTCG CTTTATCGAG GGCTTTTCAC TCGTAGACAG CTGGTATTTT ACAACGGTTT TTCTCACAAC CGTCGGATAT GGTGGGTGCT GA
Protein sequence | MSGRAVEGRD VNSIREDDDE HSQCFALDLR NQSRGPMAMD QCTETSNNCV SEPPLSQAEM SGSAPFANQL SCLSKQASPM RSEGWNELQP AGEEAAMERR LQRTKVEKPS EPFEFRDTNQ YPAQKIISPI DSKQAIVVDK GSLAQGRPGT SDTKTISFPP RESVDKSNRL LHVDVSCKRE TIQTQSNEDS ESEGASADIG DYSEFSYSDE ERPLTVQENQ LLYGSTGEYS LYSLEQMENR FEDRDFHGAH QKVVRHSHTH SPFANLGKTS TLKAARATFL PQTIVLETDA KNYPTYLCPI CGTRQREFFS VSDAPRQLEG PSGYLALYFS IYVISSLFIF GLEEGWRPLD CIYFAVVTLT TAGLGDFVPT SDVNKIICSI FIYFGVACIG LLLGSYIAGM LDDSASREAK RNQLSSCPNC ARIKTLQDAT SNTPEHTVPD TTPAIPRRSN RRSCASERIL DKAQNQDHSA VYTSHHCHRT HTGQRQSSFE SHSSPFHNKG VEVTTSSSQG ASTVENIASL GESQLFSTIF LIPPPPPPTN LLGSPVTRGI LGRQRHTRHD SFDITGNARM YSAAAGMGRT RKFSEDVGVM MIPSIRQAPP TIQEGAQLET PPLGTDANGC MEPPYTRSRG FTSESDSSKE DDMSDSDDDD SFAASTHTSS GSSEAVDDGM FKLQVAKYVF LTLKQALVNS MVIIGVGCLG FRFIEGFSLV DSWYFTTVFL TTVGYGGC