Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49574 |
Symbol | |
ID | 7198191 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | - |
Start bp | 86823 |
End bp | 88640 |
Gene Length | 1818 bp |
Protein Length | 555 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184390 |
Protein GI | 219128375 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.251941 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACGAACCGA GGATGCTTTC TACTACCTTT CCTCGACCAA CATCATCCAT TTTACATTCC CGTACGAGTT CATTATCAAA GTCGGAAAAA TACTTCTCAC ATTAGCAACC GAAATACGTT ATGCCGATGG AACCGGCGTT ACCATTACGC CGAGGCTTTG TCAAAAGCTG TTTCTCTGTT GCCAGAGGGG TTGCGATAGC AGCAGTATGT TTGCTAGTCC TACGCAATTT GCAGATGCAA CAGAAAAGTC TGCTGATGCA ACAAGAGACC TTCAGCCATT CTTCGTCTTG GTTGGAAACC TCCTGGTGGG ATTGGGAACG ATCAAGACTT TCAGAACCCA AAGTCCCGAG GGCTGATTCT TTGATTCATC GGCAGCTTCT CAATCGCTCT GTGTCTTTCT TGACGGCCAA ATCCATGTCC ACTACGGTAC AACCTGTCGG ACATTATTCA GGAATGTGGT GGATGCCCCG GGACGCGTTC GTTAATAAGC TTCAAGCATA TCGTATACAA AATCTAGAAG GAGGTTCATG GCAGGCCTTC GCCGCGGTGG GAATGGACTC GGTTATTTTG ACCTTTGACT GGCTGGACTT TTGCGTCGAG CACTTGTCTC GCTATTTCAA AATGATCAAC TACAACAAGT TCAACCACGT CTTTGCCAAA CTCGTGGACC TGCACAAATC CTACATGACT TCAGAGCTTG GGGAGGACGC TAGCGGTCGC TTTGACGTAC AGGATACGGG CACGGCCATG CGAGAAACCA TTGCCCTGTT CCCACTGTAC ATTCCCAGTG AACCGGTTTT GCTCACGGGT ATTTCCAGGC CTGAATACAA TACCTCGCTT TTGTATACAA TTCCCGCAGG TGCTGCCAGA CGCAACGCAC TGGACATGTA TTCCATGGCG GCCACTTTGT TGTCGTTGTG GCGCGTGGGT GTGGGACGAG TAGTGGTGGC GGGAAATATG GCCTGTGGTG AAGATGACAC TGACGTCAAC AGCGTTTACC ACGAAGCCCT GGATCTTTTC TTGTCGAGCA TACCGTCCTC TGGACGTGAC GCCATGGAAA TCAACTACGT CTGCGCAGTC GACGATTTCG ACCGCCAGGA GGCCAAGAAC ACCTCCAAAT CGTTACTGAT GCCGCGCCTT GTTATTGACA AACTTCAACG AGCCTTTCGT GGCAATCTGA CGACAACACA AAACCATGCC TGGCTTGGAG ACGACAGCAA TCGGTGGAAA TACGTGTATT TTTCCGAGCC CGATCTGATT CTGCACACCC GGCCCCACGC AGTACGAGAA CTCGGGGTGC AGCTCGAGCA AGGCAAACTC ATTGCCGCCC ACCGCTTCCA GCCTATATCC CATGCTGTGG ATTTTCCAAA TTATCCTCGA TCACAGGACT TGATACCGGC CGACGCGGAC GACCCAGCCA CAACCGCGTT TATCAACCTA GATCCATCGG CCGGAGACTC GTGCTGCGAT GCTGGTAACT ATTGGCCGGG AAGAACCGAA CACAAGAAGT GCAGCTACAT GTGGTTGTAC TGTGGCTATT TGGCAAACGA CGCAGATCCA GATGTAAACC AAAGCGTATC TTGGAAACGG CACGAACGAC TGTGGCGACA CTTTCCTTTG GTGAGTTTCA CGAGCGGCTT TCGCTCTCCC ATGGTAAGCG AACATGCACG CATCTGCCGG CCGCAGCCGG CTTCGGCTGG AGGATGCATG CATGCTTAAA GACTTTGTCT TGATTGGACT ATCGAATTCA CTTTACTCGA GCTGGACTGT TAATCTCCCA CTTTTGCATA AGCCTACCTA GCTTTTGTGT ATCTTTCT
|
Protein sequence | MLSTTFPRPT SSILHSRKQP KYVMPMEPAL PLRRGFVKSC FSVARGVAIA AVCLLVLRNL QMQQKSLLMQ QETFSHSSSW LETSWWDWER SRLSEPKVPR ADSLIHRQLL NRSVSFLTAK SMSTTVQPVG HYSGMWWMPR DAFVNKLQAY RIQNLEGGSW QAFAAVGMDS VILTFDWLDF CVEHLSRYFK MINYNKFNHV FAKLVDLHKS YMTSELGEDA SGRFDVQDTG TAMRETIALF PLYIPSEPVL LTGISRPEYN TSLLYTIPAG AARRNALDMY SMAATLLSLW RVGVGRVVVA GNMACGEDDT DVNSVYHEAL DLFLSSIPSS GRDAMEINYV CAVDDFDRQE AKNTSKSLLM PRLVIDKLQR AFRGNLTTTQ NHAWLGDDSN RWKYVYFSEP DLILHTRPHA VRELGVQLEQ GKLIAAHRFQ PISHAVDFPN YPRSQDLIPA DADDPATTAF INLDPSAGDS CCDAGNYWPG RTEHKKCSYM WLYCGYLAND ADPDVNQSVS WKRHERLWRH FPLVSFTSGF RSPMVSEHAR ICRPQPASAG GCMHA
|
| |