Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_33106 |
Symbol | |
ID | 7204245 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 58421 |
End bp | 60394 |
Gene Length | 1974 bp |
Protein Length | 657 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186270 |
Protein GI | 219113373 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTTCC TCCCGGCGTG CGCTCGTATT CTCCTCCTCA TATGGCTAAC GGGCACAGTA ATGTCGGTAC AATCGTGGCT TGGATCGTCA GCTCAAGAAA GTACCGACTC GATTGTGTGT CGAATCACAC TTTCCGCTAC ACTGCTTGCC CTTCCCACCA TCGGCAAGCC AGCTGTGTCT ACGAATACGG TCGCGTGTAT TCCTATTGTT GACAACCGAG AAACCGCCGA TCTCTTTTCC ATAGATCTTC CACTGCACTT CTGGGAACAA CACGCGGTCG CTGCTGCCAA CGGCACTTTG TTGGTATCCA TCGAGGGTGC CTCGATTACC CGGAAAGGCA TCGTGGCAAC GGCCCAAGCT ACCTTTCAGG TCTTGCCCGA GCTTCCCAAT AGTTCACGAC ACTTGTCGTT GGATGATGAC CCCAACCATT ATTGGACGAC GGGAATCAAA ACTATAGCTG TAGTTCGAAT ATCCACGCGC GATGCCGAGC CCACCTACTC GACCGCTGAT ATGGAATGGG GGATCTTCGG GGATGGTCTG GAGAACGATG GGGTTACCAT GCCCACACAA TACAATGCCT GTTCTTTTGG AAAACTCAGA TTCATCCGGA GTGTTTACGG CGTTGTGGAT TTGAAACTTG ACCGAACGCT TGGAAGCTTC GAGTCGGTGG ACTCGATTTT TCAGTCCGCG CAAAAGCAGC TCGTGGAAGA ACACAATTTG GACAGTATCA CAGACCTGGG AGACAAAATT CTCTTTTGTC TCCCTCCCGG AACCGGATCT TGGATAGCTG TTGCCGGTGT TCGTCACTGG CGAGCCTTGT TCAACGATCA ATGGTGCCTC AGCCTGAGCG CTCTCATGCA CGAGGTTGGG CACACTGTGG GGTTGATGCA TTCCAACGAA GCTGGTCAAA TCTATGGGGA TCAAACGGGG TACATGGGAT TTGGACGATT AGCCGTCAAC ACGCCCCGCC AATGCTTCAA CGGTCACAAG AACGACGTGC TAGGATGGTA CAAAGACCGA GTCGTAGCGG TGGATCCGAA AGAGGATGGA GGCGGGGCCA GATTGTACAA AATCGCCGCT TTTGTCGACT ACGACAAGGC TGCATCCAAC GAGCCGGTGA TCCTAGATGT TGGAGGACAA ATCTTTTTAC AATATAACCG TGCCAAGGGT TTCAATGTCG GTACCGAAGA GAAGGGAAAC ACCTTGACAA TCACCGAGTA CGATGGTACC GACAGTTCCA CAAACCTAGG GGGCCTCAAG GTCGGTGAAG AATTCGGCGA ATACAACTTT CAACACTCGG GGCATGTGTT GATGATTCAG GTTTGCGAAA CGATAATTGG AGACAGCAAT TCTCCCGATG CTATGATTGT GAGTGTGGGC CTCGATCAAT CTCTCTGCCG AAACGCTCAA ATGGCCTCGC CTCAGGCCGA CATACAAGGC ATGCCTTTTC CACTCACAGC TTCCCCCACA GTCTCGCCAA CGGTGGCGAC CAACTATCCG ACGATGACTC CCACTCTTAG CCCTACCTCT CCACCCGTGT TGAATCCTAC CCGCCGCCCT ACAACCAGTC CGACGCACTC ATCTATGTTT ACAGAAACCT CACATTCGAC CAACAGTCCA ACTTTTCGAC CAAGCGAGGA TACTGCTGAA GCCATTCGCA GCATTGAAGG TCTTCCCTTT ACGAGAACAC CAACATTGGC GCCTTCTGCG CTACCTGTTG CAGCTCCCGG TCTTCCGTTC TCCGACGATG AACAAGCAGA ACCGAATGAG GCGATCACTG CACCACCCTC CCAGCGTCGT CCACTACGAA ACTACGCGCC TCGCGTAGAG TCCTTTCCGA GTGCGGCGAT CACCCAGCCG CCCTCCCAGC GTCGTCCATT GCGAAACTAC GCACCTCGTG TAGAGTCCTT GCCTTTAAAA AAGGAAAAGA CTACAGAGTC GCGCATGCAC GATCGTTTGA GACTTTCTGA CTAG
|
Protein sequence | MKFLPACARI LLLIWLTGTV MSVQSWLGSS AQESTDSIVC RITLSATLLA LPTIGKPAVS TNTVACIPIV DNRETADLFS IDLPLHFWEQ HAVAAANGTL LVSIEGASIT RKGIVATAQA TFQVLPELPN SSRHLSLDDD PNHYWTTGIK TIAVVRISTR DAEPTYSTAD MEWGIFGDGL ENDGVTMPTQ YNACSFGKLR FIRSVYGVVD LKLDRTLGSF ESVDSIFQSA QKQLVEEHNL DSITDLGDKI LFCLPPGTGS WIAVAGVRHW RALFNDQWCL SLSALMHEVG HTVGLMHSNE AGQIYGDQTG YMGFGRLAVN TPRQCFNGHK NDVLGWYKDR VVAVDPKEDG GGARLYKIAA FVDYDKAASN EPVILDVGGQ IFLQYNRAKG FNVGTEEKGN TLTITEYDGT DSSTNLGGLK VGEEFGEYNF QHSGHVLMIQ VCETIIGDSN SPDAMIVSVG LDQSLCRNAQ MASPQADIQG MPFPLTASPT VSPTVATNYP TMTPTLSPTS PPVLNPTRRP TTSPTHSSMF TETSHSTNSP TFRPSEDTAE AIRSIEGLPF TRTPTLAPSA LPVAAPGLPF SDDEQAEPNE AITAPPSQRR PLRNYAPRVE SFPSAAITQP PSQRRPLRNY APRVESLPLK KEKTTESRMH DRLRLSD
|
| |