Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48969 |
Symbol | |
ID | 7195247 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 190622 |
End bp | 192112 |
Gene Length | 1491 bp |
Protein Length | 470 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183567 |
Protein GI | 219126655 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.731324 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGCAAGCATG AGAAAGCTAA AGAGGCGCAA TGCTCCGTCC AATGATGAAG ATGATCGATT GCCCTTGTAC GATTCGTCCA CTGCCACAAG CTCCAACGGA CCGTCGTCTC GAGCCAAGCA GCGCAAACGA AGATCGACCG GACCTTGTTG GCCATGGGCC AGACTAGTAA TGGTATCCGG TATTGTTCTG ACAGTCTACT GTGGTTGGAT CTGGTGGAAA GCACCCGATC ATAAACCTCC CATTCCCCCC ATTCTACATC GTGCGTTTCC CTACAAACGT GATTGGTACT TGCCACGGAT ACGAGACGAT GTCAAACTGG AAGAGTGGGA TGGCCCACAG CTAATACACG TGGTTCACAC ACGGTTCATG CAAGAACAGC CCAGCTTGAC GTCTTTGGGA CGTGCACGAC TGGGGCTCTT TCGCGTCTTT TGTCTACCTA CGATGATTGA GCAAACCACC AACCATTTTT TGTGGATTAT TAAGACGGAT CCCGACCTCG ATGCCGAAAT TATGCAAGTG CTGGTGGATT TGGTCTCTCC TTATCCCAAC TTTTTCCTCG TAGCATCCAA CGTCAATTTT CGTATCAACG AAGATTTTCC TGGCGCTTGG CGGGATGGTG CCCAGGCCAG GGACTTGGCC CTGTCTCGAA CTTATACGGG CAATCAAACG CTTCTCGAAG TCGCCATGGC GTTGGAAGCC CAGCTACCCA TACTCGAAAC CCGGCTCGAT GCCGATGACG GATTACACGT TGAATTTCTG GAACAAATGC AGTACCAGGC GACCAAGGCC TTTCGACAAA GTGCACTTAA ATGGATGTAC TGGTGTACAC GACGGCATAT GGAATGGCAT TGGATAGACG AAGTACCGTC CTCCTTCGAA CACGACTCGC CACTTGGCCA AAAGATTGTG GAATACGGGG CTTTGCAAGG TGTGCAACAT TCCAACCTCT GTATTACAGC CGGATATACA GTGGGCTTTC CAGTCGGGGT GTCTGAACCA GACGTACCCG TGTATCCTCA TCAAGATTTG GTGTCCATGA TTCGAAAACT ACCATCGGAA AAGGCTTGTG GATTGAAACC GAGTGAAAAA TGTTTGCAGT TTGTTGAAGA ACACATTTTT GAAGCAGTTC GATCCCGAAC GCACACCTCG GCCGGGATGC TGAAGGTGCG ATTAGAGCAA GACGGCCTGG TGAATACTCC TTGGTTGTCC TACGCGTACT GGGATCTGTT ATGCAAGAGT TTTGAAATTC AGCGAATGCA AGTGCGGTGG ATGAACGAAT ATCTGACATC CCACATTATC GACATTGCCC GAGACAATCT TCTGGGACAG TGCACTCTGG GTCACAGTTG CAAGGTATGT GAGAGAATTA ATCCTTAAAC CTTGTATTGG TAAATTTGAC AAACCTGACT TCTTTCTCTT CTCAGGACTC GGCCAAGGAA GAGCTGGCAA AGGTGATTGA AAAGTACAGA AACCGAACAA CTTCGGGCTA G
|
Protein sequence | MRKLKRRNAP SNDEDDRLPL YDSSTATSSN GPSSRAKQRK RRSTGPCWPW ARLVMVSGIV LTVYCGWIWW KAPDHKPPIP PILHRAFPYK RDWYLPRIRD DVKLEEWDGP QLIHVVHTRF MQEQPSLTSL GRARLGLFRV FCLPTMIEQT TNHFLWIIKT DPDLDAEIMQ VLVDLVSPYP NFFLVASNVN FRINEDFPGA WRDGAQARDL ALSRTYTGNQ TLLEVAMALE AQLPILETRL DADDGLHVEF LEQMQYQATK AFRQSALKWM YWCTRRHMEW HWIDEVPSSF EHDSPLGQKI VEYGALQGVQ HSNLCITAGY TVGFPVGVSE PDVPVYPHQD LVSMIRKLPS EKACGLKPSE KCLQFVEEHI FEAVRSRTHT SAGMLKVRLE QDGLVNTPWL SYAYWDLLCK SFEIQRMQVR WMNEYLTSHI IDIARDNLLG QCTLGHSCKD SAKEELAKVI EKYRNRTTSG
|
| |