Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44555 |
Symbol | |
ID | 7197800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 892640 |
End bp | 894976 |
Gene Length | 2337 bp |
Protein Length | 743 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178322 |
Protein GI | 219115053 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.966071 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AATGTTTCAC TGGACATCCC ACTGATCTCA TTTTCTGCCT GAGGAAGTGT CCGATAGAAG GGCCCATTCA CCATGGCGGC ACAGCCAGCC GTAAGCCAAG GATCTACTCT TTTTGACTCA GACAACGAGG ATGGACGGGT TTTCAAGGTA AACACGAAGT ACGCCAAAGA GTACGAGTCC CGGAAACAAA AAGAGGAGTT GAGGCGAGTG CAGTTTCAAG GTGATGATGA CGATGGCGAG AGTACTGACT CTTCTGAAGA CGAAGACGCA GAACTCCTTA CTGCCAAGCT GGACACGAAT ATAATAAAGA CGATTAATAT ACTGCGAAGC AAAGACTCTC GTATTTACGA CCCTTCCGTA AATTTCTTCG ACAACGCAGA AGACAGTGAA GAGGACTCAC TGCCTACGAA AGAGTCAAAG TCGAAACCGA AGCGCTACAA GGATGTTGTC AGAGAACAGA TATTGAAACA AATGGACGAA GATGTGCCAA TTGGAGAAGA AAACGATAAT GATTACGGGG TTCTGGCATC CGACGAAGCG AAATCTCGAC TCGCCTACGA CAATCGCCAG CAAGAACTAC GAAAAGCCTT CGTCGACTCC ACGACCGGCA AAAAGGGCTA TGGCATCGAT GACGACGGAG AAGATAGCGA CGATGATTCG GATACCTTCT TAGTTGTTAA GAAAGTTAGC AAAGGAATTA CGGAAGACGA AGATACCGCA GAAGCCCGTC AGGAATTTTT ACAAGAGATG GAAAAGCTTG AAAAGACGGC GCGCGACGAC AACCGCGATT TCGTCGACCC AAAAGGCGAA GTAAAAGACG GCGAACGCTT TTTGCTTGAT TTTTTCAAGC GACGTAATTG GCTAGAGAGA GATAATGGAG ATAGCGGGCC GGACGGAAAT AGCATAAAGG GCATTCAACC CCCCAGACCG ATTGCGGGCG ATGGAAACGA ATCAGAGAAC TCACTGGAAC AGCTGCACAA GACGGACGAC TTTGAAGCGC AGTATAACTT CAGATTTGAA GAAGCTGCAG CGAAATCACA GTCAGGTGCG GATTTCTCAA TCATCGGTTA CGCCCGCGGG CAAACAATGA ATACGCTCCG TCGTAAAGAC GAAAGTCGTA GAGATAAAAG ATTGAGTCGT AAAGATCGCA AAGTAGCTGA TCGAACAGCC AAAGAAGAGC AGCTGAAGCG CCTGAAAAAT GCCAAGAGAC AAGAAATGGA AGGAAAATTG AAGCAAGTTA AGTCAGTTCT TGGTGAGGTT GAAAATCGTG GCGAAGCAGT GGACGAGGCT GCAATTCTGA AATTGCTCGA GGGTGATTTT GATCCTGAAG AATTTGAGGT ACTGATGGAG AAAACGTACG GCGAAGATTT TTACGGAAAA GAAGATTCGG AATGGCAGAA TGATAAGGAC GTGCGGGAGT CCTTAAAGCA CGATGAGGAC GGCGACCTTC TCGTTGGCGA GGGTGACTCC GACGGTGGCT TATACGATAA CGTCGAAGAG GATACCGAAA GCTACAAAGA TGGTCACGAA ACGCCGGCCG ATGAAAACGA CGAAGAAGGA TGGCCAGAGG AAGAAGAAGT TAGAGAAGAA ATAGAAGAGA CAGAGCTGGA AAGAACTGTG AAATTAAAAG TGGAAAACGA GCTATACAAA CTGGACTACG AAGACATTGT TGCAGATATT CCCACTCGAT TTAAGTACCG TCAGGTTGAA GCCAATAATT TTGGGCTCTC TACGGAAGAG ATTCTGCTAG CCCGGGATAC CACGTTGAAG CAGTTTGTTT CTCTGAAAAA ACTGGCGCCT TATAACGAAG CCGGTGAGCA TTTTGTGGGC AGTAGGAAAC GGAGACGATT CCGCGATTTG CTCAAGCAAG AGTTGGAAGA GACCGTAAAA AGCTCCAAAG CTGTGGCGGA AGAAGGCGCC GACGAACCGG CAATGGAGGA TCGAACGCAA ACGAAAAAAC GTCGCCGCCT GAAGAAAGGA AAGAAGGCTG AAAACGTACC TGGAGATGCT ACCGGTACGG CCAATTCTGA CATTTTGGAA AGATCGGAGG AGACCGACGA GGGGCCGAAA ACGAAGCGGC GGCGAAGAAA GAAGCTGAAA AAAGAAGAAC TAACCGATGG CAATCTCGAG AAGACGGAGA AAAAGACGAA CAAGCATAGT ATCAAGGCAA GGCTTGAGTC CGAAGCTCAA GAAGAAAACA AGGTTGATAA ACGAAATCAC CACACCAAGA AGCCAAGGCA CAAGAAAAAA AAGTCCGGGA TTGAAGGAGT ATCGCATTCT CGACTTGAAT CGTATGGGCT ATAGATTTGT AAATCTAGAC GACCTGTCTT TACAGTT
|
Protein sequence | MAAQPAVSQG STLFDSDNED GRVFKVNTKY AKEYESRKQK EELRRVQFQG DDDDGESTDS SEDEDAELLT AKLDTNIIKT INILRSKDSR IYDPSVNFFD NAEDSEEDSL PTKESKSKPK RYKDVVREQI LKQMDEDVPI GEENDNDYGV LASDEAKSRL AYDNRQQELR KAFVDSTTGK KGYGIDDDGE DSDDDSDTFL VVKKVSKGIT EDEDTAEARQ EFLQEMEKLE KTARDDNRDF VDPKGEVKDG ERFLLDFFKR RNWLERDNGD SGPDGNSIKG IQPPRPIAGD GNESENSLEQ LHKTDDFEAQ YNFRFEEAAA KSQSGADFSI IGYARGQTMN TLRRKDESRR DKRLSRKDRK VADRTAKEEQ LKRLKNAKRQ EMEGKLKQVK SVLGEVENRG EAVDEAAILK LLEGDFDPEE FEVLMEKTYG EDFYGKEDSE WQNDKDVRES LKHDEDGDLL VGEGDSDGGL YDNVEEDTES YKDGHETPAD ENDEEGWPEE EEVREEIEET ELERTVKLKV ENELYKLDYE DIVADIPTRF KYRQVEANNF GLSTEEILLA RDTTLKQFVS LKKLAPYNEA GEHFVGSRKR RRFRDLLKQE LEETVKSSKA VAEEGADEPA MEDRTQTKKR RRLKKGKKAE NVPGDATGTA NSDILERSEE TDEGPKTKRR RRKKLKKEEL TDGNLEKTEK KTNKHSIKAR LESEAQEENK VDKRNHHTKK PRHKKKKSGI EGVSHSRLES YGL
|
| |