Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48502 |
Symbol | |
ID | 7194695 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | - |
Start bp | 36004 |
End bp | 39622 |
Gene Length | 3619 bp |
Protein Length | 707 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183147 |
Protein GI | 219125772 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GATTTGGAAT TGCGTCTCCA TTCGTTTACA GTTAATATTA CACTCCAGTC TAGTAGCGCC CTACAGACGA GCCAACGGGA TGCCAACGGA AAGAAACCTT CTCGGTCGGC GAATCGCAGA ATCTCGACTC CATGAAGATG GGACAAACCA TGGTGATGGC AAAGTAGTGG TTTACTGTTA TTCTGAGCAT CCTGGAAGAA GACTTCACCG TAAGCGACGT AGCTAGACAC GTTTAACCAA TCAAGTAGAT TTAGTGTAAA CAAACGACTC TTGTTCCACC ATGGGCAAAG GAAAAGCCAA AGCGTCGAAA AGTGCCAACC GGAAAGGCAG CAAGAAACCA GCTTCCACAC CCTCCACGGC ACGTCACACG CACGCTAAAC CGGGGATTGG TAGTCTCTTT GGTCCCATAC GCGTGCGAAG CAAACGAAAA GGCGAGACTC TCGAGTTGAA CTTTCCCAGC AAAAAAACGC AACATCTACT GAAAGCTGTT CATCCAGGAA AGCTGTCGTG GAAGATAGGA AAAAAATCCA AGAATGAAGA CAGCTACAAG AAGAAGGCAG CTCGACCAAC ATTTGTACTG AATCTTCCCG AAAGTATTGG GCACCCCCAG TCTGTACCAT GGATGAAACT TAGCAGGACA AGGAACCGCA GCTCGCCAGG GAGTGAAGGT CTCTCCAAAT GCAGCCGAGG TCCGGGCCAC ATCATCCCAT CGGACGCTCT TCCCGAAGAA ACCAAGCTAA AGCTCGATAA GGAATTGGAA TCTTTTGCCA ACTACGTACG ACTCACAGAT GAAGAATGTC AAATTCGGGA CTATCTCGTG GAGCAAGTCG AGCTGATATG TCGAGACTTA TTCGAGGAAT CGCGACATAC GCTCTTCAAT CGGAACAGCG AAGCAATGCA AGAAGCGGTT CGGGTTCAGG TGTTTGGTTC GTTTGGAACC AAAGCCGTTT GCAGTTTTCG AAGTGACGTT GATTTGGCCA TATGGGGCGT GGTACCTGTC CAGAAAAGAC GGCCTGCAAT CATGTCCAAG AAAAACAGCC AAGTCACTCA TTCGGAAAAG GTGGACTGTG CCAAGCAAGA GCGGAAACGA AAATGGCAGG AAGCGCTGGC GGCTGTCGAC GAAGCTAATA CCATGTCTCC GGACCCGATG ACGAGGCAAA ACCACTCACT TTCGAACGAG GATGGACCAC CATTGAGTTT CGCGCCTGAC AATGAACCGG AGCGAGCGGG AGTGATCGAT CAAGAGGAGT CCCTGTTTGT CATTGACCGT TTCGGCGACG TCCGCAGTAT CGACAGACTA AACTTACCCA AAACCTCATC CATCAATTCC TCTAGCTCCG TGAGTTTATC GCAAACGATC GGTGACAACA GCGGAGTGAT CAAATCACAG GGTCATTTTT TGCAAAGTCA AAAAGCCGAG AGCCATTTCA AAGCCAATTC CGAAAACCCC ATCGACGACT TAAACGACAA AGCTGCCAAA CCTAAAAGGG CCAAGCCCAA ACCTGTTGAG TCTGATGATG AATCTTTTAA CGATAGTACT GATAAAATGG AAGCTTTCGT CCCAAACCGT GCTAAAAGTG AGGATCGGCA CTTTGTGAGC TATTCGTCTG ACGATGATTC CCACGACGGC TTCGGCGAGG AAGTGAGCGA AGGAGCAGGA AAGAGCGACC GTGACCTTGA GGTGTCGTTT TTCTCGCAAG ATAGCACATC AAAAAGACTG GGACCGACCG GCTTGGCTCG GCAGCATGTT ACTACTTGTT TGGGCCTCAT TCGGAGGAAG CTTCAAAAAC GGAAATTCGC GAGGTCGACT CTTTTCATCA AGACCGCGTA CGTTGTTTTT GGCCTCCCAA AAAGACATAT GTTACACTTC CATGGACAAC TCTTACCTCA AGTCTCCTGT CTCCGTATTA CAGAAGGGTA CCCATTGTAA AAATGACGAC CTCTCTTGGT TTTGAAAGCG ATATTGCTAT TGGTGGCCAC AATGGATCGG ATACCTCGCC TTACGCCGGT AGTCAAACTG AGAAATATCG AAGGTCGGTT TGGGTCGGAA TATGCGTGGG CATGAAATTT ATATCCTTCT AACTCTACCT TTTGTACTGT TGCAAAGCTT TGCGCCGGTT GTGCTGGCTT TAAAAGTTGT TTTGCAGCAA ACCAATCTCG ACGAACCATT CGCGGGAGGA CTGGGAAGCT ACAAGTTGTA TGTGCTAGTA GCATACCACA TCGAGCAGCA TCTTTTATTA GGTGGCAATG ACCGACCGAG TGAGATCTTC TTAGGATTTC TGTTTCGCTA CGGTGCAATT TTAGGTTACA ATTCACTAGA TGGTACGATG ACGCACTTGC AAAAGAACGT GCCGGTTGCA ACTTTTGATG CTTCCATAGC TGATTTAAGC AACGTTTTCC TCTTGGAGCA CTGCGTTGAT CTGTTCGGTC GGTGCTGGCG CCGTCTTTGG AAGCGGACAC GCTCGTCGTC GAAAAATATC GGATCCTTTC TCGCCGACAT TGTGGACGTC AAAGCCTTGG CGAAGGAACG ACAGTCGCAT ATACAGCGAG CAAAGGCAAC TCTTTGCCAT GAACTTGCAA AGAACAGTAA CTCTTTCCAT AAAGCACCGG TACGGAACTT CGTCGCACAG ACATCCAGCA CAAGAGCACA TAATTTGTCC AGTTCAGCTA ATGATGTTCG CCATCCCGCG GAGCTCACTA AAGCTGCGTC TTTGCCACGA GAAGCAACAG CTGCTGAACT ATTGAAAGGT TACAATGTAC AGATTGATCA AGAGCTTCCG ACACGCCGCG AGTGAGTTTT ATTTTTGGAG CTTATGACGT CTATTTTGCC ACTAGCTGCT TTCGAAGAAG CCCCACATAG ATTTGTTGTG AACGCTCCTT CGGGCTTTGC TGTCTTCCGG CTTTTGAATC CGCGCAAAAT GTTGCCCCAC CCTGTAGTAT CTCTTCTTTC CGTCAAGGTC AACGAAGAAA TGGTAGCGTC CTTGGTCGAC TTTGCGGCTT TGCTTTTGAA GCGCATCTTT TCCAATTTTG TCACCTCGTC CCTTCGCCCG AGTTTGGTCT CCGTACCGCA AAACACGATT CCAAAGTTCA CCCGTCCAGG TCGGCCACAT AATCCCAACG AGGACGGCGC CACTGCTTAC TAGCAAAGGG GATATGATGA AAAGAGCTCC GATGAGGTTG GTCCCAAGTA AAGCCACAAA GACAGCTGTT ACCCTGTTGG TATAGTCGGC AACTCTCGAT TCCGATAAGT CCATCCGAAG CCCCTCCAGG AACTCATGCA GTTCCTTGTT GTCCCTCCCA GTAACCGAGT TCAAAATTGA TCGAAGAGAT CTACCAATCC AATAAGCCGC GTCATACACA ATTTGTACAA ACCTGTAGCG GGATTGGCGC CGTCTGCTGC GTAGAGTCTT TTTCAGGCTT GATTCCTCTA CGAGCCAGAA TGTACGAAGG GCTGCTAGGG CCTTGCGCCC GATCTCATTT TCCTTTTCCC AACGATCGAA GGCGATCTTT CCTTCTACAA ATCGAGCGTT CCATGCATCA ACTTTCGCTT GGATGGCGAA GCGCTGGTCT AGTGATTCGT ACTGTTTGTA GTAATCGTAG GAGAGTTGCC CGGTCTTGT
|
Protein sequence | MGKGKAKASK SANRKGSKKP ASTPSTARHT HAKPGIGSLF GPIRVRSKRK GETLELNFPS KKTQHLLKAV HPGKLSWKIG KKSKNEDSYK KKAARPTFVL NLPESIGHPQ SVPWMKLSRT RNRSSPGSEG LSKCSRGPGH IIPSDALPEE TKLKLDKELE SFANYVRLTD EECQIRDYLV EQVELICRDL FEESRHTLFN RNSEAMQEAV RVQVFGSFGT KAVCSFRSDV DLAIWGVVPV QKRRPAIMSK KNSQVTHSEK VDCAKQERKR KWQEALAAVD EANTMSPDPM TRQNHSLSNE DGPPLSFAPD NEPERAGVID QEESLFVIDR FGDVRSIDRL NLPKTSSINS SSSVSLSQTI GDNSGVIKSQ GHFLQSQKAE SHFKANSENP IDDLNDKAAK PKRAKPKPVE SDDESFNDST DKMEAFVPNR AKSEDRHFVS YSSDDDSHDG FGEEVSEGAG KSDRDLEVSF FSQDSTSKRL GPTGLARQHV TTCLGLIRRK LQKRKFARST LFIKTADIAI GGHNGSDTSP YAGSQTEKYR SFAPVVLALK VVLQQTNLDE PFAGGLGSYK LYVLVAYHIE QHLLLGGNDR PSEIFLGFLF RYGAILGYNS LDGTMTHLQK NVPVATFDAS IADLSNVFLL EHCVDLFGRC WRRLWKRTRS SSKNIGSFLA DIVDVKALAK ERQSHIQRAK ATLCHELAKN NIQHKST
|
| |