Gene Information |
Locus tag | PHATRDRAFT_13951 |
Symbol | |
ID | 7202332 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 236291 |
End bp | 237707 |
Gene Length | 1417 bp |
Protein Length | 450 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181513 |
Protein GI | 219122358 |
COG category | |
COG ID | |
TIGRFAM ID | |
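The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends with one stop codon; neither convention is stated in the record, and the helper names are hypothetical. Under those assumptions the span is 1417 bp and the coding portion is 1353 bp, leaving 64 bp that are not translated, which would be consistent with the "Is gene spliced | Yes" field.

```python
# Values copied from the Gene Information table.
START_BP = 236291
END_BP = 237707
PROTEIN_LENGTH_AA = 450


def span_length(start: int, end: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein, optionally plus one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)

print(gene_len)            # 1417, matches the "Gene Length" field
print(cds_len)             # 1353
print(gene_len - cds_len)  # 64 bp not accounted for by the coding sequence
```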
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCACAC GCGGTGATCT AGACTTTCTC GATGATTTGC GCCGAAAGTA TCCGCACCAA CCAACGTTTT TGCAGGCAGT TGAAGAAATG GCCCTCGCCC TCAGCGATCT TTTTGATGGA CCGGACGGAG ATTTTTACCA AAGGGCCTTT CTGGCCATGG CGGAGCCCGA GCGTATCATT GCCTTTCGCG TTTCTTGGAT GGATGATAAC GGAAAGATTC GTTTTAACCG TGGATGGCGG GTCCAGTTTA GCAGGTATGT GACAACAAGA TAGTGGTTTT ACATGCTTCG TGAACATCTC TCATTTCTAT TCTTGTAGCG TTCTCGGTCC CTTCAAGGGC GGTCTTCGCT TCCACCCAAC TGTCGATGAA GGAGTCTTGA AGTTTCTTGG CTTCGAGCAG ATCTTCAAAA ACGCTTTGAC CGGATTGCCG TTGGGTGGAG GTAAGGGCGG GTCCGACTTT GACCCCAAGG GCAAATCGGA CGGGGAAGTT CGTCGTTTTT GCGAAGCCTT CATGTCCGAG CTTTGCCGTT ACTTGCATCC ATCCACAGAT GTTCCTGCTG GTGATATTGG AGTTGGTGGC CGCGAAATTG GCTACATGTA TGGACAGTAT AAGCGTATAA CAAATCGCCA CGGTGTCGGT GTCCTAACGG GTAAGTCCAT GAACTTTGGT GGCAGCGAAA TTCGCCCCGA GGCGACCGGA TATGGTCTTA TTTACATGAC AAAGATTGCC GTCCAAAGGA AACTGAACCG AAACTTGACG GACATGCGCT GTGCAATTTC CGGTTCGGGA AATGTTGCAC AGTTTGCGGC GAAGAAGTTG CTCGAGTTTG GTGCCAAGGT CATGACTGTG AGTGACTCCA ACGGTGTGAT CGTTTTCGAG AGCGGAATGA CGGCCACAGA CTGGGATGCT GTTTCCGACT GCAAAAATAA GCACCGCGGA CGTCTTTCGT CCATCCAGGA CAAAGTCAGC GGGCAGTATT ACGACGGCGA AAGTCCTTGG AGTTTAGACA TCAAGTACGA CTTGGCCTTG CCTTGTGCTA CACAAAATGA AATTGACGAG AAGTCGGCCA GGCAACTTGT AAAAAACGGA GTGTTGGGAG TGTTGGAGGG CGCAAACCTA CCGACCGACT TGGAGGCGCA GGCCGTGTTC CGTAAGGCCG ACGGTGTTAT TTATGTCCCG GGAAAGGCAT CCAATGCTGG AGGTGTTGGT GTCAGTGGTT TGGAGATGAG TCAGAACGCA CAGCGGCTCA CGTGGAAATC AGAAAAGGTG GACGAAAAAC TGCATGGCAT GATGGACGAG ATTTACAGTA TGATGGAAGA AGCCGAACTA AGTGGAGGAA CTTTGGAACA GGGAGCAAAC CGTGCTGGAT TTCTGAAGGT TGCCACGGCT ATGAGAGAGC TAGGTTGGGT GTATTGA
|
Protein sequence | MSTRGDLDFL DDLRRKYPHQ PTFLQAVEEM ALALSDLFDG PDGDFYQRAF LAMAEPERII AFRVSWMDDN GKIRFNRGWR VQFSSVLGPF KGGLRFHPTV DEGVLKFLGF EQIFKNALTG LPLGGGKGGS DFDPKGKSDG EVRRFCEAFM SELCRYLHPS TDVPAGDIGV GGREIGYMYG QYKRITNRHG VGVLTGKSMN FGGSEIRPEA TGYGLIYMTK IAVQRKLNRN LTDMRCAISG SGNVAQFAAK KLLEFGAKVM TVSDSNGVIV FESGMTATDW DAVSDCKNKH RGRLSSIQDK VSGQYYDGES PWSLDIKYDL ALPCATQNEI DEKSARQLVK NGVLGVLEGA NLPTDLEAQA VFRKADGVIY VPGKASNAGG VGVSGLEMSQ NAQRLTWKSE KVDEKLHGMM DEIYSMMEEA ELSGGTLEQG ANRAGFLKVA TAMRELGWVY
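The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of doing that, computing GC content, and translating the first three blocks of the gene sequence. It assumes the standard genetic code, because the Translation table field is blank in this record; the function names are hypothetical. Only the first 30 bases are translated here: the gene is marked as spliced, so translating the full genomic sequence directly would not reproduce the protein.

```python
# Standard genetic code (an ASSUMPTION: the record leaves "Translation table" blank).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in an already cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
prefix = clean("ATGTCCACAC GCGGTGATCT AGACTTTCTC")

print(len(prefix))         # 30
print(gc_percent(prefix))  # 50.0
print(translate(prefix))   # MSTRGDLDFL, the first block of the protein sequence
```

Applying `clean` and `gc_percent` to the full gene sequence gives a value that can be compared with the GC content field in the Gene Information table.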
|
| |