Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40473 |
Symbol | |
ID | 7198180 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | - |
Start bp | 510742 |
End bp | 512557 |
Gene Length | 1816 bp |
Protein Length | 565 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184472 |
Protein GI | 219128546 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACATG GAGACATTCG AAAAGTTCTC GCTTCGGCTT CCTTCTGTAA GCAGAATCCC ACGAGCTCAC TACAGTCCAA CATGCTCGAG TACAGTATTT CCCGGCACGC CGTTACTGGG ACAACATCCT CCCTCATTGA CAGAGGTGCA AACGGTGGAC TCGCTGGGAA TGATGTTAAA ATCCTGAACA AGACAGGTCG TTTTGCTAGC ATCACTGGTA TCAATGACCA TACCCTGCCT GATTTAGATA TCGTCACCGC TGCTGGACTT GTTGAATCCC AGAACGGACC TATCATTGTC ATACTACACC AGTATGCACA CCATGGGAAA GGTAAAACGA TTCATTCTAG TGCGCAACTT GGATACTACA AGAACGTTGT CGAAGACCGT TCTCGGGTCC TAGGGGGTAA ACAGCGTATC GTAACTCTAG ACAACTACGT TATTCCTCTT CACATTCGCC AAGGACTGGC TTATATGGAC ATGCGCCCAC CTTTGGATAC CGAATTTGAC ACACTTCCGC ATGTTGTTCT TACTTCCGAT GTGGACTGGG ATCCATCTAT CATTGACAAT GAAATTGATC TTGTCACGGA CTGGCATGAT GCCGTCCAGG ACCTTCCCGG CGATCTGTAC GTTGAACCTC GCTTCAATTC AACCGGGGAA TACCGACATA GGCACGTTGC CAATTATGAC ACGAATTGGT CGATCCATCC ACGGCTATTG GCAATATACT CTCGTCAAAC AAGCATGATA TGAGCCGCAA TGCCCACAAT TACGAAGCTT TGCGCCCTTG TCTTGGTTGG ATCTCTTCCG ACACAGTTCG GAAGACCATC TTGGCCACCA CACAGTTTGC TCGCGAAGTT TATCATGCAC CTATGCGTAA GCACTTCAAG TCTCATTTTC CGGCACTTAA TGTTCATCGG CGCAATGAAG CTGTCGCTAT CGATACCATT TGGTCGGACA CGCCTGCTGT TGACAATGGC GCTAAATTTG CACAACTATT TGTTGGTAGA CGGTCGCTTG TCACCGACAT TTATCCTATG AAAACAGACA AAGAGTTTGT CAATGCTCTT GAAGACAATA TTCGTCATTG TGGCGCCATG GATAAGCTCA TTAGTGATCG TGCCAAGGCC GAAGTCAGCA AGAAGGTTTC TGATATTACC CGTGCTTACC ACATTGATCA ATGGCAAAGC GAGCCCAATC ACCAGCACCA AAATTATGCT GAACGCCGCA TTGCAACTGT CGAAGCAAAT GCGAATAATA TCCTAAACAA AACCGGTGCA CCCAATTCTA CATGGTTATT GTGTGTTTCC TACATTTGTT ATTTGTTCAA TCATTTGGCA CATGAGTCTT TACACGATCG TACTCCCCTT GAAGTCCTCA ACGGTAGTAC CCCTGATATT AGCGTACTCC TTCAATTCCA TTTCTGGGAA CCGATCTACT ACCGACTTGA AGACCCTACT TTTCCTTCCG ACGGAACTGA AAAAAGGGGC CACTTTGTTG GAATTGCTGA TTCCGTTGGT GATGCTCTTA CCTACAAGGT ACTCACCAAC GACTCCCACA AGATCCTTCT CCGATCTAGT GTTCGCTCTG CGTTGAAACC TAGTGAAACC AATTTGCGTC TTGAGCCACA TGAAGGGGAG AGTCCTCCTA AGCCCATCAA CTTCACTAAG TCGCGCAGAA CTGAGGACGG AAATTCTTAT GCCATCCACA CGCTACCTGG TTTCACCCCG GACGATCTCA TCGGACGCAC CTTTTTAACC GATACCCAGG ACAATGGGGA GCGTTTTCGT GCACGTATTG CCAGGAAAAT TCTTGA
|
Protein sequence | MAHGDIRKVL ASASFCKQNP TSSLQSNMLE YSISRHAVTG TTSSLIDRGA NGGLAGNDVK ILNKTGRFAS ITGINDHTLP DLDIVTAAGL VESQNGPIIV ILHQYAHHGK GKTIHSSAQL GYYKNVVEDR SRVLGGKQRI VTLDNYVIPL HIRQGLAYMD MRPPLDTEFD TLPHVVLTSD VDWDPSIIDN EIDLVTDWHD AVQDLPGDLY VEPRFNSTGE YRHRHHDMSR NAHNYEALRP CLGWISSDTV RKTILATTQF AREVYHAPMR KHFKSHFPAL NVHRRNEAVA IDTIWSDTPA VDNGAKFAQL FVGRRSLVTD IYPMKTDKEF VNALEDNIRH CGAMDKLISD RAKAEVSKKV SDITRAYHID QWQSEPNHQH QNYAERRIAT VEANANNILN KTGAPNSTWL LCVSYICYLF NHLAHESLHD RTPLEVLNGS TPDISVLLQF HFWEPIYYRL EDPTFPSDGT EKRGHFVGIA DSVGDALTYK VLTNDSHKIL LRSSVRSALK PSETNLRLEP HEGESPPKPI NFTKSRRTED GNSYAIHTLP GQWGAFSCTY CQENS
|
| |