Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_12044 |
Symbol | |
ID | 7200677 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 140023 |
End bp | 141123 |
Gene Length | 1101 bp |
Protein Length | 266 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179588 |
Protein GI | 219117591 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACGAACGTT CGGCACCGAA GATCTCCCGC AAAATGGTAC GCATGGGTGG CATTTACATC AAACTGGGTC AAATTCTATC CACAGTTGGT TCCGGATTTT TAAACGAAGC CTACGTCTCG GCTCTACGCG TGCTTCAAGA TGGAGCCCCC GCTCGACCCT ATGCTGATAT TGTACATATT GTGGAATCAT CGACCGGTCG AACGATGGAT GAAATGTTTG TAGACTTTGA ACCGACCCCG ATTGGGGCCG CGAGTATCGG TCAAGCACAC CGCGCCACTT TGCGCCTACC CAATGGTGTC GACGCGGGGA CAATATCTGA GCCCGTCATT GTCAAAGTAC AGTATCCCGA AGTAGCCCGC TCGTTTCAAA TCGATTTCGA CAATCTAGCG GTTGTGACAC GCTGGTTCGA ACCCGAGCAA GTAGATTTGG TGGAATCATT ACGCGCACGA CATAATCAAG AGCTAGATTT TCATCACGAA GCTGATAATT TACGAACAGT CCGTCGGAAC CTGCAACGCT CTGGCGTTGA GCCGACACTC GTCCGAATTC CGATGGTCCG TAACGAAACA GGCATCTGCA ATAATAACGT TCTCGTCATG GAATACCTGG AAGGTACCAG TTTGGCTTCC GTCATTCAGC ACGAGCAAGA TCGGTTTGCA CAAGCTCTAG GTAAGAGTGA CGGGAAAGAG CTTCAGCAAA TACTACTGGC TCGTATGAAG GAACATTTCG AAAAGGGGGG CGGTACCGGT GAGGGCAGTC TGCTGGCCAT GGACTTTCCT TTGCTGCAGA AGTTGGGACC TGGCGTCACT CGCTTGATTC GCTGGTACGG AAACGTCAGA GAAACTGCTG GAGATTTCGC CTTTACACTA CGAACGGCTT CAGGAAAGAT GCGGAATGCT TTCGGAGGAA CCGACAATTT CGTCGCGCTC AAGAATCCCA AGCGCACTAC GAAAGTCAAT CTTGGCCGAG TTTTGAAAAC GCTCGTTCAT GTACACGGTC TTCAACTGAT GCAGGATGGC GTGTTTAATG CTGACCCGCA TCCCGGTAAC GTACTTGTAT TGCCAGATGG TCGACTAGGT CTATTGGATT ATGGCATGGT C
|
Protein sequence | HERSAPKISR KMVRMGGIYI KLGQILSTVG SGFLNEAYVS ALRVLQDGAP ARPYADIVHI VESSTGRTMD EMFVDFEPTP IGAASIGQAH RATLRLPNGV DAGTISEPVI VKVQYPEVAR SFQIDFDNLA VVTRWFEPEQ VDLVESLRAR HNQELDFHHE ADNLRTVRRN LQRSGVEPTL VRIPMVRNET GICNNNVLVM EYLEGTSLAS NPKRTTKVNL GRVLKTLVHV HGLQLMQDGV FNADPHPGNV LVLPDGRLGL LDYGMV
|
| |