Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41109 |
Symbol | |
ID | 7198898 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | - |
Start bp | 376484 |
End bp | 377512 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | homeobox protein |
Protein accession | XP_002185097 |
Protein GI | 219129860 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.117683 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGAAT CGTTCTCCCG ACCGCACGAG GAAGGTTCTG GAGGCTTGCC ACCAACTTTG CTTCCCACCA AAGTGTATGG ATCACAGCCT CAATACAATC GTCCTTTTCC ATATCCGGAG GCTCCGGTTC GCTGGCAGGT GGGTTCGATC TTGGAGTGGT TCGAGTCGCT TGCTCAAGCC GCCGAGCAAG GCGAGTTCCA AAACAACTTT CACCTCGGGA ATCAGCAGCA CCGGCAACCC ATTGGAGGTT CCGAGTTACC ATTGGCAGAT CCAGTACCGC TGCCACCGTG GTATACTCAG CCAGTCTCTA ATGAGGAGGC TGTTATCGAT GAGACAAGCT GCAATGCTGC AACTCCGAAA AACTGGGATT TGTCCTTTGT GGAACATCAA GATCCGTGGG GCTGGTCTGA TCGTCAACGC ATGGCAAAGG TTGCTTGCCA AGTGGAAGCT AGTTCAAGTA TTAACTGTGA AAAGTATCAT GGTATCAGAA CTCGTTTCGA ATTTTTGTTT GGGCGCTGCG TTTTCCAACA CTCCACTCCG GTAGAGCATG GACAACGCCG GATGATTCTT CGAGTTCTTT ACGAAGAAAC CATTGCCAAG CTTCGGGAGC TCCTCGGTAA TCACCAGCCC CCACTGCACC AGATTCCGGA TAAAATACTG TCCGAGTCGT CCCCTATCGC TACAAAACAG GACTTGTCCA AGTACATGAC CGCATGGTTA CGGGAGAACT GGACAAACCC ATACCCGGAT GACCAAGGCT TGGCTACCAT GGCTCAAGCA TGCCACACGA CCCCCACGGT CGTCAACAAC TGGCTAATCA ATGCTCGCAC GCGGAAATGG CGACGAGCCA TAACTAAAGC GACAAACATG CATCGTCCGG CAAAACTTTT GTTGGAAGAT TCCTTGCGCA TTTTTGACGG CGAAGAAGTC CGAGAGTTGG GAAATTTGGC AATGGAGATA TCGTCATCGT GCTCGGAAGC CAACGACATG GAGGGCCCAA TGCACAAGCG GCACAAAAGT GGTATGTAA
|
Protein sequence | MEESFSRPHE EGSGGLPPTL LPTKVYGSQP QYNRPFPYPE APVRWQVGSI LEWFESLAQA AEQGEFQNNF HLGNQQHRQP IGGSELPLAD PVPLPPWYTQ PVSNEEAVID ETSCNAATPK NWDLSFVEHQ DPWGWSDRQR MAKVACQVEA SSSINCEKYH GIRTRFEFLF GRCVFQHSTP VEHGQRRMIL RVLYEETIAK LRELLGNHQP PLHQIPDKIL SESSPIATKQ DLSKYMTAWL RENWTNPYPD DQGLATMAQA CHTTPTVVNN WLINARTRKW RRAITKATNM HRPAKLLLED SLRIFDGEEV RELGNLAMEI SSSCSEANDM EGPMHKRHKS GM
|
| |