Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_20547 |
Symbol | |
ID | 7201155 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 490127 |
End bp | 492407 |
Gene Length | 2281 bp |
Protein Length | 555 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | homeobox protein |
Protein accession | XP_002180649 |
Protein GI | 219119793 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGTGCTGCAG CCCATCACGA AAGATTCGGC GTCTACGACA AAACTCCAAT AGGTCTCGGT TTATCCATTC CATCGCGAAC AAGGTTCATC ATGAAGTTCG TTCTGGGATA CTTTGGATTC ACGCTGGGCT TTGCCCAGGC CTTCAGCTTA CAAAGTGTGG GAAGATCGTC GTCCAAGACG ACGACGCACC TCCGAATGGA TTCGGAAGGC CATGATGAAA AGCCCGTCTT GAATAAGTAC AGTCGGTGAG TGTATGCCAT CGCGACATGG GACAATACTC CGTCGGATTG TTCGTAGAAG AGGGTAAAGT GAGCGTTGCC GAAGAACAAA GGTGCACTAA ACAACACAGA GTAGCTAGCT TACACACGCA CGTGTTCCTA TATTTTCAAT TTGCAGAACC TTGACGCAAA CCAAGGTCCA GGGTGCGTCG CAGGCGATGC TGTACGCGAC GGGTATTACG GAGGAAGATC TCGACAAACC TCAGGTTGGC ATCTGCTCCG TCTGGTACGA AGGCAATCCT TGCAACATGC ACCTTTTAGA TCTTTCCGAA AAGGTCAAAA AAGGAGTCGA AGACGCCTCC TGCGTAGGCT ACCGCTTCAA CACGGTGGGT GTTTCCGACG GTATCAGTAT GGGCACCTCC GGTATGCGGT ATTCGTTACA GTCGCGCGAT TTGATTGCGG ATTCCATGGA AACGACTATG GGAGGACAAT GGTACGATGG ATTGATTGCC TTGCCGGGAT GTGACAAAAA CATGCCGGGC TGTATCATGG CCATGGGACG CTTGAACCGA CCGGGTATCA TGGTCTATGG AGGAACTATT CGGGCTGGAA AGCAGCCGTC TACTGGAAAC AGTCTCGACA TTGTCAGCGC CTTTCAGTCT TACGGAGAGT ATGTTTACGA TAAGATTACG GAGGAGGAGC GCAAGGAAAT TCTGCAGCAC GCATGTCCCG GACAAGGAGC CTGTGGAGGA ATGTATACCG CAAATACAAT GGCCACCGCG ATTGAAGCCC TCGGTACGTG CTCGGATTTT TTTCTTCCCT TGGATCATGT AGTGATTCTC ACGTGACTTG ATCTACAGGC ATGTCCCTTC CGTATTCATC GTCGTCGCCG GCGGATTCCA AGGAAAAGGC TGATGAGTGC TATCGTTCGG GGGAAGCCAT GTATCGCCTT TTGGAACTTG ATCTTAAGCC TCGTGATATC ATGACCAAGG CGGCTTTCGA GAATGCCATG CGTATGGTCA TGGTCACGGG TGGATCCACC AACGCTGTCT TACACTTGAT TGCTATGAGT CGCTCGGTTC AGAATCCAGA AGTAGCAATC ACGTTGGAAG ACTTCCAACG AATCTCCAAT CAAACACCAT TCTTGGCTGA CTTAAAACCT TCCGGCAAAT ACGTCATGGA AGATGTCCAG AATATTGGCG GAACTCCTGG ATTGATCAAG TTCATGATTG ACAATGGTTT GTTTGATGGA AGCCAAATGA CCGTTAGCGG GAAAACACAC GCCGAAAACT TGAAGGATCA TCCCGGACTC ACACCCGGAC AGGACATCAT CCGTCCTCTT TCTGACCCCG TGAAAAAGAC TGGTCACTTG ATGATGATGT ATGGAAATCT CTGTCCCGGA GGTGGTGTCG CCAAGATTAC CGGTAAGGAA GGAGAAACGT TCACTGGAAC TGCGCGTGTG TACGACAATG AGCAATTGAT GATGCGTGGT TTGGAAAACA AGGAAATTAA GGCAGGCGAC GTGGTCATCA TTAGATATGA AGGGCCAAAG GGTGGCCCGG GCTTACCAGA GATGCTGACA CCCACAAGTG CGATCATGGG CGCTGGGCTC GGAGACAAAG TGGCGCTTTT GACCGATGGT CGGTTCAGTG GTGGAAGTCA CGGCTTCTGT ATCGGACACA TCACTCCCGA AGCGCAGGTT GGTGGACCCA TTGCCCTCGT TAAGAATGGT GACCCCATCC GCATTGATGC TCGTGCTGAA CAACGAACCA TTGATCTGTT GATTTCGGAC GAAGAATGGG AGAAGCGAAG AACAGAATGG ACGCCGCCAC CTCTCCGAGC GACGCAGGGA ACCCTCTTTA AGTACATCCA GTGCGTTGCG ACTGCCAGTG AAGGATGTGT GACTGACGAA GTTGGAACCT CGACAGCTGC TGAGATTGTG ATTGCCGCTC CCAAAACTCC CGCGGTTGCG GAATTGGAAG CAAAGATTGC AGCGCTGGAG GCCAGGATCG GCCAGGTTAC CAACTGAGCT AATTATCCTG AAATCTCATT TGGTATTGTT C
|
Protein sequence | LTQTKVQGAS QAMLYATGIT EEDLDKPQVG ICSVWYEGNP CNMHLLDLSE KVKKGVEDAS CVGYRFNTVG VSDGISMGTS GMRYSLQSRD LIADSMETTM GGQWYDGLIA LPGCDKNMPG CIMAMGRLNR PGIMVYGGTI RAGKQPSTGN SLDIVSAFQS YGEYVYDKIT EEERKEILQH ACPGQGACGG MYTANTMATA IEALGMSLPY SSSSPADSKE KADECYRSGE AMYRLLELDL KPRDIMTKAA FENAMRMVMV TGGSTNAVLH LIAMSRSVQN PEVAITLEDF QRISNQTPFL ADLKPSGKYV MEDVQNIGGT PGLIKFMIDN GLFDGSQMTV SGKTHAENLK DHPGLTPGQD IIRPLSDPVK KTGHLMMMYG NLCPGGGVAK ITGKEGETFT GTARVYDNEQ LMMRGLENKE IKAGDVVIIR YEGPKGGPGL PEMLTPTSAI MGAGLGDKVA LLTDGRFSGG SHGFCIGHIT PEAQVGGPIA LVKNGDPIRI DARAEQRTID LLISDEEWEK RRTEWTPPPL RATQGTLFKY IQCVATASEG CVTDE
|
| |