Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38261 |
Symbol | |
ID | 7203181 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 448695 |
End bp | 449987 |
Gene Length | 1293 bp |
Protein Length | 406 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182396 |
Protein GI | 219124197 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCAAG CGAACACTTG GAGCGGTGGC AGTTTTGTAA TAGCGGAAGC TAGTATTGTC GGACTCGATC AACGCTCGGG GGTTGCACTC GAAGTGTTGG TGAAACGCCG CGGGAAAGAG GACGTAAAAG AAATGGTTGA ATTCGACCTG AATGCAATTC CCGTACCAGA GAGGAAACGA TACTACGGGG ATTTACCACC GGTGCCAGAG GACACCGAAC GGACCGTAAT TGACGACGTT GTGCGGCGGA TGAATCGTTT ATGTTGGATT GTAGGCCAGC CCACTGTCAC GGGGAAACTG ATTCAGTTAG CCATTCAAAT GGGTGGTGCT GGTGTTGGAA ATCTGCGAGA GAACATGTAT CTTAACCAAG TTCCACATAA TCGCTACGTA CGGGATTATT TCTACGAGCA AGCTGCATTG GCGGTTCATG ATGCGGTGGT TTTGTGTTCC GAAGGAAAAT GTATTAACCG AATGCTGATC ACAAGTCAAT TTCCGGAGAT GAATCCGTCA ATGGACTCCT ATCGGATCGG AACGATTCTC GAAATGGTGC GCACGATTGG GATCAAGCTC GCGGAAGAGA ACTTGCGCGT TCGCATTTGC GTACAGGGAT CTATGGGAGG TGCGTTTCTC TTTCGTTCCC ATATGTCTCT AAGAAGAACC ATGTTTCTTA CACTTTGTCC CTATCGTTAA GTCGGTATCT TTACGGGAAT GCCCAAGCAG CTAAATGGGG TATCAAAAAT AATTCAAATG ATGGACTGGC AAAGCGGCGA GGGCGAACTC AACGAAGGCA TGGTTGGCGA CTACATTCGT TTCGGTGCCG TTGGACCAGA ACATGTGCTA AACGAAGAAA AGGACAAGGA TGACAACGTT GTACAATACC AAGACGACGT TTTTATCCTT ATTGCACCAC AGTCCATGGT CGGGACCGAC AGTAGTATTA TGCCATTGTT ACAGGGTATG GTGGAGGCCG CCGGCAATCG TCCCGTCATT TTAATGAATC CTGATTTGAC TGACAAGGTC AGTGCCGCCG GTCAACAAAG CGTTCGTGGG CGGCAGCAGC GTATCGATTT TGCCGAGAGT TTCCAAACCG TTTACCACTT CCAGAATATT TACATCTCGG GAACGTCATA CTTTCCCATT TTGGGGGCGA TTACGAAACT TCACCCAAAA GAACCCTGGC TGGCCCATCA GCGCCGCGAC TATGCGGACG GAGAAGGAGA AATTTACGTC CCGGTTTTGG CTGGCGAAGT CATTCCCAAG GGCGAAGAGA TTCTCGACGC CTTTGACCGA TAA
|
Protein sequence | MAQANTWSGG SFVIAEASIV GLDQRSGVAL EVLVKRRGKE DVKEMVEFDL NAIPVPERKR YYGDLPPVPE DTERTVIDDV VRRMNRLCWI VGQPTVTGKL IQLAIQMGGA GVGNLRENMY LNQVPHNRYV RDYFYEQAAL AVHDAVVLCS EGKCINRMLI TSQFPEMNPS MDSYRIGTIL EMVRTIGIKL AEENLRVRIC VQGSMGVGIF TGMPKQLNGV SKIIQMMDWQ SGEGELNEGM VGDYIRFGAV GPEHVLNEEK DKDDNVVQYQ DDVFILIAPQ SMVGTDSSIM PLLQGMVEAA GNRPVILMNP DLTDKVSAAG QQSVRGRQQR IDFAESFQTV YHFQNIYISG TSYFPILGAI TKLHPKEPWL AHQRRDYADG EGEIYVPVLA GEVIPKGEEI LDAFDR
|
| |