Gene Information |
Locus tag | PHATRDRAFT_50238 |
Symbol | |
ID | 7199015 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 70649 |
End bp | 73068 |
Gene Length | 2420 bp |
Protein Length | 625 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185118 |
Protein GI | 219129906 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TAATGACGAT CACATTCAAA GGAACAGCAC AAGATTGAAC CACTGTTCAG ATAGCCACAA ATTGTTAAGC GAAAACCGAT TCAGACAACG AAGAAGAGCG TCCAGTAGAA GCGTCCCCCA GATACAGGAA CTTGCTGATG AGACGGCCTT TAGCTTGGTC GACTCATTTT GTCGGTCTGC TAGGTGGTTT TGTCGTTCTT TCAAATCAAA CGGACGCAAG CTGGGTAGAT CCGGACACGA AGCAAGCTCA TTTGACGACG CAACCCCTCG CAAATGAAGA CAACCGAGCG TACGAGCTAG TGAGTAGTCT TTACTGCGTT TTGGTTTTGG TTGTGCTGAC TGGATATGTT TCGAAGTAGC TCATCCAATT CGATTTTCAC TTGTTCCACG CTACAGGTGT TTTCGGATGA GTTTGAGCAA GCCGGGAGGA AATTTGCAGA TGGGAGTGAT CCTCGATGGA CAGCAATCAA CAAAAATGAC TGTATGTGAA TATTCAAAAG CGGTCGACAA GTATTGCCAG TTTCTCTTTT ACTCACATTT TTCCATTCGT TTTGTAGATA CGAACGAAGC CTTGCACTTT TACAGTCACG ACAACGCGCA CACGTCCAAT GGGGTGTTGA ATATTACTAC CGTTGAAAAG GAAAACGTTT ATAAAGCATT TAACGAGCAC ACAAAGCAGT TCTACGCCGA TAAGAAATAT GTGCAGAGCG GGATGCTACA GAGTTGGAAC AAGTTTTGTT TCGTTGGCGG CATCGTCGAG TTTTCGGCAA AACTACCCGG AGATCCACAC AAAGGAGGAT TGTGGCCCGC TCGTAAGTCC AGGAAGTGCA CTATCTGCTT CTCTTCGTCG TTTGGCTCCA ACTCTGACCT TTTCTCGATC GTAATAAGTG TGGATGCTCG GAAACTTGGC TCGGGCAACC TATGTAGGTT CTTCCGACTA TGTTTGGCCA TTCAGTTACA ACCAATGCAA TCCCAAGACC CGACAGAGCC AAGAGATCAA CGCTTGTTCT TTAGTAAACC ATTACGGATT AGCTCCGAAA AGCGGTCGAG GTTCTCCTGA GATTGATATC CTGGAAGCGA TGCAAGGCGA ACCGGGAGAT TTACCAAATA CTTTCATCCA ACGACCATAC CAATCAACCT CATTACAAGT GTCACCTGGC ATTGAAATAG ATCGGCCTAT TCTGGGAAAA CGACCACACG AGGTTCGTTT GTATCCGTCG TTTTTCTTTT GTTTTTTGGT TCTCTTCCAA CAGTCTGTAT TTCTTGTTTT TCTTTTAGGG TCACTGGTAC CCAGACTTAG AATATAGCAG TCAGAACAAA TCAGACTTGA ATCCGTTCTT TTACGGAGTC ACGCTGGTGC ACAAGCCTAA CTCTTACACG TACCAGTCAG ACGCCCTGTC GGCAAACTTG CAGCTAAAAG CCACGCACTA TAGCAAACAG CATGTCTACC GAGTCGAATG GGAACCACCG GCAGAAGACG GCACAGGAGG CTACATAAAG TGGTTTACCG ATGGGGAGCT TATCTACGGG ATTCATGGCA AAAGTCTTGA CATCATGAAG ACGGAGATTC CTAGCGAACC GATGTATTTG TTAATGAATA CGGCAGTGTC AAGTCACTGG GGCTTTCCTC AGCCATGCCC TGAAGGCTGT TCTTGCAAGT GCTTTGAATG CGGGAACCCA GAGTGTGCGT GTGCAATACC GTCAGGGTAT TGCGACAATT TTCCTGCCTC CTTTGAGATC GATTATGTTC GTGTGTATCA GGCTATCAAT GAATCTAAGC ACATTTTGGG ATGTTCACCA 
GAGGCACGGC CAACTGCAAC GTTTATCGAA GGACATGCAA AGCGATATAT GACAGAAGGG CAGCGACGGC CGCTAGAACC CGTCGTGACA GGCGGTGGGA GCTGTTCTAG CCACAAAGAC TGCGGAGGAA TCGAGCGAGG CGTTTGCTCA GCTTCAGGTC TCTGTGAATG CTCGGAGTAC TCAGCTGGTC CGCTGTGCTT AGCACATGCA GCCTTCTACG ATTTTGATAC CAGCAAACAA CCAAAAATAT TTTCATGTAA GTTCGCGTGC TTACCTTGTT CAGCAGGCTG GTAAACGACA ATTTAACCAC TCTTTTGTCC ATCCGTCCAC CTATAGATCG TCACATACAC TTCCCATCGA GTCTCATGGT AGTGGTCAGT TTGCTGATTG GAGGCTTTCT GTTGTCGATG GCTTCGGCGG TAAGGGAAAA GTCGAAAGAG CCGAAATACA GCAACGTGAA TGGGGGAACG AGTAATTTAT CTTTCCAGAC GACAGGATCT GGAGCTGGTG TCGGCTCTTA CCAGAACCCA GATGGTGCAA CTTTCACTGT ACCTGCAAAT CAAAAGGATG TGACCTATTG CGTCATCGAT GGACGACTAG TCGATCAAGA CCATAACTAA
|
Protein sequence | MRRPLAWSTH FVGLLGGFVV LSNQTDASWV DPDTKQAHLT TQPLANEDNR AYELVFSDEF EQAGRKFADG SDPRWTAINK NDYTNEALHF YSHDNAHTSN GVLNITTVEK ENVYKAFNEH TKQFYADKKY VQSGMLQSWN KFCFVGGIVE FSAKLPGDPH KGGLWPALWM LGNLARATYV GSSDYVWPFS YNQCNPKTRQ SQEINACSLV NHYGLAPKSG RGSPEIDILE AMQGEPGDLP NTFIQRPYQS TSLQVSPGIE IDRPILGKRP HEGHWYPDLE YSSQNKSDLN PFFYGVTLVH KPNSYTYQSD ALSANLQLKA THYSKQHVYR VEWEPPAEDG TGGYIKWFTD GELIYGIHGK SLDIMKTEIP SEPMYLLMNT AVSSHWGFPQ PCPEGCSCKC FECGNPECAC AIPSGYCDNF PASFEIDYVR VYQAINESKH ILGCSPEARP TATFIEGHAK RYMTEGQRRP LEPVVTGGGS CSSHKDCGGI ERGVCSASGL CECSEYSAGP LCLAHAAFYD FDTSKQPKIF SYRHIHFPSS LMVVVSLLIG GFLLSMASAV REKSKEPKYS NVNGGTSNLS FQTTGSGAGV GSYQNPDGAT FTVPANQKDV TYCVIDGRLV DQDHN