Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43434 |
Symbol | |
ID | 7197440 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 449785 |
End bp | 451691 |
Gene Length | 1907 bp |
Protein Length | 593 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177935 |
Protein GI | 219112367 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.996285 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGATTGT TGTCGACGCG ACGCTCAACC CACGCCTTGG ATCCACGATC GCTCGCAAGC TTCCAGCAGC GGCTGGAGTG GCTACGAGCG GCTCTCCAGC GTGTGCATGC GAATGTTAGT CGCGACGATC TTCGAGATGA GCAAGAACAG CTCTCGTACG ATATTTATGT GACGCAACTG TCGGATTATA TACGCTTCAC GCCATGGCAC AAATCCTATC TCTGCTGCTT GAATCGTTTG GAAGGTCCCC AAACAGACTT GCCACTATAT GCGCGATACT TGCCTACAAA AACCAAAGAG GAAAGGGCCT TCTATTTGAA GTTTCTGCAA GCAATTCCAA TTCAATTGGG TGAGGTTATA AATCTCCTGC AGACAGGCCT TGTGGAGAAT CGGACTCCGC CGAAGGTAAG CTTGGATGGC GTGGTACCAC AGGTTCGTGG TATGATCAAT GGCAATCTTG AGGCATTCCG CGAGCCTATC CGCAACGCTT TCCCTAAAGA CGAGGCCAAG ATCTTGGAAG CATGTCAAGC CCAAATCGAC GGATCTGTGA CACAGGCTTT TGCAGATTTC GCTGATTTCT TGCAGAATGA GTACATTCCT CATTTAAGAG AAGACATCAG TGCGGTGACT GGATATCCAG ATGGAAAGCA ATACTATCAG GACTGCCTAG CGTTTCACAC AACCACCAGT ATGACCCCGG ATGAAATTCA TGAGCTTGGA TTGAGCGAGG TTGAGCGCGT ACGTCAAGAA ATGGAAGCGA TTGCCGCCCA AGCTGGGTAC GGAGGTCGAC TCGATGACTA TCTGGAACAT TTGCGAACTT CCAAAGTTTA TGAGGCAGAG TCTGGACAAG CTTTGTGTTC CTTATTTCGA GATATCACTG GGAGGATTGC CCCCGCAATG CTCAAATTGT TCCATCTCGA AACGCTGCCA CGTATGCCGT TTTCGATCGT CGAAACTCCC TCTGCTCATG CTTCCATGGC ACCAGCTGCG TATTACTTAG CCGGTAGCAC CAATCAAAGT GCGTCGCGTC CGGGTATATT CTATGTAAAT ACATCAGAGC TTCCGACTCG TCGCACTTAC GAGTGTGAAG CTTTGGCCTT ACACGAAGCA ATCCCAGGAC ACCATACACA AGGATCGATT CAAGGTGAAA GTCACAACTT GCCCGCATTT CGTCAAATGC AAGAAGATCG GCGGTATTTT GAAGCTCCCT GTGAGTGAGC AAATGCTGAT ACTGCGTCCT TTTCATTCTT CCAATGCATA AAACTAGTCT CATATCTCTC TTCTTTTGCT ATAGGCCGTT TCCCCTTCTA CACTGGCTAT ATAGAAGGCT GGGGTCTGCA CAGTGAGACG CTCGGTGAAG AGCTCGGTCT GTACACAAAA CCAGAGAGCA AAATGGGACA GCTTTCCATG GAGGCCCTCC GTAGTTGTCG ACTGGTGGTG GACACAGGGA TGCACGCCAT GGGTTGGACG CTGGACGAGG CATTGCATTT TATGCTGGAA AATACCGCTA TGGGAAAGCA TGATGCCGCC ACGGAAGTAG CACGGTACGT CACTTGGCCC GGACAAGCCA CTGCCTACAA AGTAGGGGAG CGCTATTTGC GGAAACTGCG CACTATGGCA GAAACAGAAT TGGCTGAGAA ATTTGATCCG AGAGATTTTT ATGATGTCGT ATTGCAAGTG GGACCAGTCC CGCTGGACAC TCTGGAAAAG CTTGTTAGGG ATTACATTCA GGAAACAAGC AATAGAACCG CTTCATCAGG TGGGGACTTG AGCGAGGGTG AACCTGGCTT TCTGGAACAA ATGACCTTTT TCAATTGGTG CAAATGTTGT GTTGTCCCGG GGTCGTGTCA GTCAACAGCA CGTTAGAATA GAATGCTTTT ATAAGATTTA ACACTATCTA ATATAGT
|
Protein sequence | MGLLSTRRST HALDPRSLAS FQQRLEWLRA ALQRVHANVS RDDLRDEQEQ LSYDIYVTQL SDYIRFTPWH KSYLCCLNRL EGPQTDLPLY ARYLPTKTKE ERAFYLKFLQ AIPIQLGEVI NLLQTGLVEN RTPPKVSLDG VVPQVRGMIN GNLEAFREPI RNAFPKDEAK ILEACQAQID GSVTQAFADF ADFLQNEYIP HLREDISAVT GYPDGKQYYQ DCLAFHTTTS MTPDEIHELG LSEVERVRQE MEAIAAQAGY GGRLDDYLEH LRTSKVYEAE SGQALCSLFR DITGRIAPAM LKLFHLETLP RMPFSIVETP SAHASMAPAA YYLAGSTNQS ASRPGIFYVN TSELPTRRTY ECEALALHEA IPGHHTQGSI QGESHNLPAF RQMQEDRRYF EAPCRFPFYT GYIEGWGLHS ETLGEELGLY TKPESKMGQL SMEALRSCRL VVDTGMHAMG WTLDEALHFM LENTAMGKHD AATEVARYVT WPGQATAYKV GERYLRKLRT MAETELAEKF DPRDFYDVVL QVGPVPLDTL EKLVRDYIQE TSNRTASSGG DLSEGEPGFL EQMTFFNWCK CCVVPGSCQS TAR
|
| |