Gene Information |
Locus tag | PHATRDRAFT_34720 |
Symbol | |
ID | 7200306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 215337 |
End bp | 217067 |
Gene Length | 1731 bp |
Protein Length | 576 aa |
Translation table | |
GC content | 59% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179386 |
Protein GI | 219117183 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
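The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the gene length of an unspliced CDS includes the stop codon. Both assumptions are consistent with the values listed (217067 - 215337 + 1 = 1731 bp, and 1731 / 3 - 1 = 576 aa). The function names are hypothetical, and `gc_percent` is a generic helper for recomputing the GC content from a sequence string that may contain spaces between blocks.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters; whitespace is ignored."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    gene_len = feature_length(215337, 217067)
    print(gene_len)                  # gene length in bp
    print(protein_length(gene_len))  # protein length in aa
```

Passing the full gene sequence string from the Sequence section to `gc_percent` would give a value to compare against the reported GC content.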
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTCCTT CGAATCGCAG TCCAACTACA GTCGAACATA CCAGTGCCAT GCTTGCTACG GCTGCCCCCA CTACCGTCAT CAGAAGTACT AGCAGTGGTA GTGGTGGGAA TAGTAGTATT AGTTACAGGA AGAGTAACAG CAACAGTCGC AGTCGCATTC TCACCGAGGC CAACGAAGGC AAAGTGAAGA CGGCTGATTC CAAACCAGCT CCTTTGGAAC AAGCGGCGTC CTCGTCCGCG TCCTCCACGA CAGCATCGTC TTCATCGTCG TCGCCAACGC AACACTGGAA GCATCGCAGT TCTCCCATTT CCGTCGCGAC GAGCAAGACC CCGCCGTCTC CACTGCTGCC GCATCCGTCC ATGACCACCC CGATTCCCCG TTGGAAACAT TTGCAACAAA AGCGTCTCGC GCTTTGTCAG CAGCGTGTCA AGCTGGAACA ACGTCTTTTC GAATACTCAC AGGACAAGAA TATTGTTGTT TCGCCGAGTC CACCACATTC TCCGACGACG GCTCCATCCG CAGCTTGGAC TACCAAAAAC AAAACCCCGA CTCCGCCTCC ACCCGTTCTC GCGTCCTCAT CGTTACTCCA ACCGTCCTTG CCGGACTCCA AAGCTGCTCC ACTCCATTCC TGGACGGGTG CGTACTGCTC CGGAAAACCC CACGGACAAG GCACGCTCAC CTACACCAAC GGACAAGTCT TCACGGGTGA ACTTTGTCAA GGCCGTCGCC ACGGACACGG CGACAACGTC TGGCCCACCG GACAGCGGTA CCGTGGGGCC TGGCGACGCG ACAGTCGGGA CGGCCGGGGC ACGCACTCCT GGCCCGACGG ACGCACCGTC ACCGGACCCT GGAAAGATGG ACATTTACAC GGACGGGTAC TCTTCCAGTG GCCCGACGGA ACCGCCTACG ACGGAGACTG CCACCAGGGG AAGAAACACG GACGCGGCAT ACAAAGCTGG TTCGACGGAC GCGTCTACGC CGGACAGTAC GTCAACGGCG TGGAACACGG ATTCGGGACC CTCACCGAAG CCGACGGCGT ATCGCAGTAT CGGGGACAGT TCCGGGATGG ATCCCGGGAC GGCTACGGCA TTCAAAGTTG GCCCACCAAA ACGTACGACG GCGAATGGCG ACACAACGTC GTGGACGGAC GGGGGAAGTT GTCCTGGACG AACGGATCGA ACTACACCGG ACAATTCCGA GACGGGGTGT ACCATGGATC GGGCTGTTAC CTGGACGCCA CGGCCGGGAC CAAGTTCGTG GGTCAATGGG ACCACGGACG CAAACACGGA CACGGACGCC AAACCTGGTC GTCCGGACAG TCCTACACGG GCAACTACCG CTTGGGACAA CGACACGGCT ACGGACGTAT GGTGTACGCG GACGGTACCG TCTACGCCGG GGGATGGTCC AAGAGTCTCC GGCACGGATA TGGAATCCGC CTGTCCGCGA AAGACGTCGT TCTGCATTGT GGTTTGTGGG AACGGGACTT GCCTTTGGTG TCGAACAAGG ATAGTAGTAT CCCACGTCAG ACCCAAGATG ATCTAGCCAT ACTCCGTGAG CTTCGGAAAT CATTCCGTGG CCCCGTCCGG TCCCTAGACG ATGGGGATTT GACCCACGAT CGAGTTGTAC TATCAACAGA CGTAGAGGAT GACGACTACG ACGACCAACA AGATTTTCCG AACGAGGCCA TTGACCTACC ACCGGTCGAC GAGAACGTGG CGGCGTGTTG A
|
Protein sequence | MPPSNRSPTT VEHTSAMLAT AAPTTVIRST SSGSGGNSSI SYRKSNSNSR SRILTEANEG KVKTADSKPA PLEQAASSSA SSTTASSSSS SPTQHWKHRS SPISVATSKT PPSPLLPHPS MTTPIPRWKH LQQKRLALCQ QRVKLEQRLF EYSQDKNIVV SPSPPHSPTT APSAAWTTKN KTPTPPPPVL ASSSLLQPSL PDSKAAPLHS WTGAYCSGKP HGQGTLTYTN GQVFTGELCQ GRRHGHGDNV WPTGQRYRGA WRRDSRDGRG THSWPDGRTV TGPWKDGHLH GRVLFQWPDG TAYDGDCHQG KKHGRGIQSW FDGRVYAGQY VNGVEHGFGT LTEADGVSQY RGQFRDGSRD GYGIQSWPTK TYDGEWRHNV VDGRGKLSWT NGSNYTGQFR DGVYHGSGCY LDATAGTKFV GQWDHGRKHG HGRQTWSSGQ SYTGNYRLGQ RHGYGRMVYA DGTVYAGGWS KSLRHGYGIR LSAKDVVLHC GLWERDLPLV SNKDSSIPRQ TQDDLAILRE LRKSFRGPVR SLDDGDLTHD RVVLSTDVED DDYDDQQDFP NEAIDLPPVD ENVAAC