Gene Information |
Locus tag | PHATRDRAFT_40200 |
Symbol | |
ID | 7195833 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | + |
Start bp | 391864 |
End bp | 394177 |
Gene Length | 2314 bp |
Protein Length | 710 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184124 |
Protein GI | 219127817 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00209699 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAACGA ATTTTGCTTT GTCGACGCGC TGCTTTGCTT CTTCATCCGA CAACCATGAC GAAGAGGAAC AACGAGACTC TCCGAAACAA AGATCCAAAC GCAGCCAAAC TAATCGGTCC AAGAAATTCA AAATTGCTGA ATCAATCGAC CAGAGCAAAA TAGATAAGCT AGCACAAGCA TTCGATGAAC TCGCTCGGAA GGAAGGCTTC GACTCGTCAA CAGCACGCTT TGCCGACGAT GTGACGTTCG AGGACAAGTT TGACGACGAT TCGTTTCTGG ACGATGACGA TGATAACAAC AAAGATAAAG TGGGAAACTT GCACCTAGAT GCATCCATGT TCAGTTTAAG TGACTTTATA GATAAGAGTG AGGAAGATGG CGGCAATCCA ACCGATCAAG ATGACGAGGA CTACCTTGAT TTTGGTGCAG ACATTGACAT GAGTATAGAA GCAAGGATTG CCGCTGCCAA ACGGGATATG GATCTCGGTC GAGTCAGCGC CCCTCCCGAT ATGAGATCCT CGCGCAGGGA GGTAACTGCA GCCGACCTTC GCAAACTTGG ATTTCGAACC GAGGCAAACC CATTCGGCAA CGACGAAACT CCACGGAAGG AGCGCTTCCA GTTGGTAACA AACTCCATGT CGTGCTCCGC CTGTGGATCG GACTTTCAAT GCCACAACGA AGATCGGCCC GGATATCTGC CTCCTGAAAA GTTCGCTACG CAAACAGCAC TTGGAAAAAT AGAACAGATG CAAAAGTTGC AGGATAAAGC AGAAAAAGCG GAATGGACAC CTGAAGATGA GATTGAATGG TTGATTCAGA CTCAGGGCAA AAAGGATCCG AACAAAGAAA TGCAGGAGGT GCCCCAGATC GATGTTGATT CTTTGGCAGG GGAAATGGGC CTTGACCTCG TAGAGCTTTC CAAAAAGATG GTTATTTGCA AGCGCTGTCA CGGTCTGCAA AACTTTGGAA AAGTGCAAGA TTCCCTCCGA CCTGGGTGGA CGAAGGAGCC ACTGTTGTCG CAGGAGAAAT TTCGTGAATT GTTAAGGCCA ATCAAGGAAA AGCCGGCAGT TATCGTTGCA TTGGTCGATC TTTTTGATTT TTCGGGGTCT GTGCTCCCTG AGCTTGATGA AATCGCTGGT GAAAACCCTG TAATTCTTGC GGCCAACAAG GCGGATCTTC TTCCAAGTGA AATGGGACGC GTGCGAGCTG AGAGTTGGGT TCGACGCGAG CTCGAATACC TTGGAGTCAA GTCGTTGGCC GGTATGAGAG GAGCAGTTCG GCTTGTCAGC TGCAAGACTG GAGCTGGGAT TAATGATTTG CTGGAGAAAG CAAGAGGATT AGCCGAGGAA ATCGACGGCG ACATATACGT CGTCGGGGCT GCAAATGCAG GAAAAAGTAC GCTTTTGAAT TTTGTTCTAG GTCAGGACAA GGTGAACAGA TCACCCGGAA AAGCACGAGC AGGCAACAGG AATGCCTTCA AGGGCGCGGT GACGACAAGT CCACTGCCAG GCACAACGCT TAAGTTCATC AAAGTCGATT TAGGCGGCGG TCGAAGTCTA TATGACACTC CTGGTCTTCT GGTATTAGGC ACTGTGACAC AGTTACTGAC CCCCGAAGAG CTGAAGATAG TTGTTCCCAA AAAGTATGTC AAACCGATCA AACTGATATT CGATTCACAG TCAATAATGT TCAAACTAAC ACCTCGTTCC TCAAACAGGC CAATTGAACC TGTCACCCTC CGGCTCTCTA CCGGAAAGTG CGTTCTAGTT GGAGGATTGG CCCGCATCGA GTTAATCGGC 
GACTCAAGAC CCTTTATGTT CACATTTTTT GTTGCTAATG AGATCAAGCT CCACCCTACT GACATAGAGA GAGCCGATGA GTTCGTTCTA AAGCACGCTG GTGGCATGTT GACTCCACCG CTAGCACCCG GACCAAAACG TATGGAAGAG ATTGGAGAAT TTGAAGATCA CATCGTGGAT ATCCAGGGTG CTGGCTGGAA AGAAGCTGCT GCTGATATCA GTCTTACCGG ACTAGGATGG GTGGCCGTTA CAGGAGCAGG GACAGCGCAA GTAAAAATAA GTGTTCCGAA AGGTATTGGT GTATCGGTGC GGCCTCCGCT TATGCCTTTC GATATCTGGA AAGTTGCATC GAAGTATACC GGAAGTCGAG CTGTAAGAAA GTCATCCAAA CTGGCGAATG GGAAACGAAG AAAAGGTGTA GGGCGTAATT AGTCTTGTTA GTCGTTAGAC TTTATTTTAA TTTGACTACT GTTAACAGGT AAATTATAAC TTTTCTCTTT CAGTTGATAT CTAG
|
Protein sequence | MRTNFALSTR CFASSSDNHD EEEQRDSPKQ RSKRSQTNRS KKFKIAESID QSKIDKLAQA FDELARKEGF DSSTARFADD VTFEDKFDDD SFLDDDDDNN KDKVGNLHLD ASMFSLSDFI DKSEEDGGNP TDQDDEDYLD FGADIDMSIE ARIAAAKRDM DLGRVSAPPD MRSSRREVTA ADLRKLGFRT EANPFGNDET PRKERFQLVT NSMSCSACGS DFQCHNEDRP GYLPPEKFAT QTALGKIEQM QKLQDKAEKA EWTPEDEIEW LIQTQGKKDP NKEMQEVPQI DVDSLAGEMG LDLVELSKKM VICKRCHGLQ NFGKVQDSLR PGWTKEPLLS QEKFRELLRP IKEKPAVIVA LVDLFDFSGS VLPELDEIAG ENPVILAANK ADLLPSEMGR VRAESWVRRE LEYLGVKSLA GMRGAVRLVS CKTGAGINDL LEKARGLAEE IDGDIYVVGA ANAGKSTLLN FVLGQDKVNR SPGKARAGNR NAFKGAVTTS PLPGTTLKFI KVDLGGGRSL YDTPGLLVLG TVTQLLTPEE LKIVVPKKPI EPVTLRLSTG KCVLVGGLAR IELIGDSRPF MFTFFVANEI KLHPTDIERA DEFVLKHAGG MLTPPLAPGP KRMEEIGEFE DHIVDIQGAG WKEAAADISL TGLGWVAVTG AGTAQVKISV PKGIGVSVRP PLMPFDIWKV ASKYTGSRAV NYNFSLSVDI
|
| |
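The coordinate fields in this record are related by simple arithmetic, which can be sketched as follows. This is a minimal illustration, not the database's own method: it assumes that Start bp and End bp are 1-based and inclusive (consistent with the listed Gene Length of 2314 bp), and the function names `gene_length` and `gc_content` are hypothetical helpers introduced only for this example. The GC helper ignores the spaces that group the sequence into blocks of ten bases.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between base groups."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in bases) / len(bases)


# Start bp and End bp taken from the Gene Information fields of this record.
print(gene_length(391864, 394177))  # 2314
```

Passing the full Gene sequence value to `gc_content` and multiplying by 100 gives a percentage that can be compared with the GC content field.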