Gene Information |
Locus tag | PHATRDRAFT_44950 |
Symbol | |
ID | 7199845 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 732116 |
End bp | 733934 |
Gene Length | 1819 bp |
Protein Length | 567 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178836 |
Protein GI | 219116082 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
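The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence is one codon per residue plus a single stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 732116
end_bp = 733934
protein_length_aa = 567

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the coding sequence is one codon per residue plus one stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

# Bases inside the gene span that are not accounted for by the coding sequence.
non_coding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, non_coding_bp)  # 1819 1704 115
```

Under these assumptions the computed span matches the reported gene length of 1819 bp, and the 115 bp not covered by the 1704 bp coding sequence is consistent with the gene being marked as spliced.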
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0434742 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCACCA CGGAACGGAG AGGTAGCGCG GTGGCGGCAG CCGCCGCTGC CGCGCTCCAA CTCTTGCCGG AACCTCTCAC CAATCACGGA GTCAAACGGA TACGAAAGGA AGTACAGTTG GTGAGCAAAA AGACCGGGAC ACCCACCGCG CGCGACGCCA ACCACCACAA CAATAGTAAC AACAACAACC ACAACGCGGC GTACGGGTCC ATCCACACCG CGCCACACGC CAAGGGGACG CCGCCCGCCG CGTCCTCCAA ATTCGGACCC TCCTACGAAC GCAGCGACGA GGATTACACG GCGGAAGACG ATGAACTGCA CTTGACCTAT ATGCAGCTCC TGCGCAAAAA TCGTCCCTTC CGACTCTACC TTGCCTCTTA CGTTGCCAAT CACGTCGGCG AGTGGTTGAC CTACCTCGCC AGTATCTCCG CCATTGAAGC CATACAACTC GCTGCCGGAT CCACCCACAC CTCACGCACC GCCGTCAGTG CCCTCATTAT CGTACGGCTC ACTCCCAATA TTCTCCTCAG TCCCTTTGGC GGAGTACTCG CGGATGGATT GGATCGCCGA CAGAGTATGA TTCGTTTGGA TGTCGCTGGC GCGTTGGTCG CCTTGGTATT TTTACTAGCC ATGCATCGAC ATTCAATCGC CCTCATCTAC ATCGCGACCT TTCTGCAAGA ATGCGTCGCC GGACTCTACG AACCATCACG CTCCGCCATT ATTCCGCAAC TGGTTCCGGA AGAGAATCAT CTCAAACTCG CCACTACCTT GGCCGGGTTG GCCTACGCCG CCGTTGCGGC CTTTGGGAGT GCCGCCGGTG GCTTTTTGGT TGCCCTCGTT GGTATTCGAG TCTGCTACGG TACGTACCTA CCTATTCCGT GGCGTGAGGT GTGTCCGTAG GGAACCATAG GACCTCACCC GTGTTGTTGT TGTGTGCCGT TGCGGTTCTT GTTGTTGTTG TTGGATTTTG TCAGTCATTG ACAGTGCCAC CTACATGCTC AGTGCCTATC TAATGGCCTT GGTGGGAGGC AAATGGAACG TCTCCGTATC ACCCTCGGTC TCACAACCCG CACAGTCTCC GTGGACTTTG GTCAAAAGCA TGATTTGGGA CGGCTACAAG TATCTCAAAG GCTCCGGTTT TGGAATTTTG ATCTTTTGGA AGTTTTCATC AGCCCTGGGA TACGGTGCCA TGGACGTTTT GAACGTGTCC TTTTCCGAGC GAGGAAATTT GGGAAATCGC TCCACGCGCT TGGGTTTTTT GTTCGCCGCC GCCGGAATCG GTTGCCTGGT GGGCCCGCTC GTGGCGGATC GCTACACCGA CATGAATTGC CCCAAATCGT TGCAGTTGGC CTGTTTGTAC TCGTTGCTGC TGTCGGCAGT CGGCACACTC TTGACCGGTT TGCCGAGTCC GTTTTGGATT CTGTGCGTCT GGTCGGCGGT CCGCTCCATG GGATCGTCCG TGCTGTGGAT CAATTCGTCG CTTTTATTGC AAAAGTTTTC CCACGCCGAA ATGCTCGGAC GCGTTTCGGC CATGGAATAC GCGCTGGCAC TCTTCGCCGA AGCCTTGTCC GCCGTACTGG CCGGTGTTTT GCAGGACAAT CTGAGTTGGA CCGCCGAAGA AGTGAGTATC GGACTGGGCG CTTTTGGGGT GGTACTGAGC CTGGCTTGGG GCGTTTACCA CTTTCGCGGG GGCGGAGCGG CCTCGGCGGG CGGCTCCGGT CCCGACGAGA ACGGCCAGAT GGAGGCCCCG GAGGAACGGA ACGAGGCCAC ATCCGAACAG ACCTGTTTAA TGGAAAGCGG 
AAATGAATAC GATATGTAA
|
Protein sequence | MPTTERRGSA VAAAAAAALQ LLPEPLTNHG VKRIRKEVQL VSKKTGTPTA RDANHHNNSN NNNHNAAYGS IHTAPHAKGT PPAASSKFGP SYERSDEDYT AEDDELHLTY MQLLRKNRPF RLYLASYVAN HVGEWLTYLA SISAIEAIQL AAGSTHTSRT AVSALIIVRL TPNILLSPFG GVLADGLDRR QSMIRLDVAG ALVALVFLLA MHRHSIALIY IATFLQECVA GLYEPSRSAI IPQLVPEENH LKLATTLAGL AYAAVAAFGS AAGGFLVALV GIRVCYVIDS ATYMLSAYLM ALVGGKWNVS VSPSVSQPAQ SPWTLVKSMI WDGYKYLKGS GFGILIFWKF SSALGYGAMD VLNVSFSERG NLGNRSTRLG FLFAAAGIGC LVGPLVADRY TDMNCPKSLQ LACLYSLLLS AVGTLLTGLP SPFWILCVWS AVRSMGSSVL WINSSLLLQK FSHAEMLGRV SAMEYALALF AEALSAVLAG VLQDNLSWTA EEVSIGLGAF GVVLSLAWGV YHFRGGGAAS AGGSGPDENG QMEAPEERNE ATSEQTCLME SGNEYDM
|
| |
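The sequences above are displayed in blocks of ten characters separated by spaces, and the gene sequence wraps onto a second line. Before any length or composition check, that whitespace has to be removed. The sketch below shows one way to do this and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the final three blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Final three blocks of the gene sequence above, including the line break.
tail = "TGGAAAGCGG \nAAATGAATAC GATATGTAA"
cleaned = clean_sequence(tail)

print(len(cleaned))          # number of bases after removing whitespace
print(cleaned[-3:])          # last codon of the fragment
print(gc_fraction(cleaned))  # GC fraction of this fragment only
```

Applying the same two helpers to the complete gene sequence would give values comparable to the Gene Length and GC content fields, assuming the database computes GC content over the full gene span.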