Gene Information |
Locus tag | PHATRDRAFT_46400 |
Symbol | |
ID | 7201772 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 224405 |
End bp | 226449 |
Gene Length | 2045 bp |
Protein Length | 567 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180781 |
Protein GI | 219120068 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCTCATCCA CCGCGTCCAC CATTCCGTAG TTTTCCAAAA ATGGAGATGT TTGATTGTTG TTTCACTTGC TTCGAGCGGG GATACGAGTC GTCCGGCGAC GAAATCGAAG ACGACGACGC CGAGAATCTG TCCCACGTGG GGCAGGCCGC CTACGACGTC TTACGCAAAC GACACAATCG GCAGCACCAC ACGTTATGGA ACGTCACGTC CGGTGTCGTT GTGGGAGAAC TGCACCAGAC GCCCCTCACG GGATGGCTCC GCGACGTCGA ATATCCCCGG GACGGCCACG ACGATTGGTT TCCCGAGAAA ATGGCGGAAA TCATGGCCCG CACCGAAACA TGGTGCGACG TTATGAGCCT GGGACCTCCG GATGGTTTGT TCATGACCCA ATTCCAGGAA GCACTGAAGA CGATCGCGTT TCGTGCGACA GGCAAGACCA AGCCGGTCGT CGTGCGCATG ATGTTTGGGA ACATTGTCGG CATGCCGGTC AACTGCAACA AAGTCATCAA GGCGTTGACG GCACTCGTTC CCAAAAGTGC CAACATCAAC CTCTGGGTGG GTGCTTGGCG TCGTGGAGTT TCCTGGAATC ACGCCAAAAT TATTGCCGTC GATGGACAGT ACCTGCACAC GGGAGGTGAG CTTTTCGACA CGGCAGACTG CCCCCGGTTT GTCTACGAAA TCTCTCTCCA CACGCGCACG GCTTCTCACC ACCTGCGTAT CTCTGTGTGC TTTACCCCCA CACAGGCCAC AATATGTGGG ACGCGCACTA CTTGAAGCAA AATCCGATAC ACGACCTTTC GTTTGAATTA CAGGGCCGTG TGGCGCACGA TGGGCATCGG TTCGCGAACG AACAATGGGA CTTTATTGAG TTCAAGCAGG ACACCTGTTG TGGACAGTTC GTCGACAAAA TCCCCGATGG TATGCCCCTC GTTGCCAAGA CAAGGGTTAC GGTGAGTGAA TTTCCGCGGG GGCGGACTGC GGAGTTCCCA CCCCGGTACC GCAAATCCTT GACGCCGATG CGTGATCCGC TGCCCAACGA AGTGCCCATG ATCACCGTGG GACGATTCGG GACAATACTG CGCCACGCGC GTCCAGCCGA TGACGCATTC TTAGCCATGC TCGGGTCCGC CAAAACAATC ATCCGAATGG CCTTGCAAGA CCTGGGCCCG GTGTGCATTC CGGGCACCAG AATTCCGTTG CCCGGTTGCG TTTGGCCCAA GGAATACCTG TCCGTGCTGG GTCGCGTTAT CTGGGAACGG CACGTGGATG TGGAGATTAC GTTGAGCAAT CCTAATTCGG TCCCGGGAGG ATTGGGCGCG ATAGAAGCTT GCTACGGCAA CGGCTGGACC TGTGTCGATG TGGCGGCCGA AATCATCAAG ACGATCAAGA AACAATTTCC ACAAGCGCAA GATGCGGCAC TGCGGAAGTG TGTCGCCGAC AACCTGCGCG TATGTTTTAT TCGCGAAAAG CGTGGACACG AATACGACGA TGGAGCCACG ATGGGCATGC ACGCCAAGCA TTTTATCGTG GACGACGTTT GTTCGTACAC GGGATCGCAG AACTTGTATA TTTGTGACCT TGCGGAATGG GGTGTGATAG TGGACGATCC GGTCGCGACG AAAAAAATGA TGGATGAGTA CTGGACGCCC ATGTGGCAGG TTTCGTACAC GGGTCAAGAT TGCGATGTGC AGGCTGTTAT GGATGGTCTC AAGATCAACC GTGATGGTGA GAATCCTGTC TTTATGTCGG CCGAACAGAA ACAGCAAAAG TTAAAAGCGA CTCGCTTGCA AACACACAAT 
CCGGGAAACT CGGAGTATTA CTACGAAGGT GATACCGAAC ACTCTTCGGA ATAGAATTGG ATCATCCAGT GACGCGCATG AAATGGACTT CTCGACGCTC TCTGTTCGCG GAGTAATGGC AGTGCGAGCA GGCTAATTCC ATCGCGTGTG TGGTGGTTTG CGCAAATTTC CCTCGACAGA CCTGTACAGA AGCTCCCATA TTGCTGTACA CAAGTGTATA TGTAAATAGG TTACGTTTGT ATTTG
|
Protein sequence | MEMFDCCFTC FERGYESSGD EIEDDDAENL SHVGQAAYDV LRKRHNRQHH TLWNVTSGVV VGELHQTPLT GWLRDVEYPR DGHDDWFPEK MAEIMARTET WCDVMSLGPP DGLFMTQFQE ALKTIAFRAT GKTKPVVVRM MFGNIVGMPV NCNKVIKALT ALVPKSANIN LWVGAWRRGV SWNHAKIIAV DGQYLHTGGH NMWDAHYLKQ NPIHDLSFEL QGRVAHDGHR FANEQWDFIE FKQDTCCGQF VDKIPDGMPL VAKTRVTVSE FPRGRTAEFP PRYRKSLTPM RDPLPNEVPM ITVGRFGTIL RHARPADDAF LAMLGSAKTI IRMALQDLGP VCIPGTRIPL PGCVWPKEYL SVLGRVIWER HVDVEITLSN PNSVPGGLGA IEACYGNGWT CVDVAAEIIK TIKKQFPQAQ DAALRKCVAD NLRVCFIREK RGHEYDDGAT MGMHAKHFIV DDVCSYTGSQ NLYICDLAEW GVIVDDPVAT KKMMDEYWTP MWQVSYTGQD CDVQAVMDGL KINRDGENPV FMSAEQKQQK LKATRLQTHN PGNSEYYYEG DTEHSSE
|
| |
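The length fields in this record can be cross-checked against the coordinates and the sequences. The following is a minimal sketch, not part of the source database: the helper names (span_length, clean_sequence, gc_percent) are hypothetical, and it assumes that Start bp and End bp are 1-based and inclusive, which is consistent with the reported Gene Length of 2045 bp.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of 10."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean_sequence(seq)
    if not s:
        raise ValueError("empty sequence")
    gc = sum(1 for base in s if base in "GC")
    return 100.0 * gc / len(s)


# Start bp and End bp copied from the record above.
print(span_length(224405, 226449))  # 2045, matching the Gene Length field

# The first two blocks of the protein sequence: 20 residues once spaces are removed.
print(len(clean_sequence("MEMFDCCFTC FERGYESSGD")))  # 20
```

To check the full record, pass the complete Gene sequence and Protein sequence values to clean_sequence and compare the resulting lengths with Gene Length and Protein Length; gc_percent on the gene sequence can be compared with the GC content field. Because the gene is marked as spliced, the genomic span is expected to be longer than three times the protein length.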