Gene Information |
Locus tag | PHATRDRAFT_46007 |
Symbol | |
ID | 7201067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 940446 |
End bp | 944426 |
Gene Length | 3981 bp |
Protein Length | 773 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180148 |
Protein GI | 219118763 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGGCTA TTCGCAACCC ATACGCCAAG CCGAAAGCTT CCTTGAAACG GCCAAGGTCG AGTTGGGACC GCGAACGGAG TGACTGCAAG TGCCAGAACC CATCGGAAAA TCAAAACGAC AGCAATGGTG ATCGCTGCCG TGTTCCGCCA CAACCCACAA CGGCATCCGG CACATCACCA TTGAATCGAA GACTTACCAC AATTATGTCT GCGGAGAAGC CAAAGCAAAC GGCCCAATCG ACTATCAGTT CTTTTGATGA TGGGGGCATC GATTGGGAAG CTGCCGTTGC CATGATGGAA CAAGCGACGC CGGAATCGCA CATGAACACC AAATCTCATT CCACTCTGCA TTCCGGTCAC GAAAGCGCCG TTACAGAGAT TCCCATACCG GCAGCGTCTA TTCCTCGGCA CTCACCCAGC AAAAGTCTTA TGTCTACAAA ACGAGGTCCT CCGGATGCTG TCTACGAGCA TCCGTCCACT CGACTAACAT CTGATCCGAC TCGGCTCAAG CCCCCTCCGC GACCTACGAA TACAGCGTCG GTTCCTTCCC TTTCGTCTCG ATCCTCTGTC GCTCTCCCGA CATTAGCGTC CTTACGACCA GCTTCGTGGG CTGGCAACTG CAGCTCCGAT CCCTCTTTGT TTGCTGTTTC GGTTTCTTCT TCGCCTGCCA AGTCCAAATC AAATCTGACA ACGTCGTCGC CCTCAACCCA CCCTCGGTCT CGAACGGCGG TATCACCTTC CATGCGCCCT TCTTCTTGGC AACGCACAGC GCCTCCGGCA TCTCCAGACT CGGATGTTCC CAGTGATCCG CGACAGGTCA ATCTTCCCAA AAAGCTGCAG TTTGATGTCG CGACCGTCCA GCCTGTGCAA GACGAACACC GACAAACGCT AGTGCAAAAT GCTACCCTCT CCCAACCGCT ACTCAATGGG TGGACGCTTT TTTCGCACCA AAAAAAGGCT ATTCTCCAAG GATTGCTCAT GCGGCGCATG ATTCTCGCGC TCGACATGGG ACTAGGCAAA ACCCTCATTG GTTGTGTTTG GGCCAAGGCC TTTTCCAAGA CGCTACAAAC CAAAACAATC GTTATTTGTC CCGTCTCGCT TAGAGACGAA TGGAAACGGA CCGCAATCGA AGCCACAGGA TTGTCCGTAC AAGATGATAA GGACGTGCAG GACAACGTCA ATAACGCGAG TCTTTGCATT GCCTCGTGGG GGAAAATACC TCGCCTCGAC AAACACGACT TGGATCAGCG ACCCTTTGTC GTCGTCGCCG ACGAAGCGCA TAGCATGCAA AGTATGGCCG CCTCTCGAAC CAAGGATGTG CTTGTCTTGT GCGCACATCC CGCTTGCCGA GGCGTTCTCC TGTTAACGGG CACGCCCATG AAAAATGGCA AACCGTCCAA TCTTTTCCCT TTGTTACAGG CCGTGTCACA TCCGTTAGGG CGTCACCAAC TTTCGTACGA ACAGCATTTT TGCGCCGGCC ATTCGGAATC GTATGGACGG AGTCGTCCAG TCTGGAAGGC ATCTGGAGCT TCCAATCTAT CACAGCTGCG CACATTGGTC TCATCGCATC TGTTGCATTT AACCAAGCAG GATTGTCTAA AAGCGCTACC GCCGCAAACA CGTGAATTTC CGGTTGTTCC TATTAGCAGC CGTCAGCAGC TTGCGCACAC ACAGGCCTTG CACGATTTGG CCAAAATATA TCGAAAGACG GGAGAATCCA AAAGTAGCGA AGCTATTTTG GGCGCCGTAC AACGGGTACG TCTCGTTGCT TCCACAGCAA AGATTGACGC GACGGTCGAA 
TACGCCAAGC GCATACTAGA GGCAGAGCCT GCCATTGTGA TTTTTACAAG TTTTGTCAAG GTTGCACAAA ATGTACACCA GAAATTGACT GATGCGGGTT GGCAAGGGTC TTGCTTGACT GGGGAAACGG TCGCAAAATC ACGGCAAGCA CTCGTTGACA ACTTTCAAAA TGGATTGACT GCGTTTTTTG TGTGCACGTT CGGCGCGGGC GGTGTCGGGC TCACTTTGAC GGCAGCGTGT ACAGTGATTC TGTTGGACCG GCCATGGACA CCTGGCGACG CACATCAAGC TGAAGACCGC GTCCGACGTA TCGGACAAAC CAAACCAGTC AAAAGTTTAT GGATGACCGC CTTTGATTTA GACAAGCAAA TTGACTCCAT AATTGAGCAG AAGAGTAAAA CGGCGGCAAC GGTGCTCTCG ACAACAATGA CCGGGAGCTC CAGCGATGCT ACGAGTGAAG GCCCCAAATT GTCCATTTTT CAGCTTATCA AAGCTATTCT GCCGCCAAAC GAATCAAATT GATCGAAAGA ATAGACTGTC GATACCGGGA ACATAAGTCA AACGGAGAAA GCTCAAATCG ACTAATGTGA CCTACGTACT TTGATTGCCG CACGCGGCCA GACTTTCCGT TTCTATATCT GTTGAATTGC TTTCGGCGAG TGTCATGAGC AAATCCTCAA TGGAAATATT CTCATCACTA ACTTTAATTA TGGTCGACCT GATGGCATTT TTCATTTGGC TCGCTCCACA TGTTCCAAGA ATAAAGGGGG CCGTGCAAGA ATGTGTACCG AAAATCTGGG GACTGAGAAC TTGCATGAGC AGAATGATGG ATGTCATATC TGGCAGTGAA TAAAGCACCC TATGTGATAC GAGTAGCTCT TGTACGACAT TTGCATGACA CTCATCGGAA GAGGAATGGG GCTTGCCATC CGAGCTGTAC TTTTACCAGC GAACACAACC TGTCCCGTCT CTAAATCCGA CCCTATCTAA ATTTCCGATG CTATCCAAGG GAGAGGCCAG ATGTTGCTCT GGGGTCAAAA TCACTTCCGC AATTGCACAC GGTTCTTATT CCTCGATCGA TGTTGCTACA ATCCGATCGG TCCCCCACCG AATTGCGTAT TGGCTTTGAG AGAGCACACT TCACGAACAA CTTGCGATTA TCGAGATGCT ACGGATTTGA TTCGAGGAAA TCTGGTAAGG GGTTGGTTTT CTGCGATGAC TCCCTCTCTC GTGTCGCTGA AATCATGGCT GAAAGAAAGC GGATAGTCAA AGCACTACTA GCACTAGAGA AACTGTGGTG AACTCCTTGT TCGAAGTTGG CCTTTTGATT ATGACTTTGT TACTTGTTTC CCCAGATTGA TTTATCGTTT GTGATCTACG CCACACGCTC CATGTAAAAA TCGTCCATGA TTATATCGAG ATCATGGTTA GGGTCCTCAA CCTTGATGTG GAAGTTGCGA ATTTCAGGGT TCCATTTGGT TGAAGCGCCG GGGAATTCAA AGACGGAGCG AAACTCATTG AATACATTGG CGTTCCAAGC TGTCTGATAA TCTCCAAGAG TGAACGTTTC CAATCGCCAA TTTTTGTCAC GAAGGTAAAT CGATACGCGA GGACAAAAGT CGGAGCCCAT CTTGTCGGGT TTTCCATTTG GACTAACGCG ACAATTTACT CCTTTTTCCG TGGTTGGGTC CACAAGACGA AATCGGGCAC GTACTTCCCA CTTGGAACCC GGCGGAGTGC ATCCGTTATA GCTATTCCTG CTGACTCCTA TCCCCACTCC GTCCCACACG CGACTTCGCT TGGAATAGCG 
CAGCGCAGTT TTTGACTTGT AACCAGGAGA GACAAGCGAG ATGGATCCAG AGCCACGAGA GCGCCATTCC GTCAGCTTCG ACTCAAAATT GTTGTTTGTG ACCATGTTAT TGCACCTACT ATCCCAGGAC CCTGGCTTTT GATCTCCGGG CGTCGACACA GGACTATTCA CCTGCCCCCT TGTGGGAGCT ACGGGTACTT CTCTGGGAGC TTGGACAGGT GCTCTCGTAG GAGCCTTCAC CGGAGATCTT GTGGGCTGCT TACGCTCGAT GGCATCCCTG ATCCACGCTG TATCAGAGCA ACTGGCTTTG CCTAAGGAAT CGTTGGTATG CATACTACAA TCGTACGATA CCGTGTCGTC ACCGCACTTG C
|
Protein sequence | MVAIRNPYAK PKASLKRPRS SWDRERSDCK CQNPSENQND SNGDRCRVPP QPTTASGTSP LNRRLTTIMS AEKPKQTAQS TISSFDDGGI DWEAAVAMME QATPESHMNT KSHSTLHSGH ESAVTEIPIP AASIPRHSPS KSLMSTKRGP PDAVYEHPST RLTSDPTRLK PPPRPTNTAS VPSLSSRSSV ALPTLASLRP ASWAGNCSSD PSLFAVSVSS SPAKSKSNLT TSSPSTHPRS RTAVSPSMRP SSWQRTAPPA SPDSDVPSDP RQVNLPKKLQ FDVATVQPVQ DEHRQTLVQN ATLSQPLLNG WTLFSHQKKA ILQGLLMRRM ILALDMGLGK TLIGCVWAKA FSKTLQTKTI VICPVSLRDE WKRTAIEATG LSVQDDKDVQ DNVNNASLCI ASWGKIPRLD KHDLDQRPFV VVADEAHSMQ SMAASRTKDV LVLCAHPACR GVLLLTGTPM KNGKPSNLFP LLQAVSHPLG RHQLSYEQHF CAGHSESYGR SRPVWKASGA SNLSQLRTLV SSHLLHLTKQ DCLKALPPQT REFPVVPISS RQQLAHTQAL HDLAKIYRKT GESKSSEAIL GAVQRVRLVA STAKIDATVE YAKRILEAEP AIVIFTSFVK VAQNVHQKLT DAGWQGSCLT GETVAKSRQA LVDNFQNGLT AFFVCTFGAG GVGLTLTAAC TVILLDRPWT PGDAHQAEDR VRRIGQTKPV KSLWMTAFDL DKQIDSIIEQ KSKTAATVLS TTMTGSSSDA TSEGPKLSIF QLIKAILPPN ESN
|
| |
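The Gene Length field can be cross-checked against the Start bp and End bp fields, and the sequence fields can be measured once their grouping spaces are removed. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not say so, but that reading reproduces the listed 3981 bp). The helper names `gene_length`, `clean_sequence`, and `gc_percent` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def clean_sequence(field: str) -> str:
    """Drop the spaces and line breaks that group residues in blocks of ten."""
    return "".join(field.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (whitespace is ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Start bp and End bp taken from the Gene Information table.
    print(gene_length(940446, 944426))  # 3981, matching the Gene Length field
    # First two blocks of the Protein sequence field.
    print(len(clean_sequence("MVAIRNPYAK PKASLKRPRS")))  # 20
```

Passing the full Gene sequence and Protein sequence fields to `clean_sequence` and taking `len` of the result gives a way to compare them against the listed Gene Length and Protein Length. `gc_percent` can be compared with the GC content field in the same way.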