Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_54310 |
Symbol | |
ID | 7199562 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 428981 |
End bp | 432477 |
Gene Length | 3497 bp |
Protein Length | 531 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178771 |
Protein GI | 219115952 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGTCGG CTGTTGCTGA AAACGCTGCG CGAAAACGTG CAGAACTTGG AGATGTCCCG AAGAAGACCA TGAATCAGAC TATGTATCTT GTCAAGTGGG CAGGTCTGGG GTACGAGCAT TGCAGTTGGG AAACGAAAAA AGACGTCAAT GATGACAAAC TTATCGCTGA GTTTCACAAG CTCAACAATA CGTTTCCTGA TGAGCCAGAT ATGCCGATGG AAGTCGTTGA CGATTTTATC GAGAGTACGA AGCACATTAA CGTTGAAAAT GCGGGGGGAA TCTCTTGCAT ACCGAGTCTT CGTGCCCAGT TGTACGCTCA AAGCCGATCA TTTCATTTCG CGAAATTTGG GATGAACATT CCCGAGAAGG TCGGCGCAGA ATGTGGCCCA AAAACCAGAG CTGCATGGCA TTACCAGTTT TCTAGTGATG ATGACAGTGC CAGACATCAT TCTACCGTTC CGCGGGAAGT GATTGAGTGT GTTTCTGATC TCGTCTTCCA GGTCGCTAGG AAAGAGCCTG TCAGCTTTAT GCGTGCCAAT ACGTCGCTGC CTCCGCCGAT GACTGGAGAG TATGATGCTA TCCTACCTAT TACTTCGAAA GGATTGATGA TGAACGTAGG TGAAATTCAT GGGTCCGTAG CTTTTCTCGG TTATCGAACC TTTCCCGATG GATCAAAAGG ACCGGCGGAC ATCGCGAATT TGATTCGGAA CGTGGGAGAC AAAATCATCG CTGTTGATGG TTCCAGTACG ATCGGGAAAA CCTTCAAGGA GGTAATTTTG ATGTTGCGCG AAAGCGGCAA GAACAAGTTT GCTTACATGC GCTTCCTTGA GACCAAATAT GCCGTTTGTG ACAATGATCT TGCCAGTGTC GGAAAGAAAG GACGTTATGC TATCGAAGAA CTTCAAAAGA AGGTGGCCAT GGATCGACAG CGTCTTGTTG TACAAAGGAA ACATTTGCTT TCGGTGGATG AAGAACATGT TCCCGGGGAC ATAGGAAAGG ATCTTGCCCC AAAGGTCGAA GACTCAGATG AAGAATCCGA GGAAGGAAGT GAGGGTGAGT TTGAACCGGA AAGTGACGAC GAGGATCTTG TGGTGACGGG AAAGACAGGC GAAGTGGCCA CGGGTCCAAC GGTTTCCGAT TCTCTAGAAC GTATTGATTC TGTGCCTGCC GTCTCTCCAA CTTTGAATTC TGAGATGGCA GCTGAACGAC ACTCTGGGGT AAAGAAAAGC GAGGATGATA TAGGAGGGGA ACCAGCACCA AAAATCGACC AGGAAACCAT TACTCCGGTT GAGGAATGCT CTTCAGCCTT ATTGGGTCCT GTCTTTCGTC ACGAGACTAC TCGGTCTCTA GCCTATCGTT TACTTGGTGT AGATCTCGGC TATAGTAGCG ACGAAGGTGG AGACGATGAC AGCGCATTTT TTGTTGATGG TGTCGATCAG ACATACACTT CAATGCAGCA ACTTCAAGAT ATTGTCCGTC TACCGGCCGA AAGCGAAGCA AAATCAACTA CTCCTGTAGA CGACAGTACC ATTCCAGTGC GCCAAAACGA GTTTTCTGTA ATGGGCGATA GATCAAAACT TGCGACCGCA GTTGCTCTTA CGTCGAAAGA GCCTTCGACT GAGGAATTTG ACAATTTTCC TTTGCCTTCG TCGAAAGAGT TATTGGCCTC AGAAAAAGAG CAGCAAAGCC AGCAAGCTAA CAGTGCAGAG CTTCTATCAA AGTCAAGCAA ACGTTCAACT GTCAAAGTAG AGCAAGTTTC TATCGTAACC GGGGACATTA TACACATTTG GGCAAATGTT GAGTCTGCGG CTGCGACGCT TCAGCTTCCG CTCCCCCAGC TGAGGCAGGT TCTCCGGGGA GAATACGACG AAGAAATTGG CGATGAAGTC GGGGGTTACA AATGGCGCTA TGCCTTGGTG GGGGCGAAGG TCACTGCTGG AAATGGATCG ACCGGGCGAG GTGGCAGCGG ACGAAAGGCT AAAGAGGCAT GGCTTGAGTT TCGCGACAAG CTCTACGACC CCAACGAGCC CCACAGCTAC AAGAATGGCA ATCGCCTTCG AGATTATCAA GTAGAGGGAG TCAACTGGCT AGCGAGTACT TGGTACAAGA AGCAAGGATG TATTTTGGCT GACGAGATGG GTCTCGGAAA GGTACGTGTC GGACGACTTC ATAATGAATG TTGGGGTACT GGTCTTACAC CGATTGTATG TTCTACTGTA GACTGTACAA ATTGTCTGTT ATATCGAGCA CATTTTCCGT GTTGAAAAGG TTCATCGACC ATTTCTCGTC GTGGTTCCGT TGTCAACAGT GGAACACTGG CGGAGAGAGT TCGAGGGCTG GACTGACATG ATATGCTGCA TCTATCATGA CAGGCAAAGG GTATGGCGAG ATGTTTTACG AGAATACGAA TGGTATTACG AAGATCGCCC ACACACGGCC GAGTTCCTTA AGTTCGACGT TCTTGTGACC ACATATGACA CCCTGATTGG AGACTTTGAC GTCATCAGCC AGATCCCGTT TCGAGTCGCT GTTGTCGACG AGGCGCATCG GCTTCGCAAC CAAAAGGGTA GACTGTTGGA ATGCATGCGG GAAATTAGCG CGAAGGGTAC CATGCAGTAT GGTTTCCAAA GCCGCGTCCT TATGTCTGGA ACTCCTCTCC AGAATGACTT GACGGTGCGT TCATTATTAC TTCCTCTGTC TTTTTGCTCT AGGTCCGCGT CTCAAAAGCC ATTTTTGTTT TTCTTGCCAG GAGCTTTGGA CTTTGTTGAA CTTCATTGAG CCGTTTAAAT TTCCCGACCT TGATAATTTC CAGTTGAACT TCGGGAATAT GGCCAATAGA GAACAGGTCG AAAGTCTGCA GCAGATGATT TCTCCGTATA TGCTACGACG AGTGAAGGAA GACGTGGCCA AAGATATTCC AGCGAAGGAA GAAACTGTAA TTGACGTCGA GCTCACTAGT ATTCAGAAGC AGTACTATCG AGCTATTTTT GAACACAATC ATGCCTTTTT GAATATTGGG GCAACACGAA ACACAGCACC AAAATTGATG AATATCCAAA TGGAACTTAG AAAGGTTTGC AATCATCCCT TTCTTTTGGA AGGGGTTGAG CACAGAGAAA CAGACAGACA GTTTAAGGAA TTTTCGGAAA AGGGTCTCTT CGAAAACAAG GCACCGGAAG AGCAACAGCG TCTTCTGAAC GAGCATGGCT ACATCATGAC AAGTGGAAAA ATGGTTTTAT TGGACAAGCT ACTCCCGAAG CTGAAGCAAG AAGGTCACAA AATTCTTATA TTTAGTCAAA TGGTAAAAAT GCTTGACCTG ATCTCAGAGT ACTGCGACCT GCGAGACTTC AGATATGAGA GACTGGATGG ACGTGTAAGA GGAACGGAGC GACAAAAATC AATCGATAGA TTTGAGAACG ATCCAGAGAG TTTCATATTC TTGCTTTCGA CTCGAGCGGG TGGTGTCGGA ATAAATCTTA CGGCGGCTGG TATGTAG
|
Protein sequence | MLSAVAENAA RKRAELGDVP KKTMNQTMYL VKWAGLGYEH CSWETKKDVN DDKLIAEFHK LNNTFPDEPD MPMEAKEAWL EFRDKLYDPN EPHSYKNGNR LRDYQVEGVN WLASTWYKKQ GCILADEMGL GKTVQIVCYI EHIFRVEKVH RPFLVVVPLS TVEHWRREFE GWTDMICCIY HDRQRVWRDV LREYEWYYED RPHTAEFLKF DVLVTTYDTL IGDFDVISQI PFRVAVVDEA HRLRNQKGRL LECMREISAK GTMQYGFQSR VLMSGTPLQN DLTELWTLLN FIEPFKFPDL DNFQLNFGNM ANREQVESLQ QMISPYMLRR VKEDVAKDIP AKEETVIDVE LTSIQKQYYR AIFEHNHAFL NIGATRNTAP KLMNIQMELR KVCNHPFLLE GVEHRETDRQ FKEFSEKGLF ENKAPEEQQR LLNEHGYIMT SGKMVLLDKL LPKLKQEGHK ILIFSQMVKM LDLISEYCDL RDFRYERLDG RVRGTERQKS IDRFENDPES FIFLLSTRAG GVGINLTAAG M
|
| |