Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49054 |
Symbol | |
ID | 7195424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | - |
Start bp | 440854 |
End bp | 443415 |
Gene Length | 2562 bp |
Protein Length | 853 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183749 |
Protein GI | 219127034 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCGCC CATATGATGG CCAGACTCCA GCAGAGCAGG CAACTTTCGC TCGTGAAGAG CTTATGCATC TGGATGAGGA ATCTATGATG GACGACTACA ACGACGACGA GACGGAGAAT AACCAAAGTT TTCTCAATTT GCTCCCGGAA GACGACGAAG CCGGAAACGA CGACCCGGCA CAGCTCTTTG CCATTCTTGA AGATGCCATC AATCAAACCG CCAAACAAAC AGGCGATACA GGGCGAGCTT TAACGGCGGT AGAACAACAG CAAAAGCGCG AAAACGCGGA ACACACGTGG GACCGCGTGC GGCGCTGGAT GTGGGCGCAT CCGCAAGTGG AACAACGCCA AGCCGCCGCG TTTATTCGCG GGCAGGCCGA CGCCACAGCG TTGCATCTCA TGTGCAAACT GCATAATCCT CCCGACGATA TTATCCAAGC CATCGTCGAC TCCGCCGTCG ATGTTGTGTC CTGGACGGAC GCGCACGGTT GGTTGCCTTT GCACCATGCC TGCGCTAACG GAGTGTCTCC GGAAACCATG AAAGTCCTCG TGGATGCCTA TCCCGCCGGT AAGCTGCAAC AGGACAATAT GAGGCGAACA CCGCTCCACT TTTATGCCAC GCGGAATTCC GATAGTACGG TTATCATGAC AACTAACACC GAACTCCTGG CTGACGATGG CGCGGCCGAA CTCACCGATC GTGGCGGCAT GTTACCCATG CATTACGCGT GTGCGTACGG GACCGACCCG GCCGTACTCG AAGTCTTGGC TCAGGTCTAC CCCGCCTCTC TCACGGCCAA AGAAAACAAG GGTCGAACGC CGATGCATTT GGCCATGGTA AATGCTCACC GCGACGCCAG TCCGAACGTG ATTCATTTCT TATTGGAGTA CGCTGAATCC CGAGCGACCG TCAACGTACG CGATCAAGAC GGTTACCTAC CCTTGCATTT GTTGGCTCTC GGTCTCAAAG GATACACGGC GGATGAATCC ACCAAACGCA GTAACGTGAG CGCTTGCTTG AGTATGTACA TGGACGCGGA GCCTCAAGCC GCAGCCGACT TTTTGACGGC GATTCAGGAT TTGCCGGATT GGCTTCAAGA TACTGCAGTC GTATCGAAAC ACGTGCGGAA CGTTTTGAAC TACAAAATTG TTCAGCCCTT TCCCACGTCT ATTCTCATGT TGGACGGATA CTTTCTGATT GTCCTGATTG TATGTTTTGC GTTCTCGACG AAATATTATA TTGATTTCCT CTTCGAGCCG GAGAGCAACA AAACAGACAT TGACATTTAC ATTGTGTTCC TTTTCATTGG AGCGTCGTAC TTTTTGATGC GCGAGCTGGT ACAAGTGGCT TCGTTGGTTA GTCTCGGCAG TTTCAGTAGT TGGTGGTCAG ACGCCGCCAA TTGGTTGGAT GTTGGTGTAA TTACAATGGT TTACTACTAC GCCATCTGTA TGAGCATGGA CAGCCCAGTG GATAGAAATC TTTTTCGAAG TGGCGCCGCC TTTGGACAAG GAATTCTGTA CGTGGCAGTC ATCGTTTATC TCAAAAGCAC ATACGTCGAC TTTGCCGTAT TCGTGGGTGG CGTACTGTAC GTCGTACAAC GGCTAACGGC ATTTCTGACT GCCGTGGGTG TTATCTTGTT GGCCTTTGCC CAAATGTTCT ACTTTATATA TGTGGATGGC GAAATGTGCA CCAACAGGCT GCAGCCCGAT GATCTAAACT ACGACGCAGA CCTTGACTGT CGCTTTTCTC ATTGCTCCTT TGAAGAATCC CTTCTGCGAG TGTACAGCAT GATGATGGGA GAGATCGGAG ATGAAAAACG CTATTCCGCG GGCCCGACGA GTCTAGTGGC ACAGATATTG TACGTCGGCT ACGCCTTTTT GGTTGTCATT CTGCTGTCGA ATGTTCTTAT TGCCATTGTC ACCGATAGCT ACGAAATTAT TCAGAACGAC CGCGCTGCGA TTGTATTTTG GACGAATCGG TTGGATTTCG TGGCCGAGAT GGATGCCATC ACGTACGGAG CACAGCGTCG CCTTGGTTGT TTTCGAAAAA AGAAGGCGCC CATGCAGGTA GAGGAAATTC CCAATGCCCC GGGATTAGTC TCGGATGACT ACAATGACAG GAATGGCTCT TCCGACTTCT TTCGATACGG TTGGCAACAA GTCATGCTCT TGTTTGACGC AAACCTGTAC GAAGAGATCG ATCCTATCGA GTCGCTGGTG TACAATATTT TTCGAATCGC TTGTATTGTT TTCGTTATAC CGTTGTGGTT GATCATGGGC GCGGCAACTG CTGGGTGGTT ATGGCCTCCA CAAGTACGAG AGGCTCTATT TTGTCAGAAA GGCACGACCA CCTCTCGTTC GGAGATTGAG CGCAAAAAAC TCGAGCAGCT GCGAACTATT CAAACCGATC TGACATCATT GAAGAGTGAT ATAGTGCGAG AAATGACTAG CGATCGCGAC GAAATGATAC GAATGAAGCT GGAAGTGGAA GGAGTCCAGG CAGAAGTCTT ATCCGATCTA ATACAAGTTC GAGAGCTTAT CACGTCGCTT CTTGGAACTT AA
|
Protein sequence | MSRPYDGQTP AEQATFAREE LMHLDEESMM DDYNDDETEN NQSFLNLLPE DDEAGNDDPA QLFAILEDAI NQTAKQTGDT GRALTAVEQQ QKRENAEHTW DRVRRWMWAH PQVEQRQAAA FIRGQADATA LHLMCKLHNP PDDIIQAIVD SAVDVVSWTD AHGWLPLHHA CANGVSPETM KVLVDAYPAG KLQQDNMRRT PLHFYATRNS DSTVIMTTNT ELLADDGAAE LTDRGGMLPM HYACAYGTDP AVLEVLAQVY PASLTAKENK GRTPMHLAMV NAHRDASPNV IHFLLEYAES RATVNVRDQD GYLPLHLLAL GLKGYTADES TKRSNVSACL SMYMDAEPQA AADFLTAIQD LPDWLQDTAV VSKHVRNVLN YKIVQPFPTS ILMLDGYFLI VLIVCFAFST KYYIDFLFEP ESNKTDIDIY IVFLFIGASY FLMRELVQVA SLVSLGSFSS WWSDAANWLD VGVITMVYYY AICMSMDSPV DRNLFRSGAA FGQGILYVAV IVYLKSTYVD FAVFVGGVLY VVQRLTAFLT AVGVILLAFA QMFYFIYVDG EMCTNRLQPD DLNYDADLDC RFSHCSFEES LLRVYSMMMG EIGDEKRYSA GPTSLVAQIL YVGYAFLVVI LLSNVLIAIV TDSYEIIQND RAAIVFWTNR LDFVAEMDAI TYGAQRRLGC FRKKKAPMQV EEIPNAPGLV SDDYNDRNGS SDFFRYGWQQ VMLLFDANLY EEIDPIESLV YNIFRIACIV FVIPLWLIMG AATAGWLWPP QVREALFCQK GTTTSRSEIE RKKLEQLRTI QTDLTSLKSD IVREMTSDRD EMIRMKLEVE GVQAEVLSDL IQVRELITSL LGT
|
| |