Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_20188 |
Symbol | |
ID | 7200900 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 142282 |
End bp | 145668 |
Gene Length | 3387 bp |
Protein Length | 693 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179982 |
Protein GI | 219118417 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.248743 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGCTA CGAACAGATC CTCTTCGTCG GTGTTTCTGC CGCCGTCCCA AACGCGCTTG GTCACGATTG CCGCGCACGT GGATCACGGC AAAACAACCT TGGCGGACAA TCTTATCGAG GCCAACGGGC TTATTTCGGA ACGTCTCGCC GGGACTCTCC GCTACCTCGA CTCGGATCCG GAAGAACAAC GCCGTGGCAT TACTATGCGC AGCTCCGCGA TTGGGCTCCA GCACGTGTAC CAGAAACACC ACAAACCGAA CCACACGGGC GGCGATCACA CTCCAGCCAA CCAAGGCCAA AAGCACGTGA TTCATCTCTA CGATTCTCCC GGACACACGG ATTTTTCTCG CGAAGTATCG TCCGCCATGT CCTGCTGCGA TACGGCCTTG CTCGTCGTGG ATGCCGTCGA GGGTATGGGA CCCCGTACAC ATCAAGTCTT TCGGGAAGCC TACGCGCAAC AGCTGGTTCC CATTCTCGTG CTCAATAAAA TCGATCGATT GTGTTTGGAT CTGCGCCTCA CACCCACCGA GGCGTATCTG CGTTTGCGGA ATCTACTCGA AACGGTCAAC GCCGCCGCTT CCACCTTGTT GACCAGCTCG CGGCACGCGG ACCACGCGTC AGGAAGCAAT GGCGATCCGT CAACCGAGAT AACCACGGAA TTGGAAACAC AGTGGACGTT TGATCCGGCC CGTAATAACG TGGTGTTTGC GTCCGCCCTG TTTGGATTTG GATTTACGGC ACAAAATTTG GCGAGAGCCT TGTATCAGAC CAAGGCTATT CCATCGTCCC TTAAACCACC CGTATTTCGT TCCCTGGTTT TTGCCGACGC CAAACTCAAA GGTGATAAGG TACTGAAGTG GAAGGCACGG GACCAGACAG ACGATGCTCC CATTTTTGCC ATCTATGGTT TGCAGCCACT GTGGGATGTT TTGGAAGGCG TTGCGACGGC GGCCGCGGCA GCGGGACTCG GATCGTCACA ACTGTTTCAC CATGGAAGCT CCAACACGGT AGATCATCAC CACAATGGCA CCCCGCCTTC GGTACCAACA ACGACTACTG TGGACGTGAA AATTAAAGCC GACACGACTG GTATGAATCA AACTCTACGA GCCCTGAGTA TCGGACCGAC CGGCAGCGAT GTACCGTCAA CCGTAGAAGC ATTGCAGACA ATCTTGACCC GGACGGGTGC CAATACGGAA GAAGCCATTG TGCGATCCCT GTTACGGCGC TTTCGACCTC TCTCACGGAC ATTGTTAGAC GTCCTTGTCG AATACGCTCC GTCGCCAATC CAGGCTGCAG CGTCGATGCG ACATCGGGCT CTGAGTTTAC AAATGCCTGA GAGAACGGCA ATTACAAATG CTGCTCAGGA GGAATATTCT CGAATTGCGG AGGCGGTTCA AAATTGCAGC GTTGCTCCGA ACGCCCCCAC CGTAGCGCAT GTGTACAAGT TTATGGCCGC GGAACGTTCC CAAATTCGGG ATCCATGTTT GCCTACGAAT CTGGAGAGTC ATGATGAGGA CCACACAAGC TTGATTCTGG GCGTAGCAAG GGTGTTGAGT GGGAGCTTGA AAACGGGAAA GTCTTACTAC GCAATGGGCC CAAAGCATTT GCACACCGAC TCCAATATTG TACCAAAACG AGCTATACGG CTGTACTTAC TCATGGGTAG TTCGTTTGTA CTCGTGGACG AGGTACCGGC TGGACATTTG TGTGGGGTCT ACAATTTGGA AGACACGCAG CTTAAAACAA TCACGCTATC TGACTCGCCC CACGGCATGC CTCTGACTGC CATGGAACAG GGTATCCGAC CCCTCGTGAA GGTCAACGTG GAAGCACAGG AAGCTTCTGA TACCATTGCC TTAGAACGCG GATTACGAAA ACTGGCCTTG GCTGATGCGG CCGTTGAAGT CACAGCGACG GCCAAAGGAG AACGGCTTTT GGCTTGTTTA GGAGAAATTC ATTTGGAACA ATCTATTCTG GATCTTCGGA ATGTTTATTG CGGTAGAGAA ATAAAATTGC GCATTTCTGA TCCCATTGTA GACTTTGGCG AAACCACCGA CTGGTTTGAA CACGAAATCG ACTACGCCAC ATTTTGGGAG GACCCAGCTC CGAGGCTGCG ACAAGTCTCG ATTCCACCAT ACAATGAGGA ATATGGCATA TCCCTTAGCA GACATGGTAG GATGAGATCG TTGGTATCGG GCCGCTCAGC TGCGATTCAT GTACGTGTAG TACCCTTGGC TTCGTCGATC TATCAATCTC TTTCGGACGA CAAGGTCGTG GAGAACACAG AAGAAGATCT GCTGAACCTG GCCAAAGCAC TCGGATATCA CTGTCTAAAT GCGGATGATG TACTGGAGAC ACTCAAGAGC GCGTTGTGCT CTTTGGGTAC GAATGGAAAT GCACTTATAC TAGGACCAGG ATTGTGCAAT GAATCCTGTG TGGTCGGTGT CGTTTCGGAC ACCGGCGAGG TTCACCTCCC ATCAATAGCA GCGGAAAAAT CGGGAAATTC TGACATCGCT CCGGTGGAGC CGGAATCCAC TTCGGCCGAT GTCTGGGACA AAGACGGAGT GGGAATGAAA GAGTTTCGAT CCATGCTAAG AAAGCTTCGA ACTGTTGGAG ATCAGAATGG ACACTCAAAT CTAGAAATGT CAGAAGTGGA TGTTGCTGCA CGAAAAATAT GGAGCGAAGA TATGCGCGGA TCAATGGTAG CTGGGTTTCA ACTTGCGGTT CGGGCCGGTC CAATTTGCGA AGAGCCCGTC CGAAATGTAT TAGTGGTCTT AGAGGGTGCC GAAGTTGGAC TAGCTAGGCG GGGAGATTCT TACGAAGCTG CAAAATCACT ATCTGGAGGA ATGCTGGTAG CCGCTCTTCG TTCAGGTATT CGTTGTGCGC TCTTAAGCAG ACCCGCTAGG TTAATGGAAG GCCACTTGAG ACTTACGCTC CACTCATCCA TGGCTGGACT CGGTCCTCTA TATTCGGTAC TTAACAAGCG TCGCGGCAAA GTCCTAGATG ATTCCATGGT TGATGGTGCT GACTTGCTCA TGATCACTGC GCTTATTCCT CAAGCGGAAG CATTTGGACT CGCACCGGAA CTTTACAGCA ATACCAGTGG GGAGGTCACC GCGCCAGAAC TAAATTTTAG CCACTGGGAT CGACTTGACG TGGACCCGTT TTGGATCCCA ACAAGTTTAG AGGAACGGGA GGATTTTGGC GAGTTACAGA TGGCTGGAGA TATGTCTACT GGTCTGGACA ATACCGCTCT CAAATATATT CGCAAAGTTC GAGAACAAAA AGGCCTGACT ACTGACTCGG CCCGTACAGT TTTAAATGCC GAAAAGCAGC GAACACTTAA GCGATAGAAA TAATGAAAGG AGTACGTAAC TTGTAAC
|
Protein sequence | MSATNRSSSS VFLPPSQTRL VTIAAHVDHG KTTLADNLIE ANGLISERLA GTLRYLDSDP EEQRRGITMR SSAIGLQHHV IHLYDSPGHT DFSREVSSAM SCCDTALLVV DAVEGMGPRT HQVFREAYAQ QLVPILVLNK IDRLCLDLRL TPTEAYLRLR NLLETVNAAA STLLTSSRHA DHASGSNGDP STEITTELET QWTFDPARNN VVFASALFGF GFTAQNLARA LYQTKAIPSS LKPPVFRSLV FADAKLKGDK VLKWKARDQT DDAPIFAIYG LQPLWDVLEG VATAAAAAGL GSSQLTAITN AAQEEYSRIA EAVQNCSVAP NAPTVAHVYK FMAAERSQIR DPCLPTNLES HDEDHTSLIL GVARVLSGSL KTGKSYYAMG PKHLHTDSNI VPKRAIRLYL LMGSSFVLVD EVPAGHLCGV YNLEDTQLKT ITLSDSPHGM PLTAMEQGIR PLVKVNVEAQ EASDTIALER GLRKLALADA AVEVTATAKG ERLLACLGEI HLEQSILDLR NVYCGREIKL RISDPIVDFG ETTDWLMEGH LRLTLHSSMA GLGPLYSVLN KRRGKVLDDS MVDGADLLMI TALIPQAEAF GLAPELYSNT SGEVTAPELN FSHWDRLDVD PFWIPTSLEE REDFGELQMA GDMSTGLDNT ALKYIRKVRE QKGLTTDSAR TVLNAEKQRT LKR
|
| |