Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_19901 |
Symbol | |
ID | 7200552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 174994 |
End bp | 177906 |
Gene Length | 2913 bp |
Protein Length | 900 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179808 |
Protein GI | 219118050 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCTCGCGCA CTCACCGCCG CACATTATCG AGTGCGCCGC CGCCGAAGCC GGGGCCTCGC AAGATGCCTG TCATCTCCCG CCGGCCACGC GGTTCGCCGC CGTCGGACAA AAGGGCGCCA CGCTCTGGAT GACGGGATGT TCCGGTGCCG GCAAAACCAC CATTGCCACC GCACTCGAAG ATCAACTCGT CAAGAGTTAC GGGAAACACG TCTACCGTCT GGACGGGGAT AACCTCCGCA CCGGACTCAA CCGTGATTTG GGATTCTCCG AAGCCGATCG CGCCGAGTCG GTCCGACGGA CCGGGGAACT CGCCACACTC TTTGCCGACG CCGGTGTCGT CACGCTCGTC GGACTCATCT CGCCCTACCG CAAGGATCGC GACGCCGTAC GCAAACGTCA CGTCGACCAA GGCATTCCCT TTTACGAAGT ATTCCTCGAC GTGCCCGTGG ATGAACTCAA AAAACGCGAT CCCAAGGGAC AGTACGCTCG TGTCGAGTCC GGAGAACTCA AACACTTTAC CTGCATCGAC GACCCCTATG ATGAACCCTT GCAACCAGAA ATTACCCTCA AAACGCACGA ACTCACCATT GAACAGTCGG TGCAGATTCT CTTTCGACGA CTCGAACGAG ACGGAATTCT GGTCGGGGCG CCCAAACTTA GTCCGCCCGG TCTGCCCAAC CCCGACGGGG ACGTCTTGGT GGACTTGCAC GTTCCCGACG AATCCAAAGA AGCCCGTCGC GCCGAGGCGG CGACCCTCCC CAAGGTCTTG ATCAACGACA TTGATCTCAA CTGGTTGCAA ACCATTGGGG AAGGCTGGGC CTCACCGCTC CGAGGTTTCA TGCGCGAAGG CACACTGTTG GAAACCCTGC ACTTTAATTC GATCCTCACG GATCCCTTCA ACCTCACGGG CAACGCCCTG CGACTGGAAA CCCGCACGAA CTTTGATCAC TTTTCCGCCC ATCCGGCCCC CCAACGCGTC TCCATGCCCA TTCCCATCAC CCTCTCCTGT ACATCTTTTA CCAAGGACCT CATTGACGCC TCGTCCCACA ACGCCGTCGC TTTGGTGACA CAAATGGGAC ACACCGTGGC CATTCTACGC GATCCCGAAG TCTACGCCAA CCGCAAGGAA GAAATCGTGA CGCGTATGTA CGGTGTCGTG GATCCGGATC ATCCCTACAT TCAACACATT TATCGGGGCG GCGACTACTT GATTGGCGGA GAAATCGAAC TGCTGGATCG CATCCGCTAC AATGACGGCC TCGACCAGTG GCGCAAAACA GCGACGGAGC TCGTGCAAGA GTTCCAGAGC AAAGGGGCCG ACACGGTGTA CGCCTTCCAA ACGCGTAACC CGACCCACGC GGGTCACGCG TACCTGATGC GTTCCGCCGG TGAAGACCTG CGTCGTCAGG GGTACCAGAA ACCCGTCCTG TGGTTGAGTC CCCTGGGCGG TTGGACCAAG GCCGACGACG TGCCGCTCGA TGTGCGCGTC AAACAGCACG AACAAGTCCT GCAAGCGGGC ACCACCCATC CCGGTGGCCT CGATCCGGAA TCCACCGTCA TGGCTATTTG GCCCGCTCCC ATGGTCTACG CCGGACCCAC CGAAGTCCAG TTCCACGCCA AGTCACGGCG CTCCGCGGGA GCCTCGTACT TTGTGGTCGG CCGCGATCCC GCCGGAATGA AAGGATCGCC CAACGCGGTG GCGCACCCGG ACGATGACCT CTACGACGGT AACCACGGAC GTTACGTTCT GCAGAACTCG CCGGGCCTCG GAGATATGAA GATGCTGAGC TTTGTCAAAG TCATGTACGA CACCACCGAC AATATTATGA AGATTCCGGA CGAAGCGCGG CTGGCGGACT TTATCAGTAT TTCGGGCAGT AAAATGCGAC TGTTGGCCCG GAACGGGGCC ACCCCCTGCA GTCCCACCAA TATTCCGACG GATCTGGTCG AAGCCAACTG CGTCCCCAGC GGATTCATGG TACCGGACGG TTGGAATCAA GTGGTCGACT ACTACCGGAA TATTGATGAT GTGCAACGCT GGACGCCGTG GAGTCAACCT CGCGTAGATC CCCCCACGGC ACCGCGCACC ACGTATCAAG GCCAGTTTGG TTCCCGATCC TTCCACCTGA CTAGTACAGA ATACGAATCC TTCTGGCACG ACATTCCCCT GAGTCCATCG GGGCAATCCG AAACCGTAGT CAACATGGTG ACGGAAATTC CCATGTATTG CACGGCCAAA ATGGAGATTC AAAAGATGCT GTCCAACAGT CCCATTGCTC AGGACACCAA CAGCGACGGT TCGCCGCGTC ACTACAGCTA CGGTACGCCC TTTTTCAACT ATGGTCTCAT TCCACAAACA TGGGAAGATC CCAACCTAAA ATCTGCGCAA GGGTACGGTG GGGACAACGA TCCGCTCGAC GTTATCGAAT TGGGGTCGTC GCCCTTGCAA ATGGGTGGAC TAACGCCGTG TCGGGTGTTG GGATCGTTTG AGCTCATTGA CGAAGGCGAA ACGGACCACA AGATTCTGTG CATTGCCGTG GACGACAAAG ACGCCAACCA AATCCATTCC TTGGAAGATT TGGAGCGTGT CAAGCCGGGT CACTTGGACA AGCTCCGGGA TTGGTTGAAG CGGTACAAGA CGAGCGAGGG CAAAGCGGAA AACAATTTGG CGTCTGAAAC GCCGCGCACC GCGATGGAAG CCGTAGGCGT CATTCAAGAA ACGCACGGAC GCTGGCGATC ATTGTGTGGT AAGGATGGAA CGACAGTCTA TTCTCTTTCG AGCAAGACGG CCGGTTTCTG GCTCAGCAGT CCGGGGTGTA GGGGAACGTA ATCTTACAGT TAGTGTCGCA GTTCCCTCCA GCCCAAAATT ACACAAACCC TTATTTACTT TTTAGAAATT TGCCACGAGT CGT
|
Protein sequence | MTGCSGAGKT TIATALEDQL VKSYGKHVYR LDGDNLRTGL NRDLGFSEAD RAESVRRTGE LATLFADAGV VTLVGLISPY RKDRDAVRKR HVDQGIPFYE VFLDVPVDEL KKRDPKGQYA RVESGELKHF TCIDDPYDEP LQPEITLKTH ELTIEQSVQI LFRRLERDGI LVGAPKLSPP GLPNPDGDVL VDLHVPDESK EARRAEAATL PKVLINDIDL NWLQTIGEGW ASPLRGFMRE GTLLETLHFN SILTDPFNLT GNALRLETRT NFDHFSAHPA PQRVSMPIPI TLSCTSFTKD LIDASSHNAV ALVTQMGHTV AILRDPEVYA NRKEEIVTRM YGVVDPDHPY IQHIYRGGDY LIGGEIELLD RIRYNDGLDQ WRKTATELVQ EFQSKGADTV YAFQTRNPTH AGHAYLMRSA GEDLRRQGYQ KPVLWLSPLG GWTKADDVPL DVRVKQHEQV LQAGTTHPGG LDPESTVMAI WPAPMVYAGP TEVQFHAKSR RSAGASYFVV GRDPAGMKGS PNAVAHPDDD LYDGNHGRYV LQNSPGLGDM KMLSFVKVMY DTTDNIMKIP DEARLADFIS ISGSKMRLLA RNGATPCSPT NIPTDLVEAN CVPSGFMVPD GWNQVVDYYR NIDDVQRWTP WSQPRVDPPT APRTTYQGQF GSRSFHLTST EYESFWHDIP LSPSGQSETV VNMVTEIPMY CTAKMEIQKM LSNSPIAQDT NSDGSPRHYS YGTPFFNYGL IPQTWEDPNL KSAQGYGGDN DPLDVIELGS SPLQMGGLTP CRVLGSFELI DEGETDHKIL CIAVDDKDAN QIHSLEDLER VKPGHLDKLR DWLKRYKTSE GKAENNLASE TPRTAMEAVG VIQETHGRWR SLCGKDGTTV YSLSSKTAGF WLSSPGCRGT
|
| |