Gene Information |
Locus tag | PHATR_46901 |
Symbol | |
ID | 7204736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | + |
Start bp | 753332 |
End bp | 755437 |
Gene Length | 2106 bp |
Protein Length | 591 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185779 |
Protein GI | 219121096 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AACGCCCATC GTTCCTCTCT TCATCGGAAG ATTCTTCTAA AGCAACTTGG AGAACGAGAG AGTCATTTTA ATCGTGGCAC GAACAGTCCT AAATCTCTGA AACAGCAGTT TTTATCCATA TTAGCGAAGC TAGCTCCGTT GGACGACGGG TAAAGCCAGT AAACGGGTGG CAATTCCGCC AAAGACAAAA ATACAGGCAA CAACGGACCT TCTAACGTCT ATTGCTCTGT TGCAGTAACT GTGCTTCCTC CGACTCTTGG TTGTCACGGC AAACCGGACA GAACACTACA TCTGATTCCA AAGCCCTACG GCTCCACGAA CGATACAGTC ATGACGGAAA CAACAACCGT AGGCAGCGAT GTAAGCAGCG GATCATTCAC AGCGCCCGAT TTGGACTGCA GTGGTGCTTG GATTAAGGAT CCAACCTTGT CTGATGTACT TTGTGGACGC GGTGGTTCGA TCAACTCGCA TGCAGGCAAT GAGCGATTTC GAGAATTGGT CGAAAAGCGG AAGCGTGTCT ACCTTACCGC ACGCTTCAAG AGAGAAAAAC GTCTGATTGC CAGTAGCATC GTTTCGGAAA TTCGAGGGCT AAAGCCTTCG GGGAGATTTC TATCAAGAGA TAGTAAGACA GGCTTATGGA AAGATATTGG TGATGAAAAA GCTAGGGACA AAACATCGCA AGCGTTACGC GAGAATGCTC CTTCCATCCG CGCCGAGATT GAAACGGAGA TCAGCGAGCA GCGCAGGGAT TATCAACAGG AAGATCAGGA GGTAGCACCT CACGGTACTT CTCATCCAGG TTATTATGCC CCTCCGACCT GGGGCTTTCC TACTTATGCA TACCCCAGTT ATCATCAAGT GCATCCCGCT CCACCGGGCG GAGCCCAACC TCCGCATGGA ACTCCAATGC CACCACCTGG CTACTCATAC GATCCACGGG GTCCGCCGCT GCCACCACAG CATTACCCTC CACACTATCC GCCGCCAGGC TATTCACGAG ACTCCCACTA TCCGCCACCG CACTCACACC ATCACACACC AACTACACAG CCCACTGCAC AAGCTACACC AAAGTCGGCC CTCGAAGCCA CTGCGGAGAT CATTACTTCT GGTGCAGAGA CACTTAAGAA GTGGACTCAT TCCAATTTAT CATTGACTGG TGTGCCTTCA AACGATGGTC ATGATTCTAG GTCAACAAGT TCACGCAGTT CAAAGCCTAT CGCATACGTT CACCAAGATT ACACCAAGAA GCGCCGTATG GTTAAGTTCC GAGACGATTA CGACCGTCGA ACAAGCTATT CGCCGGTTTT TTCTGGAAGT GCACTCGGGG GAGGCCAAGC GCACAACGAG CAAATCGAAC CTCAAAATTT GCATGATCAA GAGAGCTCCC TCATGACACA AGTTGCCGAT CGCATTCTGG GGTCGTTAGG TTCCTGGGAC ACTGGTACTT TCTGTGGCGG TCATGATGCA GACGATGAAC GTAAATCGTT CTTTCCGGTA TCTCTTTCCA GTAACGCAGG GCCACCGACC CAAGATGAAG ACAACATGGC GGTCGAATGG GAAGGTCAAG AAGTACAGCT GGTCGACAAG TCTTTGGAGA GTCAGTCTGT CGCTTCCGAC GAACGTATGC CCCCGCCTCT AGTTCGGCAG CAACCACGCC CGGACCAAGC ATCATCGCTC GGTGGATTCT CGTCCCTGGG AAGCTGCCAT TCTTGGCTGC TTCCGGAAAG TGCCGCTTCT TATTTCAGTA AATCGGGTGC TTCACCCTCG AACTCAGTCG ACATGGGGTA CTCGGCCGTC 
GGCATGGAAC AACATTCAAT CAACGGTTCA ATCGGCGGCG CTTCGCTCAC GCGTGTTTTT GAAAACGAGC CTCAGTCTAC GCCTCATTCC CCTGGCATGT CGCTGAGATC GTTGTCTCAA ATGCCATCGT GGGAACGGTC CCTGCGTAGC AAATCGCCTC TGTCCATTGC ATCAGACGAG GATGATTCAT TGATATCCCG TTCGTCGAGC AAGATATCGG ATGGACACCT TAGTCCGTTC CACGCGCCAT CGAGTCCGAT GCCTATGGTG AACGAGGACG ATATGGTGTG GGAAACCAAG GAATGA
|
Protein sequence | MTETTTVGSD VSSGSFTAPD LDCSGAWIKD PTLSDVLCGR GGSINSHAGN ERFRELVEKR KRVYLTARFK REKRLIASSI VSEIRGLKPS GRFLSRDSKT GLWKDIGDEK ARDKTSQALR ENAPSIRAEI ETEISEQRRD YQQEDQEVAP HGTSHPGYYA PPTWGFPTYA YPSYHQVHPA PPGGAQPPHG TPMPPPGYSY DPRGPPLPPQ HYPPHYPPPG YSRDSHYPPP HSHHHTPTTQ PTAQATPKSA LEATAEIITS GAETLKKWTH SNLSLTGVPS NDGHDSRSTS SRSSKPIAYV HQDYTKKRRM VKFRDDYDRR TSYSPVFSGS ALGGGQAHNE QIEPQNLHDQ ESSLMTQVAD RILGSLGSWD TGTFCGGHDA DDERKSFFPV SLSSNAGPPT QDEDNMAVEW EGQEVQLVDK SLESQSVASD ERMPPPLVRQ QPRPDQASSL GGFSSLGSCH SWLLPESAAS YFSKSGASPS NSVDMGYSAV GMEQHSINGS IGGASLTRVF ENEPQSTPHS PGMSLRSLSQ MPSWERSLRS KSPLSIASDE DDSLISRSSS KISDGHLSPF HAPSSPMPMV NEDDMVWETK E