Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44747 |
Symbol | |
ID | 7199722 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 120974 |
End bp | 122230 |
Gene Length | 1257 bp |
Protein Length | 362 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178707 |
Protein GI | 219115824 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0219967 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCTGTTCCG ACCTACCAAC TGTAGGCCTA TTCGCTGTAT TGCCAGAATA TATAACCTAC CGAGGCCACC AAGGACGCTT GCGATTCACC CTTTCCTCTC GTTGACGCTC ACACGGTGCC CTCTTACCTG TTACCAATTG TCGATCATTC CTTTCACTGT CCACACTCAT GAATAGCACG CTAGCGGTGA CTCCCCACGG AGCCGTCACG GAGAACGCCA CGACAACGCA ATCTGCAGAG ACGGGAAAAG CCGTAACGTC CTCGACGATG GCGACGGACG CGGCGTCGAC GCAGAGCGGT TTACAACACG ATTCACGATT CAGTTGCAAT ATTTGTTTAG AAGCCGTCAC GGCACCCGTC GTGACACAGT GCGGACATTT GTACTGTTGG AGTTGTCTGT ACCGATGGCT CGAGCCAGGA ATGGTTCCGG GAGAACGACA AGCACTCACG GGAATGGTCC GGTACGGACC CATCGACGAA ACGCGACGTG TGTGTCCGGT GTGCAAAGCG CCCTGTTCCG TTCCCACCAT TGTTCCCATT TACGTGCGCA ACGAACCCAC TAGTCCCAAC AAACGAACAT CGAGCTTGGC GGATTCCTTG GACGACACTG ACGACGACGA TGTCGAGTAC GATCATCAAC GAGAGGCCTC GTACGGAGAA CAGGCCCACG TGGCGGCGCA CCACACTGAT ATTGTGCGTG CAGCCAATTC CTGGGAAACG TCCGTGACGG AACATGCGGA CCTCGATGGA CCTACAAGTC CACTGCCACC ATCCGTGGTC AACGTGACTG CCTCATCCGC ACCCGATTCC TCCTTCACAA ACACTGGACT CCGGCAACGG TTGCGGTTTC GCAGTCGCGA CAGTGAAATC CCCTCCGCCG AAGACTATCA CGTGGTCCCA GCCCGTCCAG CCGCCAACTC CCCGGTTCGC CATCGCAGTC TTTCCGAATC TAACGTCGGC GCGGCATCCC TCCCCCGCAA CCCCGCATGG TTGACCCCTC TCAATCCTGC TACCAACCGG GCTTCTCTCA GCAACGGACT CGCCTTGTCG CTCCAGCACG CCTTTCGACA ATCTCTTCCA ACGACTGCTG CGGCACAGCC CGATCAGAGC ATTCCCCCAC TCCATCGCAG AGAAGGACAC GGCAGTGCCG CCGTCATGAA CAGCATTTCG GAACAAGATC CCAACGCCAC CGAATTCTTG TCCAGAATTC TATTGTTACT CGGTTCGTTC GTGATCCTGT GTTTGTTGTT GTTTTGA
|
Protein sequence | MNSTLAVTPH GAVTENATTT QSAETGKAVT SSTMATDAAS TQSGLQHDSR FSCNICLEAV TAPVVTQCGH LYCWSCLYRW LEPGMVPGER QALTGMVRYG PIDETRRVCP VCKAPCSVPT IVPIYVRNEP TSPNKRTSSL ADSLDDTDDD DVEYDHQREA SYGEQAHVAA HHTDIVRAAN SWETSVTEHA DLDGPTSPLP PSVVNVTASS APDSSFTNTG LRQRLRFRSR DSEIPSAEDY HVVPARPAAN SPVRHRSLSE SNVGAASLPR NPAWLTPLNP ATNRASLSNG LALSLQHAFR QSLPTTAAAQ PDQSIPPLHR REGHGSAAVM NSISEQDPNA TEFLSRILLL LGSFVILCLL LF
|
| |