Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_22797 |
Symbol | |
ID | 7194937 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 445363 |
End bp | 446351 |
Gene Length | 989 bp |
Protein Length | 288 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183488 |
Protein GI | 219126487 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0349586 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACGC TATTACTGGA TCCAGATATT CGCGACTGGG TCGTTTTGCC ACTCTTTGTG ATAATGGTAG CTGCTGGTTT GTTGCGACAT TATGTTGGTT TATTGTTGCA AGGCGAAAAG CAGCCAATGC CCGTTATTGC ACAGCGTTCT CAGAATCTGT TAGCGCAAAC GGTGCGGATT CGCTCCGGAG CATCACACTA CATTTCCACT TGGCAGTGGC ATGTACGAAA ACAACATTAC GCGGCACTCC TGCAACAGGA AGCCGAATGG GCCGAAGCGG AGCAGCAGAA AAAGGCAGAT TCTTCCGACG ATGACCCAAT GTCAGCCATG CTGAATAATC CGCTGGGAAT GCTCAAGGGA AACATGGTCT TTATGGTACA AAAGTAAGTG CGAATGTTCA TCGCTAGATA CCATCTTTGT GACGCTTCAC TCGTAGGTAC GATGAATCAA TCCAAATCTT GTCCAAATTT CGACTCACTC ATAGTACGCG TCTACAGTAT GGTCATGATG CAGGGCATTC AGCACTTTTT CTCCGGGTTC ATTCTCCTCA AAGTGCCTTT TCCTCTGACG GCTGGGTTCA AGGACATGTT TCAAAAGGGT CTAGCGGAAC TCCCGGACCT GGAATCTTCA TACGTGAGTA GCGTTAGCTG GTATTTCTTG GTCATGTACG GACTCCGAGC CTTTTTTCGT CTCGCAATTG GCGATCCCAG TTTGGAGGCC CGGGAACAAG ATATGCTGCT TGCGCAGTTT GGTCTGCAAA ATCCTCCCAA TCCGGGCCAA AAACAGGACG GCGAATCCAT GGCAAAAACA CTACGACAAG AAGCTGAGAA TCTGGAGTTG TTTTTGCAGT CGCACAAGTC CGAATTAGAC ACTGTGGAAA AGCGACTTCT TGGCAAGCGC ATGCCGCGTA AAACATACGG GGACCAGGAC GATTTCTTAC TCGGCGCGAG TTCGGGGAAA CCTAAACGTA AAACCCAATA AATGTGGCT
|
Protein sequence | MTTLLLDPDI RDWVVLPLFV IMVAAGLLRH YVGLLLQGEK QPMPVIAQRS QNLLAQTVRI RSGASHYIST WQWHVRKQHY AALLQQEAEW AEAEQQKKAD SSDDDPMSAM LNNPLGMLKG NMVFMVQNMV MMQGIQHFFS GFILLKVPFP LTAGFKDMFQ KGLAELPDLE SSYVSSVSWY FLVMYGLRAF FRLAIGDPSL EAREQDMLLA QFGLQNPPNP GQKQDGESMA KTLRQEAENL ELFLQSHKSE LDTVEKRLLG KRMPRKTYGD QDDFLLGASS GKPKRKTQ
|
| |