Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42440 |
Symbol | |
ID | 7196644 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 58601 |
End bp | 60029 |
Gene Length | 1429 bp |
Protein Length | 475 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177009 |
Protein GI | 219110515 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.727315 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATGGCAGAA AGACGACCGC ATCCTCCTTC TCTTTCGGTA GCTATTTTAC AAATTCTGAT TCTTTATCAC ATGCTGCAGC GAGCCGCGGC GTTGGCAACG TCGCGAATTT CTTCTCCATA CACAAAACGC GGTCTGGTTC GAGTGGTCGC ATTTTCGGTT TCCAACCCCA ACGAAGAACA GGTGCAGGAC ATGCTATCCA ACGTTCTGTC ACGAGCCACA CGCACGTTGT CCTCGAAATC CTATACCAAT TTCTCTGTAA ATAGCACCTG CTCTTCCGAG CCGTGGCGGA ATGAATCGCG TGTCGACGAT GATTCCGTAT TGGATATGCT GGAGGCAATT CGAGTGGGCA CCGCGAATGG GAACTCGTCA ATGGTTCGCA CCGATGCGAC AAGGAGCACC AACAGTGCTG TATCCTCTTC TTTCTCTCCG TCGGTAACAT CCGCTATTAG ATTGGATTCC CTTTTCACGA ATTCCATATC GCGAGCCAAA CGCACATTCA CCACCAAAGT GGAATTCCAG GAAGCATCCA AATCGAACAA ATTGGTCGAC GATACGGTCA CAACCATACT CGACAAGGTA CAGCAGCAAG GTCGCGTCCC ATCGCCGACA CCCGCTCCAG ATCCCTTGCC TCGTCCCACG GACACTGATT TGCACTACCA GCAAAATCCC GCCATTTCAG CCACTGCCCT GGCCCATTCC TTGTGGGGTT ACGTGCTCCG ACCGGGCCTG GATTCGGCCA TTGACGCAAC GGCAGGAAAC GGTGGGGATG CCGCTACAAT TGCCACGATG CTCTTTTCAA ACGTGACCCA AAGCTCGACG TCATTAATGC CAACCTCGCG ATCCGAACTC GTTTGTGTCG ATGTGCAGAC CCAAGCGTGT GCGAATACGC GCAGCGCGTT GGAAGACTGC GTCGGCTCGG ACGTAGTGGA GCAACGCGTT CGGATAATCC AAGCATCTCA CGCACCATTG CCACTACCCA CTGATACCTC CTCAATCGCC CTGGTAGTCT TTAATCTCGG GTTCTTACCC CAATCGGAAA ATAAAGCCCG TCAAACCCAG ACCGACACAA CCTTGGCTGC CATGGCCGAC GCCTGTACGG TCTTACGCAT TGGCGGCCTG CTATCCGTCA TGACCTACCC AGCGTCCAAC GCTCACGAAG ATGCGCTGGC CCGGGCCTTT ATGGAATCTT TGGCGCTATA CAGCTCCAAA ACAGAAAATT GGGAAACCTT TGTGGACGAA TTGATGTTTC CAGCCGAGAA TGATGACACT GCAACAGCCG ACGCCGAAGA TTGGAACGAA CAGCTCCGAC GGTCACTACG GTATGTTTAC GAAGAAAATG GACCCACACA GACCTGGCGG GTTCACGAAC ACCGGAAAAT TGGATGGAAG AACGCGCCCG TCTTGCTCAC CGCAATCCGA ATCAAGTAG
|
Protein sequence | MAERRPHPPS LSVAILQILI LYHMLQRAAA LATSRISSPY TKRGLVRVVA FSVSNPNEEQ VQDMLSNVLS RATRTLSSKS YTNFSVNSTC SSEPWRNESR VDDDSVLDML EAIRVGTANG NSSMVRTDAT RSTNSAVSSS FSPSVTSAIR LDSLFTNSIS RAKRTFTTKV EFQEASKSNK LVDDTVTTIL DKVQQQGRVP SPTPAPDPLP RPTDTDLHYQ QNPAISATAL AHSLWGYVLR PGLDSAIDAT AGNGGDAATI ATMLFSNVTQ SSTSLMPTSR SELVCVDVQT QACANTRSAL EDCVGSDVVE QRVRIIQASH APLPLPTDTS SIALVVFNLG FLPQSENKAR QTQTDTTLAA MADACTVLRI GGLLSVMTYP ASNAHEDALA RAFMESLALY SSKTENWETF VDELMFPAEN DDTATADAED WNEQLRRSLR YVYEENGPTQ TWRVHEHRKI GWKNAPVLLT AIRIK
|
| |