Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43374 |
Symbol | |
ID | 7197119 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 277490 |
End bp | 279514 |
Gene Length | 2025 bp |
Protein Length | 617 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | 9-cis-epoxycarotenoid dioxygenase |
Protein accession | XP_002177588 |
Protein GI | 219111673 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0989754 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTCGCCCTC GGTCAGAGGC AACACAACAC TGAACATAAG GTCAAGACAG CAAATCTTTA CCACGGTATC TTGACTGTGA AACCGAGAAG ATTTCCAGTC GGTGCTTCGA AGAATCCCAG TTTTCTAGTT GAACCTTGCT ATTGCATTTA TTTGCTCCGC ATATCTACAG CATGGTTCTT TCCAGACTCG TCAGTTGGAC GAGTGTTGCA GTGTGGTTAA TTACTCAGAA CTTCTCCAGT TCCGGTATCA GTGTAATGCT AGCAGATGCC TTTAGTCCCG TGATGTTGAC CACCAGTCGC ACGGACGTCG CCTTCCAGCC GACGCTCCAC GCAACGACGA GTCCCGACTC GGTTTCCGTG CGTAAGATTC AGCGCGAACG CGAGCGTTCG GTACGCTCGT ACCACGATAC CGAGGCGTGG AATACGTTGT TTGTGGCCGT TCCCGAACGA AGCACGCCCG TACCGTACGA TACCAGCGCT GGTGGTAGTC ATGCCAACGA CGTTGCGCAA AACTTCCGCA CCGCCAGGGC TGCCCCGTTG CCGAGCAATT TCCCGCCGGG GTGCTTGCTG CGTCTCGGTC CCAACGGAGC CCCGCAGAAC GAAGGATTCT TTGACGGCGA CGGTATGGTA CAGTGCATTA CATTTCCTCC AAGCACTGAC CGTGAGCATG TCGGGATGTT TTCGTGCTCG TATGTTGATA CCAGGGGTCG CCAGCTAGAA GGCGAACGCC AGAAAGTCTT TTTGGGGACA CTTGGTGCTG TCCCTCGTGG CTTGCCGTTG CTCTTCAACG TACTTTCCAA TATGCTAACG TTCCGTACGT TACAAGGACA GAAGGATACC TGTAATACGG CGTTGGCAAC GCACGGGGGC CGCGTCCTGG CCTTGATGGA GCAGTGCCCG CCGGCCGAAA TTGCCATTGG CCGGGACGGA CGCATATCTA CCGTGCAAGC GAACTGCAAT CTGGACGGAG CCATCCCGTT TGCGCCAATT ACTGGAGGAT CACTCAGCGC CCACGGAAGG ACCTGTCCCG AAACTGGTGA ACGGGTACAC GTTTCTTACA GTAGCGGCAA TGCTCCCTAT GTGCGGGTCG ATACGTTCGC ACCAGACGGC TGGAATTTGG TGCGATCGGT TGGTGTAAAT GTTCCGTGTG CGACCATGTT ACACGATTGT GCTATTACGG AAAATTATGT AGTGGTGCTC GACTTTCCGC TCACACTCCG GACGACACGG TTTCTAGCCG ATCAGTTCCC CGTCGAGTAC GAACCTTCAT ACGGGGCTCG CATAGGATTG CTGCCACGGC ATACCACTGA TGCAGACGAT TCGGGCACCA TTTGGTTTGA CTGTGCACCA GGGGTGATAT TGCATCTGGT CAATGCATAC GAAACGAACG ACGGCAAAGT GATTGTGCAG GGTTTGCGGT CGGAACCAAG CACGTCGCAA GGATATTTGG AGGCCTTTTC GCCCAGCTTT TTATACGAGT ACGAATTGGA TCTTGTCTCG CGACGTACGT CCCGGGAAGG TTGCCTGAAT CCGTACGAAA TTGTCGAGTT TCCTATTCTT GACGAATCTC AGAACGGCAA GGTAGCGCCT CACGTGTACA CCATCGGCGT CCGATCGATC GGTGGACCCC TGGCGACGCA CCAACAACCC GTCATTGGTT TAACACTGGA CAGCGTTGTC AAGTTTAATC TTGTCAACGA TACCGAGAGT AGTACAAAGG GTGACGTGCT GGGCAAGTTT GTCTTGCCCG ATCGATGGTT TGCCGTATCG GAGCCTACGG TGGTTGCCAA AACGGACGGA ACCGGGGGTG AGTACGTCTT GATAATTGCC ACAGTCGTGC CGGAGGGCAG TGACTGGAAG CAAGTTGAGG CACTCAAACC AGAAAACGCA GATGAATTGA CTTCGCATGT ATTGGTGTTG GATGGAGACA AATTGGACGA CGGACCGGTC TGGATGCGGG AAATGCCGCA TCGCATTCCG TACGGTTTGC ATTCGTTGTT TGTTCCGTGG GAACTGATGA AATAA
|
Protein sequence | MVLSRLVSWT SVAVWLITQN FSSSGISVML ADAFSPVMLT TSRTDVAFQP TLHATTSPDS VSVRKIQRER ERSVRSYHDT EAWNTLFVAV PERSTPVPYD TSAGGSHAND VAQNFRTARA APLPSNFPPG CLLRLGPNGA PQNEGFFDGD GMVQCITFPP STDREHVGMF SCSYVDTRGR QLEGERQKVF LGTLGAVPRG LPLLFNVLSN MLTFRTLQGQ KDTCNTALAT HGGRVLALME QCPPAEIAIG RDGRISTVQA NCNLDGAIPF APITGGSLSA HGRTCPETGE RVHVSYSSGN APYVRVDTFA PDGWNLVRSV GVNVPCATML HDCAITENYV VVLDFPLTLR TTRFLADQFP VEYEPSYGAR IGLLPRHTTD ADDSGTIWFD CAPGVILHLV NAYETNDGKV IVQGLRSEPS TSQGYLEAFS PSFLYEYELD LVSRRTSREG CLNPYEIVEF PILDESQNGK VAPHVYTIGV RSIGGPLATH QQPVIGLTLD SVVKFNLVND TESSTKGDVL GKFVLPDRWF AVSEPTVVAK TDGTGGEYVL IIATVVPEGS DWKQVEALKP ENADELTSHV LVLDGDKLDD GPVWMREMPH RIPYGLHSLF VPWELMK
|
| |