Gene PHATRDRAFT_43374 details

Gene Information

Locus tag: PHATRDRAFT_43374
Symbol:
ID: 7197119
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011670
Strand:
Start bp: 277490
End bp: 279514
Gene Length: 2025 bp
Protein Length: 617 aa
Translation table:
GC content: 54%
IMG OID:
Product: 9-cis-epoxycarotenoid dioxygenase
Protein accession: XP_002177588
Protein GI: 219111673
COG category:
COG ID:
TIGRFAM ID:
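
The lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive (the record does not state its convention), and that the coding region is one contiguous stretch ending in a single stop codon, which fits the "Is gene spliced: No" entry. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 277490
end_bp = 279514
protein_length_aa = 617

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: coding region = one codon per residue plus one stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

# Bases inside the gene span that lie outside the coding region.
noncoding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, noncoding_bp)  # 2025 1854 171
```

Under these assumptions the computed span matches the listed gene length of 2025 bp, and 171 bp of the listed gene sequence fall outside the 1854 bp needed to encode the 617-residue protein.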


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0989754
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTTCGCCCTC GGTCAGAGGC AACACAACAC TGAACATAAG GTCAAGACAG CAAATCTTTA 
CCACGGTATC TTGACTGTGA AACCGAGAAG ATTTCCAGTC GGTGCTTCGA AGAATCCCAG
TTTTCTAGTT GAACCTTGCT ATTGCATTTA TTTGCTCCGC ATATCTACAG CATGGTTCTT
TCCAGACTCG TCAGTTGGAC GAGTGTTGCA GTGTGGTTAA TTACTCAGAA CTTCTCCAGT
TCCGGTATCA GTGTAATGCT AGCAGATGCC TTTAGTCCCG TGATGTTGAC CACCAGTCGC
ACGGACGTCG CCTTCCAGCC GACGCTCCAC GCAACGACGA GTCCCGACTC GGTTTCCGTG
CGTAAGATTC AGCGCGAACG CGAGCGTTCG GTACGCTCGT ACCACGATAC CGAGGCGTGG
AATACGTTGT TTGTGGCCGT TCCCGAACGA AGCACGCCCG TACCGTACGA TACCAGCGCT
GGTGGTAGTC ATGCCAACGA CGTTGCGCAA AACTTCCGCA CCGCCAGGGC TGCCCCGTTG
CCGAGCAATT TCCCGCCGGG GTGCTTGCTG CGTCTCGGTC CCAACGGAGC CCCGCAGAAC
GAAGGATTCT TTGACGGCGA CGGTATGGTA CAGTGCATTA CATTTCCTCC AAGCACTGAC
CGTGAGCATG TCGGGATGTT TTCGTGCTCG TATGTTGATA CCAGGGGTCG CCAGCTAGAA
GGCGAACGCC AGAAAGTCTT TTTGGGGACA CTTGGTGCTG TCCCTCGTGG CTTGCCGTTG
CTCTTCAACG TACTTTCCAA TATGCTAACG TTCCGTACGT TACAAGGACA GAAGGATACC
TGTAATACGG CGTTGGCAAC GCACGGGGGC CGCGTCCTGG CCTTGATGGA GCAGTGCCCG
CCGGCCGAAA TTGCCATTGG CCGGGACGGA CGCATATCTA CCGTGCAAGC GAACTGCAAT
CTGGACGGAG CCATCCCGTT TGCGCCAATT ACTGGAGGAT CACTCAGCGC CCACGGAAGG
ACCTGTCCCG AAACTGGTGA ACGGGTACAC GTTTCTTACA GTAGCGGCAA TGCTCCCTAT
GTGCGGGTCG ATACGTTCGC ACCAGACGGC TGGAATTTGG TGCGATCGGT TGGTGTAAAT
GTTCCGTGTG CGACCATGTT ACACGATTGT GCTATTACGG AAAATTATGT AGTGGTGCTC
GACTTTCCGC TCACACTCCG GACGACACGG TTTCTAGCCG ATCAGTTCCC CGTCGAGTAC
GAACCTTCAT ACGGGGCTCG CATAGGATTG CTGCCACGGC ATACCACTGA TGCAGACGAT
TCGGGCACCA TTTGGTTTGA CTGTGCACCA GGGGTGATAT TGCATCTGGT CAATGCATAC
GAAACGAACG ACGGCAAAGT GATTGTGCAG GGTTTGCGGT CGGAACCAAG CACGTCGCAA
GGATATTTGG AGGCCTTTTC GCCCAGCTTT TTATACGAGT ACGAATTGGA TCTTGTCTCG
CGACGTACGT CCCGGGAAGG TTGCCTGAAT CCGTACGAAA TTGTCGAGTT TCCTATTCTT
GACGAATCTC AGAACGGCAA GGTAGCGCCT CACGTGTACA CCATCGGCGT CCGATCGATC
GGTGGACCCC TGGCGACGCA CCAACAACCC GTCATTGGTT TAACACTGGA CAGCGTTGTC
AAGTTTAATC TTGTCAACGA TACCGAGAGT AGTACAAAGG GTGACGTGCT GGGCAAGTTT
GTCTTGCCCG ATCGATGGTT TGCCGTATCG GAGCCTACGG TGGTTGCCAA AACGGACGGA
ACCGGGGGTG AGTACGTCTT GATAATTGCC ACAGTCGTGC CGGAGGGCAG TGACTGGAAG
CAAGTTGAGG CACTCAAACC AGAAAACGCA GATGAATTGA CTTCGCATGT ATTGGTGTTG
GATGGAGACA AATTGGACGA CGGACCGGTC TGGATGCGGG AAATGCCGCA TCGCATTCCG
TACGGTTTGC ATTCGTTGTT TGTTCCGTGG GAACTGATGA AATAA
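
The gene sequence is printed in blocks of ten bases separated by spaces, so it has to be joined before any composition statistic such as the GC content reported above can be computed. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tool, and the sketch does not claim to reproduce how the record's 54% figure was derived.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence listing, used as a short example input.
first_line = "TTTCGCCCTC GGTCAGAGGC AACACAACAC TGAACATAAG GTCAAGACAG CAAATCTTTA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full listing through `clean_sequence` should yield a 2025-character string, which can then be given to `gc_fraction` for comparison with the reported value.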
 
Protein sequence
MVLSRLVSWT SVAVWLITQN FSSSGISVML ADAFSPVMLT TSRTDVAFQP TLHATTSPDS 
VSVRKIQRER ERSVRSYHDT EAWNTLFVAV PERSTPVPYD TSAGGSHAND VAQNFRTARA
APLPSNFPPG CLLRLGPNGA PQNEGFFDGD GMVQCITFPP STDREHVGMF SCSYVDTRGR
QLEGERQKVF LGTLGAVPRG LPLLFNVLSN MLTFRTLQGQ KDTCNTALAT HGGRVLALME
QCPPAEIAIG RDGRISTVQA NCNLDGAIPF APITGGSLSA HGRTCPETGE RVHVSYSSGN
APYVRVDTFA PDGWNLVRSV GVNVPCATML HDCAITENYV VVLDFPLTLR TTRFLADQFP
VEYEPSYGAR IGLLPRHTTD ADDSGTIWFD CAPGVILHLV NAYETNDGKV IVQGLRSEPS
TSQGYLEAFS PSFLYEYELD LVSRRTSREG CLNPYEIVEF PILDESQNGK VAPHVYTIGV
RSIGGPLATH QQPVIGLTLD SVVKFNLVND TESSTKGDVL GKFVLPDRWF AVSEPTVVAK
TDGTGGEYVL IIATVVPEGS DWKQVEALKP ENADELTSHV LVLDGDKLDD GPVWMREMPH
RIPYGLHSLF VPWELMK