Gene OSTLU_43309 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_43309 
Symbol 
ID5005382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009368 
Strand
Start bp58496 
End bp59974 
Gene Length1479 bp 
Protein Length492 aa 
Translation table 
GC content59% 
IMG OID640420803 
Productpredicted protein 
Protein accessionXP_001421195 
Protein GI145353812 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.231154 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00124797 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGCGCG CGGGCGCGGC GACGCGGCGA CGCGCGCTGC AGACGACGGA GGACACGCCC 
GTGAGCGCGG AGGCGAGGGC GGAGGCGTAC GATTGGCGCG CGCACTGGTA CCCGGCGGCG
TACGTCGCCG ACGTGGAGAA GGACGCGCCG CTGACGTTTA CGCTCCTGGG CGAACCGCTG
GTGTTCTGGC GGGACAAGAG CGGCGAGATG CGCTGCGTCG CGGATAGGTG CCCGCACAGG
TTGGTACCGC TGAGCGAAGG ACGCGTCAAC GAGACGGGTG AGCTCGAGTG CGGGTATCAC
GGGTGGACGT TTACGGGAGA GGGTAAGTGC ACGTCCATTC CGCAAATCGA GCAAGGGACG
GGTTTGGAGA CGGCTTTGAA GTCTCCGAGA TCGTGCGTCG CGGCGTACCC GACGAAAGAG
GCGCAAGGCA TGCTTTGGGT GTATCCGACC TCGATGGATA AAGCGCCGGC GACGCTGCCA
GATTTGCCGC TAATCCCAGA GTACGACGAC CCGGAGTGCG TGTGTCAAGA CATCTTTCGG
GATCTTCCCA TGGATTGGGC GACTTTGCTT GAAAACGTCA TGGATGTTAG TCATGTGCCG
TTTACGCATC ACAACAGCGT TGGTAAGCGA GAAAATGCGA CGCCAGTGAA TTTGGAATTG
GCGAGCGCCG CGGGCGTCAC GGCGAACGGA TTCGAAGGGA TATGGAAGGA AGGTCCGAGG
AAAGGTAAAT ATGGGTCTCA ATACACCGAG TTCAAAGCAC CGACGTTAAT GCGCCACACG
CTCAAGACGG AGGCTTTCAC GACGCTCACC GTCGTGTACG CGGTGCCGAC GACACCCGGC
CGATGCCGAC TTATGGCGCG ATTTCCATTC ATCTTCAAAT CGGCGTTGCC GCGATTCTTT
TTCGGTCTTT ACCCGCAATG GTTCTCGCAC ACGAATCAAA ATGCAATTTT AGAGGATGAC
CAAATCTTCT TGCACAAGCA AGAGCGATTG ATCGAGGTTG AGCAAAAAGA AGGCAAGTCA
TACGCGCAGT CGTGCTACAT GCCCACCAAG GCAGACGTCT ACGTTTCGGC GTTCCGCAAG
TGGATTGTCG ACGTCGCCGG CGGCGGTCCA GCGTGGCCGA AGGATATGCC CACTGATTTG
CCACCGCAAG AAACCACGCG CGAGGCTTTG CTCGATCGCT ACCATTCGCA CACGATAAAC
TGCAAGTCGT GCGCCAGCGC TTTGGCGAAA ATCGGCAAGG CGCGAAAGGC GCTTCGCGTG
CTCACCTTTG TCGCTCTCGC CGCTGCCGTG GCGACTTTTG CGCGAGCAGT ACCGTTGAAG
TACACGATCG CCTTGTCGGT ACTATCCGCC GCGTGCGCTT TGGTTCGCGA GAAACTCGGC
GCGTTTGCCG CGAAGATGAA AATCGGGCCG TATCCGCCAC CGCGCAGACC GCCATCCATG
ATGGAGGCAG CGTTGCAACA AGCGCGAATC GCGTTTTAA
 
Protein sequence
MTRAGAATRR RALQTTEDTP VSAEARAEAY DWRAHWYPAA YVADVEKDAP LTFTLLGEPL 
VFWRDKSGEM RCVADRCPHR LVPLSEGRVN ETGELECGYH GWTFTGEGKC TSIPQIEQGT
GLETALKSPR SCVAAYPTKE AQGMLWVYPT SMDKAPATLP DLPLIPEYDD PECVCQDIFR
DLPMDWATLL ENVMDVSHVP FTHHNSVGKR ENATPVNLEL ASAAGVTANG FEGIWKEGPR
KGKYGSQYTE FKAPTLMRHT LKTEAFTTLT VVYAVPTTPG RCRLMARFPF IFKSALPRFF
FGLYPQWFSH TNQNAILEDD QIFLHKQERL IEVEQKEGKS YAQSCYMPTK ADVYVSAFRK
WIVDVAGGGP AWPKDMPTDL PPQETTREAL LDRYHSHTIN CKSCASALAK IGKARKALRV
LTFVALAAAV ATFARAVPLK YTIALSVLSA ACALVREKLG AFAAKMKIGP YPPPRRPPSM
MEAALQQARI AF