Gene OSTLU_17656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_17656 
Symbol 
ID5004699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009366 
Strand
Start bp395668 
End bp396687 
Gene Length1020 bp 
Protein Length339 aa 
Translation table 
GC content56% 
IMG OID640420120 
Productpredicted protein 
Protein accessionXP_001420838 
Protein GI145353039 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0407] Uroporphyrinogen-III decarboxylase 
TIGRFAM ID[TIGR01464] uroporphyrinogen decarboxylase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.0099841 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0141903 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCGCG CGATCCGAGG CGAAGACGTC GAGCGACCGC CGATCTGGAT GATGCGACAG 
GCGGGGAGAT ACATGAAGGT GTATCAAGAC TTGTGCAAAA AGCACACGAC GTTTCGAGAG
CGAAGCGAGA CGGTGGATCT CGCGGTAGAG ATCTCGCTGC AGCCGTACGA GGCGTTTAAA
CCCGATGGGG TGATTTTGTT CAGCGATATT CTCACGCCGT TGCAAGGGAT GAACATTCCG
TTCGACATCG TAAAGGGAAC GGGGCCGATC ATCTTCAACC CGGTGCGCGA GATGGACGAT
ATCAAGAGCA TCACGCCGTT GGAGCCGGAG AAGAGCGTGC CGTTCGTCGG GGAGTCGTTG
AAGGTGTTGA GAAATGAAAT TGGGAACGAG GCGACGCTGC TCGGGTTTTG TGGAGCGCCG
TTTACGTTGG CGTCTTACAT CGTCGAGGGC GGGACGAGCT CGCACTACAA GGTGATTAAG
AAAATGGCGT TCGATTCGCC AGCGGTGTAC GAGGCGCTCA TGAATAAGCT CACGGACGCC
GTGATCGAGT ACACGCGGTA CCAAGCGGAC TCTGGGGCGC AAGTGGTGCA AATTTTCGAC
TCGTGGGCAA GCGAGTTTTC GCCCGCGGAT TTCGAAGTGT ACTGCCTGCC GTACCTCCAA
CGCATCGTCG CCGAGTGCAA GCAAACCCAC CCACACGTGC CGCTCATCCT TTACAGCTCT
GGAAGTGCCG GTTTCTTGGA GCGCTTAGCG ACGACGAACG CGGATGTTAT CAGCTTAGAC
GGGACGATTG ACATGGCGGA CGCGCGCGCT CGACTCGGTA TGGATCAGGC GGTGCAAGGG
AACATGGATC CGCTCCACCT CTTTGCGTCG CAAGATTTCA TCACGAAAAA GGTGCACGAA
ACGATCGCCA AGGCTGGAAA CAAGAAGCAC GTCATGAATC TCGGTCACGG GGTCATGGTC
GGCACCCCAG AGGAAAACGT TGGACACTTC TTCAAGACCG TTCGAGACTT CCGGTATTAA
 
Protein sequence
MLRAIRGEDV ERPPIWMMRQ AGRYMKVYQD LCKKHTTFRE RSETVDLAVE ISLQPYEAFK 
PDGVILFSDI LTPLQGMNIP FDIVKGTGPI IFNPVREMDD IKSITPLEPE KSVPFVGESL
KVLRNEIGNE ATLLGFCGAP FTLASYIVEG GTSSHYKVIK KMAFDSPAVY EALMNKLTDA
VIEYTRYQAD SGAQVVQIFD SWASEFSPAD FEVYCLPYLQ RIVAECKQTH PHVPLILYSS
GSAGFLERLA TTNADVISLD GTIDMADARA RLGMDQAVQG NMDPLHLFAS QDFITKKVHE
TIAKAGNKKH VMNLGHGVMV GTPEENVGHF FKTVRDFRY