Gene OSTLU_31592 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31592 
Symbol 
ID5001907 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp194789 
End bp195914 
Gene Length1126 bp 
Protein Length325 aa 
Translation table 
GC content59% 
IMG OID640417328 
Productpredicted protein 
Protein accessionXP_001417948 
Protein GI145346959 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0181] Porphobilinogen deaminase 
TIGRFAM ID[TIGR00212] porphobilinogen deaminase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.784447 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGTGA GAGCGGCGGT GGACGCGCCC ATCGTGAAGA TTGGTACGCG AGGATCGCCG 
CTGGCGCTCG CGCAGGCGTA CATGACGCGC GATTTGTTGA AGGAGAACTT CCCGGAACTC
GCCGAGGACG GCGCGTTGGA GATTTGCATC ATTAAGACGA CCGGGGATAA GGTTTTGGAC
CAGCCGTTGG CGGATATCGG GGGTAAGGGT TTGTTCACTC GCGAGCTCGA CGACGCCTTG
CTCGACGGGC GCATCGACAT CGCCGTGCAC TCGATGAAGG ATGTGCCGAC GTACTTGCCG
GAAGGGATGG TGTTGCCGTG CATGTTGCCG CGTGAAGATG TCAGAGATGC GTTCTTGTGC
TTGAAGTATG ACTCCTTGTC GCAATTGCCG GAAGGGGCGG TCGTCGGCAC GGCGTCTCTT
CGCCGCCAGT CGCAGCTCTT GTACAAGTTT CCAACGCTCA AGTGCGTGAA CTTTAGAGGT
AACGTGCAGT CGCGCATTCG CAAGCTCAAG GAGGAAGTTG TTGACTGCAC CTTGCTCGCT
ATCGCGGGTT TGAAGCGCAT GGACCTGGCC CAACACGCCA AGGTCATCAT CCCCACCGAA
GAAATGTTGC CCGCCGTCGC GCAAGGCGCC ATCGGTATCA CCTGCCGCGC GGGCGACGAC
AAGCAGCTCG CGTTCTTGGC CAAGCTTAAC CACGAAGACA CGCGCATGGC TGTTGAAGGC
GAGCGCTCTT TCTTGGCCGC TCTCGATGGC TCTTGCCGCA CCCCGATCGC CGCTCACTGC
CACCTCGTCG ACGGTAAGAT GCAGTTCCGC GGTTTGATCG CCTCCCTCGA CGGCAAGCAA
GTTCTCGAGA CCACCCGCGA AGGTGCCTGG GACGCCGCGT CGTTGTTGGA CGCCGGTAAG
GACGCCGGCG CCGAGCTCAA GGGTAAGGCC CCGGCTGATT TCTTCGCCAA CTTGATCGAA
AACGGCGGTG GCTGGTAATC GCTCCGTCCA TTTCTCGCCC GACGTTCGCT CCGAGCGCTC
GCCGTCGCGA GAAATTTATT CGCTCTATCC ACCACATCAC TATCCTTTCG CGTGAGTTTA
GTTATGATCG AATCTTTGTC AATTTCTTAT ATTCGCCATA TTACGC
 
Protein sequence
MVVRAAVDAP IVKIGTRGSP LALAQAYMTR DLLKENFPEL AEDGALEICI IKTTGDKVLD 
QPLADIGGKG LFTRELDDAL LDGRIDIAVH SMKDVPTYLP EGMVLPCMLP REDVRDAFLC
LKYDSLSQLP EGAVVGTASL RRQSQLLYKF PTLKCVNFRG NVQSRIRKLK EEVVDCTLLA
IAGLKRMDLA QHAKVIIPTE EMLPAVAQGA IGITCRAGDD KQLAFLAKLN HEDTRMAVEG
ERSFLAALDG SCRTPIAAHC HLVDGKMQFR GLIASLDGKQ VLETTREGAW DAASLLDAGK
DAGAELKGKA PADFFANLIE NGGGW