Gene OSTLU_31680 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31680 
Symbol 
ID5001837 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp348263 
End bp349365 
Gene Length1103 bp 
Protein Length268 aa 
Translation table 
GC content68% 
IMG OID640417258 
Productpredicted protein 
Protein accessionXP_001417993 
Protein GI145347053 
COG category[R] General function prediction only 
COG ID[COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0246535 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.737397 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCGACGCCGC GCCGGCGGAC GCGCGACGCG ACGCGACGCG ACGCGACGCG AGATGACGCG 
CTTCGTCGCG CTCGGTCGCG GGCTCGGCGC GTTCGATCGC GGCGACGACG CGACGCGCGA
CGCGCGACGA CGACGACGGC GACGCGCGCG CGAGGACGCG ACGGCGACGG CGCGGGCGGT
GCGCGACGGC GCGCGACGCG CGGCGCCGAT GGTCGGGAAG GCGGCGAACG CGGTCGCGCT
CGGCGCGGCG CTCGCGGGCC CGGGCGAGCG CGCGGACGGG GCGCGAGGAG GGGGGCGACG
ACGCGAGGGC GGACGCCGCG CGGCGGCGCG CGGCGTGTTC GACGATCCCG ATCGCGCGTG
CACGAACGCC ATCATGATCG GCACGGCGAG CGCGTTCGCG TTACAGATTT TGAGCGGACA
GGCGATCACC GCGCTCGGGG CGAAGGTGAA CGAACGGATC GCCGCGGGGC AGCTGTGGCG
GTTGGCGACG CCGATTTTTT TGCACGGCGG GCTGCCGCAC TTGATGGTGA ATATGTACTC
GTTGAACAGC ATCGGACCGC TCATGGAGGC GACGTTCGGG CGCGAACAGT TTTTAGCGGT
GTATTTCGGC GCGGGCGTGG CTGGAAATTA CGCGAGTTAT CGGTTTTGCG CGTCGAATAG
CGTCGGCGCG AGCGGCGCCG TCTTCGGCTT GGCCGGCGCG TTGGCGGTGT ACTTGCAGCG
CCACAAGCGA TATTTAGGCG AGCGCGCGGA CATGCAGCTG CAACAACTCG GCACGGCGTT
GGCGGTGAAC ATGGGTTTCG GTCTCACGAG TAGACGAATA GACAATTGGG GGCACGCCGG
CGGACTCGTC GGCGGCGCCG CGTTGGCCTT CTTAACCGGA CCTAATCTCG TCATGGAGAC
CGACGGTGGC TACGGTCTGC GACGCAAACT CGTGAACAAA CCCAAGTTAC AATCCACGAT
CCGCGCCATC AAGGATTTCT GGGACGAAGA CGACGAAGAC GAGGACGAGC GATGACTTCC
AGGCTGCCCA GTCCCGAGCA ACAAACGTAG CGCAAGTAGA ACGACAATCG CGTCGAAATC
GCACCGTTAG CGCGTGAGCG CGT
 
Protein sequence
MVGKAANAVA LGAALAGPGE RADGARGGGR RREGGRRAAA RGVFDDPDRA CTNAIMIGTA 
SAFALQILSG QAITALGAKV NERIAAGQLW RLATPIFLHG GLPHLMVNMY SLNSIGPLME
ATFGREQFLA VYFGAGVAGN YASYRFCASN SVGASGAVFG LAGALAVYLQ RHKRYLGERA
DMQLQQLGTA LAVNMGFGLT SRRIDNWGHA GGLVGGAALA FLTGPNLVME TDGGYGLRRK
LVNKPKLQST IRAIKDFWDE DDEDEDER