Gene OSTLU_43137 details


Gene Information

Locus tag: OSTLU_43137
Symbol:
ID: 5005576
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009368
Strand:
Start bp: 352378
End bp: 353583
Gene Length: 1206 bp
Protein Length: 402 aa
Translation table:
GC content: 61%
IMG OID: 640420997
Product: predicted protein
Protein accession: XP_001421269
Protein GI: 145353969
COG category: [R] General function prediction only
COG ID: [COG1204] Superfamily II helicase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGGGCG CGCCCCCGCC CCTGTACCCG TGGCAGGCCG AAGCGATAAA CCTCGTCGCC 
GCGTCGCGTC GCTCGCTGTG CTACACGGCG CCCACGGGCG GCGGTAAAAG TCGCGTCGCC
GATGAGTTGT TACGACTGTC CCTGACAAAC TTTCCCGCGG CGAAAGCGCT CGTCGTGCTG
CCGTACGTCG CGCTCGTGCG GGAAAAAGTG GCGTCGTTGG AGTCGTTGTT ACGCCCGCTC
GGGATCAAGG TGAGGGGCTA CGCCGGGGTG GAGTGCGAGG GGGCGCCGTT GGGGAGCTCG
AGAGAGCGCT GCGCGGTGAC GACGATTGAA AAGGCGTCGT CGTGCGTCAA TAGACTGTTC
GAAACGGGCG AGATATCGCT GCTCAGCGTC GTGGTGGTGG ATGAGCTGCA CATGGTCAGC
GAAGACGAAC GTGGGTGCGC GCTGGAGGGG ATGTTGGCGA AGATACGACA CGGAGTGAAG
TCGGGAAAGG TGTCGAGCGA CGGCCCGCAA ATCGTGTGCA TGAGCGCGAC GGTGGGAAAG
TCATCGATGG AACGCTTAGC GAGGTGGTTA GACGCGGAGA TTTACGTCAG TCATCATCGG
CCGGTGGAGT TGAAAGAGTA CGTGGTGTGC GTGGGTGGGG TGTATGCGAA GGAGAATCGA
GGTGAGGCGG GCTGGGAGCT GACGCGCGTG GCCGATTCGC CGTCGCGAGT GGAGTTGGAG
ATCGTCGCCG AACTGGTTGG TCAAGTGTTC GTCAACGCGC ACAGCTCGTT GGTATTTTGC
TCGAGCAAGA GTCAGTGTTC AGTTTATGCG ACGAAATTGG CGAGCTTGCT TCCGGTGAAT
CCGAACACGG CGCACCTGCG AGAAGAGTGC GTGGCGAGAC TCTACGAAGC TGCGGAGGGC
GAGCCCGACC AAGCGCTAGT AGCGTGTGTT CGCTCCGGTC TCGCGTGGCA TCACGCCGGA
TTAACGACGG CAGAGAAGAG AGTAATCGAA GAGGGCTTTC GAGCTGGTGC GATTTTAGCG
CTCACATGCA CGACGACTCT TGCTGCGGGC GTCAACTTGC CCGCTCGCCG TTGCGTCATC
CTTCGCGGCT TCATCGCCGG TTTACCGACG CCTTCGATGG CTCAGTACAA ACAAATGGCT
GGTCGAGCTG GAAGAAAAGG GCAAAGCGAT TTCGGTGAAT CTTTCCTAGT CACGACGAAA
CAAGAG
 
Protein sequence
MRGAPPPLYP WQAEAINLVA ASRRSLCYTA PTGGGKSRVA DELLRLSLTN FPAAKALVVL 
PYVALVREKV ASLESLLRPL GIKVRGYAGV ECEGAPLGSS RERCAVTTIE KASSCVNRLF
ETGEISLLSV VVVDELHMVS EDERGCALEG MLAKIRHGVK SGKVSSDGPQ IVCMSATVGK
SSMERLARWL DAEIYVSHHR PVELKEYVVC VGGVYAKENR GEAGWELTRV ADSPSRVELE
IVAELVGQVF VNAHSSLVFC SSKSQCSVYA TKLASLLPVN PNTAHLREEC VARLYEAAEG
EPDQALVACV RSGLAWHHAG LTTAEKRVIE EGFRAGAILA LTCTTTLAAG VNLPARRCVI
LRGFIAGLPT PSMAQYKQMA GRAGRKGQSD FGESFLVTTK QE
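
Cross-checking the record

The figures in this record can be cross-checked against the sequences shown above. The following is a minimal Python sketch and is not part of the source database: the helper names (clean_sequence, span_length, gc_percent) are hypothetical and introduced here only for illustration. It assumes that the coordinates are 1-based and inclusive, and that GC content means the share of G and C bases over the whole gene sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in space-separated blocks into one string."""
    # str.split() with no arguments splits on any run of whitespace,
    # so spaces between blocks and line breaks are both removed.
    return "".join(blocked.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in an already cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# Coordinates taken from the Gene Information table.
gene_length = span_length(352378, 353583)
print(gene_length)       # 1206
print(gene_length // 3)  # 402 codons

# First two blocks of the gene sequence, used here only as a short sample.
sample = clean_sequence("ATGCGGGGCG CGCCCCCGCC")
print(len(sample), gc_percent(sample))
```

Dividing the 1206 bp gene length by 3 gives 402 codons, which matches the listed protein length of 402 aa. This suggests the gene sequence as displayed does not include a trailing stop codon. To check the reported GC content, pass the full gene sequence through clean_sequence and then gc_percent.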