Gene OSTLU_37994 details


Gene Information

Locus tag: OSTLU_37994
Symbol:
ID: 5003977
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009364
Strand:
Start bp: 482518
End bp: 484149
Gene Length: 1632 bp
Protein Length: 521 aa
Translation table:
GC content: 60%
IMG OID: 640419398
Product: predicted protein
Protein accession: XP_001420012
Protein GI: 145351283
COG category: [L] Replication, recombination and repair
COG ID: [COG1112] Superfamily I DNA and RNA helicases and helicase subunits
TIGRFAM ID:
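
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and uses the values listed above. It assumes that the start and end coordinates are inclusive and that the gene sequence runs from the start codon through the stop codon with no untranslated regions; the variable names are hypothetical.

```python
# Values copied from the gene record above.
start_bp = 482518
end_bp = 484149
gene_length_bp = 1632
protein_length_aa = 521

# Assumption: start and end coordinates are both inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the coding sequence is one codon per residue plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases in the gene that the coding sequence does not account for.
noncoding_bp = gene_length_bp - coding_bp

print(span_bp, coding_bp, noncoding_bp)
```

Under these assumptions the span equals the listed gene length, and the gene is longer than the coding sequence needed for a 521 aa protein, which is consistent with the record marking the gene as spliced.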


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.273766
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCCGA GCACTCTCAA AGCCGTGAAG TCTAAAGTTT TGAACGACGA ACAGTTGCGT 
GTGGTCGAAG ACGTCGTGTC AGGTGCGGGG AAGCAGTATC CATATATTGT TTGGGGGCCG
CCCGGTACCG GAAAGACACT CACGATCGTG GAATGCGTCG CGCACGTGCT CGAGATGTTC
CCTCACGCGA GAGTACTGCT CGCGGCGCCC TCGGCGTTCG CCGCGGATAT TCTTTGCTCG
CGCTTGGCGA AGCGACTCAC CCCTTTCAAA AAGAAAATGA TCGTACGCGT GAACGACGTT
CGTCGCACGC CTGAATCCGT GAAAGCCGAC GTGCGATTTC ATTCGCTCGA AATTTGGCGA
GACGACCCGG AGGAAGCGAA ACAGTACGCG AGCGTGCCAT TTCACTTCTT CAAGCGACCG
GATCCTCTGA AACATTTGAA ACATGCGCGC GTCGTCGTGT GCACGTGCAC GAGCGCTGCT
TTGTTGCGCA AGCTGCCGAT GCCTGTCGAT AGTGTCGTCG AGAACTGGAC GCCGACGCAT
ATTTTTGTCG ACGAGGCGGC GCAGGCTTTG GTTCCGGAGA CACTCATTCC TTTGTCGCTC
GCCAGTTCGG AAACTAGCAT CGTTCTCGCC GGCGATTCCA AGCAGCTCGG TCCCAACGTG
CACTCGAAAG AGGCTGCGCA AGCTGGTTTG CGAAAGTCTC TGCTCGAAAT GTGGATGGAT
CACTCAAAGG AAGAAGTCGC TCGAGGCGTC TGGAACGGCA CGCAACTCCG AGCGTGCTAC
CGCTCGCATC CCGACATCGT CGCGCTGCCA TCGAGAATGT TTTACGACGG TACCGTGGAG
AGTTGCGCGC CGACGGCAAA CACGGATTTG CCAGCAAATT GGGAGAACTT TTCTCGAGGC
GCGGGCAACG GACGCGCGAG TCGTTTCCTC TTCTACGGCG TCAAGGGACG ACAGCGCAGA
GAAGGCAACA CGAGCAGCTG GACCAATCCG ATCGAATGCG CCGAACTGGT CGACTTACTC
GAAGCCTTAC TGGATAGCAC GAACCTCACA CCCGCCGACG TCGCCGTGAT GGCGACGTAT
CGTCGACAAG TCGTGCTCAT TCGCATCGCG CTTCGCGCGC GCTCGCTCGG CGCCATTCGC
GTCGGTACCG TCGACGATTT CCAAGGGCAA GAGGAGAAAA TCATCTTCAT CTCCACCGTC
GTCACGCGTC CAACGACCCT CGACGCGTTG GATTCCGAGA TTGGCTTCCT GAACAACCCC
AAACGTTTCA ACGTCGCAAT CTCCAGAGCG ATGGCGTTAA ACGTCATCGT CGGACATCCC
CTCGTGCTTC TTCAGAATCC CCTATGGGCC GAGCTCGTGC GCGAATGCGT TCGCCGCGAC
GCCTTTCGCG GCGCCGGCGC CGAGTACCTT CCTCGTTTCG CCGGTGGCGG CCACGATTTC
GCCCTCCCGT CGTCTCTCGA CGACGACGAC GTCCGTCCCT CTCGCGGCGC CTCCGACGCC
GTCGCCGACG CCGTCGCCGC CGTCGCCGAG CTCGCGCTCC TCGGCGGCGG CGCCTCGGAC
GCGTTGTCGT CCCAGGACGG TCACGCGTGG GACGATTGGG GCGACGAGCC CTCCTGGCGC
GTCGCCGTGT GA
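
The GC content field in the record can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: `gc_percent` is a hypothetical helper that strips the 10-base grouping whitespace used in the listing and counts G and C bases. How the database rounds the value or treats ambiguous bases is not stated in the record.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a whitespace-grouped sequence."""
    # Remove the spaces and line breaks used to group bases in the listing.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence listing, used here only as sample input.
first_line = "ATGACGCCGA GCACTCTCAA AGCCGTGAAG TCTAAAGTTT TGAACGACGA ACAGTTGCGT"
print(round(gc_percent(first_line), 1))
```

Passing the full 1632 bp listing to the same function would give a figure to compare against the GC content reported in the record.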
 
Protein sequence
MTPSTLKAVK SKVLNDEQLR VVEDVVSGAG KQYPYIVWGP PGTGKTLTIV ECVAHVLEMF 
PHARVLLAAP SAFAADILCS RLAKRLTPFK KKMIVRVNDV RRTPESYASV PFHFFKRPDP
LKHLKHARVV VCTCTSAALL RKLPMPVDSV VENWTPTHIF VDEAAQALVP ETLIPLSLAS
SETSIVLAGD SKQLGPNVHS KEAAQAGLRK SLLEMWMDHS KEEVARGVWN GTQLRACYRS
HPDIVALPSR MFYDGTVESC APTANTDLPA NWENFSRGAG NGRASRFLFY GVKGRQRREG
NTSSWTNPIE CAELVDLLEA LLDSTNLTPA DVAVMATYRR QVVLIRIALR ARSLGAIRVG
TVDDFQGQEE KIIFISTVVT RPTTLDALDS EIGFLNNPKR FNVAISRAMA LNVIVGHPLV
LLQNPLWAEL VRECVRRDAF RGAGAEYLPR FAGGGHDFAL PSSLDDDDVR PSRGASDAVA
DAVAAVAELA LLGGGASDAL SSQDGHAWDD WGDEPSWRVA V
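
The start of the gene sequence can be checked against the start of the protein sequence by translating it. This is a sketch under a stated assumption: the Translation table field in the record is blank, so the standard genetic code (NCBI translation table 1) is assumed. Because the gene is marked as spliced, a translation of the full genomic sequence would not be expected to match the protein, so only the first 30 bases are checked. `translate` and `CODON_TABLE` are hypothetical names.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), an assumption here.
# Codons are enumerated in TCAG order with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a whitespace-grouped DNA string codon by codon ('*' marks a stop)."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence and first 10 residues of the protein.
gene_start = "ATGACGCCGA GCACTCTCAA AGCCGTGAAG"
protein_start = "MTPSTLKAVK"
print(translate(gene_start) == protein_start)
```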