Gene OSTLU_3761 details

Gene Information

Locus tag: OSTLU_3761
Symbol:
ID: 5003211
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009362
Strand:
Start bp: 600630
End bp: 601649
Gene Length: 1020 bp
Protein Length: 340 aa
Translation table:
GC content: 60%
IMG OID: 640418632
Product: predicted protein
Protein accession: XP_001419215
Protein GI: 145349596
COG category: [J] Translation, ribosomal structure and biogenesis
              [K] Transcription
              [L] Replication, recombination and repair
COG ID: [COG0513] Superfamily II DNA and RNA helicases
TIGRFAM ID:
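
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is an assumption not stated in the record but one that reproduces the listed gene length. The function names `gene_length` and `codon_count` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def codon_count(length_bp: int) -> int:
    """Number of whole codons in a coding sequence of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return length_bp // 3


# Values taken from the record above.
length = gene_length(600630, 601649)
print(length)               # 1020
print(codon_count(length))  # 340
```

Under that assumption, the 1020 bp span corresponds to 340 codons, matching the listed protein length of 340 aa. This is consistent with the displayed gene sequence, which does not end in a stop codon.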


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value: 0.695785
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.516627
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
AAGGATGTCG TCGCCGTCGC GAAGACGGGG AGCGGGAAGA CGTTGGCGTT TCTGTTGCCG 
ATGTTTCACG GTATGAAGCG ACACGGTGGC GTGGAGGGAC TCGTCGTCGC CCCGACGCGG
GAATTGGCGA TACAGATTCA AGCCGAAGCG GAGAAGTTCG GCGCGGCGCA TGGGTTTCAA
AGCGTCGTGG TGTACGGCGG CGCGAGCGCG TACGAGCAAA AGAACGCGTT GCGAACGAAA
AAGCCGTGCC TCGTCATCGG CACGCCGGGG CGATTGACGG ACTTGATGAG TCAAGAGGGG
GTGCTTTCGC TCGCCGAGCT TTCGGTGATC GTGCTGGATG AGGCGGATAG GATGTTAGAT
ATGGGGTTTG AGCCGCAGAT TAAGCAAATC TTCGGCGCGA CGCCGACGAA GCGGCAGACG
CTCTTGTTTT CGGCGACGTG GCCGAAATCC GTGCGTAAGC TCGCGGCGGG GTATTTAAAT
CAAGATAAAT CGTGCGTCGA AGAGATTTTC ATCGGCGAAG GCGCGTCGGA CGGCGAACTG
GCGGCGAACA AGGCTATCAC GCAACGCTTC ATCGAGGCGA GAGACCACGA AAAAGACGAG
CACTTGTACA ATCTCATTTG CGAGTTTCCA GACGAGTCTC GCGTCGTCGT GTTCGCGAAT
ACCAAGCGTC GCGTCGAAAA TCTGGCGAAA ACGTTCGCCG CGGAAGGTTT CGGCACCGTC
TCCGTGCACG GCGATAAATC TCAAGCCGAC CGCGAGGCGT CTCTGCGCAA ATTCGTCGAA
AACAAGGCGC CGCTCATGAT GGCCACCGAC GTCGCCGCGC GCGGTTTAGA CATCAAGGGC
GTCACCCACG TCATCAATTA CGACATGGCG CGCGACGTTG AGAGTTACGT CCACAGAATC
GGTCGAACCG GCCGCGCCGG CGAACTCGGC GCCGCCGTCA CGTTTTGGAA CGTCGATTAC
GACAAGCCCT GCACCCCGGC GCTGTGCAAA ATCGCTCGAG ACGCCGGTCA GGCTGTCCCA
 
Protein sequence
KDVVAVAKTG SGKTLAFLLP MFHGMKRHGG VEGLVVAPTR ELAIQIQAEA EKFGAAHGFQ 
SVVVYGGASA YEQKNALRTK KPCLVIGTPG RLTDLMSQEG VLSLAELSVI VLDEADRMLD
MGFEPQIKQI FGATPTKRQT LLFSATWPKS VRKLAAGYLN QDKSCVEEIF IGEGASDGEL
AANKAITQRF IEARDHEKDE HLYNLICEFP DESRVVVFAN TKRRVENLAK TFAAEGFGTV
SVHGDKSQAD REASLRKFVE NKAPLMMATD VAARGLDIKG VTHVINYDMA RDVESYVHRI
GRTGRAGELG AAVTFWNVDY DKPCTPALCK IARDAGQAVP
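
The GC content field can in principle be recomputed from the gene sequence. The sketch below shows one way to do that for a sequence formatted in space-separated blocks as above. The helper `gc_content` is hypothetical, and the record does not say how the database rounds or computes its own figure, so a small difference from the listed 60% would not be surprising.

```python
def gc_content(seq: str) -> float:
    """Percent of G and C among all non-whitespace characters in seq."""
    bases = "".join(seq.split()).upper()  # drop spaces and line breaks
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence, used only as a small example.
print(gc_content("AAGGATGTCG"))  # 50.0
```

Passing the full gene sequence as a single string, with its spaces and line breaks intact, gives the figure for the whole gene.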