Gene OSTLU_36189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_36189 
Symbol 
ID5000476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp124230 
End bp127226 
Gene Length2997 bp 
Protein Length998 aa 
Translation table 
GC content62% 
IMG OID640415897 
Productpredicted protein 
Protein accessionXP_001416316 
Protein GI145343362 
COG category[L] Replication, recombination and repair 
COG ID[COG1643] HrpA-like helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.804187 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAGGTC GACGCGAGGG CGAAGAGGCG ACGAACGCGC GAGCGGCGCG CTTGCGGGCG 
AAGATCGCGA GCGATGCGTC GTTGGCGGCG ATACAGACGA AACGCGAACA GTTACCGGTG
CGGGAGTTTA AGGATGCGAT ATTGAACGCG GTACGGGCGA ATCAAGTCGT GCTCGTCGCC
GGGTCGACGG GTTGCGGGAA GACGACGCAG GTGCCGCAGT ACGTCTTGGA CGATGCGTGG
GCGAACGGGC GCGGGGCGTC GATCGTGTGC ACGCAACCGC GAAGGATTAG CGCGATGACG
GTTTCCGAGC GCATCGCGAA CGAGCGCGGG GAGAGCATCG GGCAGAGCAC GGTCGGTTAC
CAGATTCGAT TGGAAAGCCG GGTCTCGGCG GATTGTTCGT TGTTGTTTTG CACGTCCGGC
GTGCTGTTGC GACGACTCAC GAGCGAGGCG TCGGATAAGC TGTGCGAGTC ATTGACGCAT
ATCATCATCG ACGAGCTGCA CGAGCGAGAT TTGTTTGCGG ATTTCCTAAC CATCATTTTG
AAGGGCGTGA TTCCGAAGCA TCCGCACCTA AAGCTCGTGC TGATGTCGGC GACGATGCGC
GAAGATTTGT TTAGCGAATA CTTTGGTGGG TGTCCGGTGA TTTCAGTGCC AGGTTATACG
CATCCGGTGA ATGAGTATCA CCTGGAAGAT ATCTTGCCCA TGATCGGATG GGGCGGCGTG
CATCACACGT CGAAGAAGGC GAGCGGAGGC GGCGGCGGCG AACCGAGAGT GCGCGCACCC
ACTTCGGGCG CGAGCGTGGA CGTCATGCGC GAGGCAATCA TGCGAGCATT TTTAGAGGAC
ACCGACGAGT CGTTCGATTG GCTCATGCAG TGCGCGCGCG AGACAGATTC TGCGAGCGGG
TTGTCGCACG TAAACGTCGC GCACTCCACG GGCGCCACCG CGCTCATGGC GGCGGCGGGT
AAGGGAAGAC AGATGGAAGT GTCGCAGCTT TTAGGTTTAG GAGCGTCGCC CGCGATGCGA
AGCACCGACG GGAGCAACGC CGCGGATTGG GCGGATAAGT TTGGACACGT CGAGCTCGCG
GACGCGTTGC GAAGCGTGGA CGACGAAAAC GAAGACGCGG GAAGTCACGA GCAGTCGGCG
CTTCTATTGA GCGATTACCA GCTCTCCGTG GATCCAGACG AGGTGGACGT GGACTTAATC
CACAATTTGA TCGTTTGGAT CATGAAAGAG CGCGCGATCG ACGAAGGATC CGAGGGCGCG
ATTTTAGTCT TTTTGCCCGG CTGGGACGAA ATCTCCAAAC TTCGCGACTC GTTGACGGCG
GATTACAACG TCTGTCACTC GGCGAGCGTC CTACCTTTGC ACTCCATGGT CGCCCCGGCG
GATCAACGAA AAGTCTTCCA ACGTCCACCT AAAGGGTTGC GCAAAATCGT CCTCTCCACC
AACATCGCGG AGACGGCGGT GACGATTGAC GACGTCGTCT TCGTCATCGA CAGCGGGCGG
TTGAAGGAAA AGAGTTACGA CGCGTACTCT GCGGTCTCTA CGCTCCAGGC GGCTTGGATC
TCGCAAGCGA GTGCGAAACA GCGACGCGGT CGCGCCGGTC GCGTGCGTCC CGGCGAGTGC
TATCGCGTGT ACTCCACCTC ACGGTACGAC TCGTTCGCGC AGTACCAGTT GCCCGAGATG
CAGCGGTCGC CGCTCGAGGA GCTGTGCTTG CAGGTGCGCG TGTTGGCCGA AAGCGGCGCG
GGCGTCGTGG ACGATGGGCC GGGAAGCACG GCTGGGTTTC TCGCGCGCGC GGTCGAGCCC
CCTGTGGCGC AAGCGACGGA CAATGCGGTG CAATTGCTCA AGGACATCGG CGCTTTGACG
GAGGAGGAGC GCCTCACGCG ACTCGGCCGC CATCTCGGCG AGCTTCCGTT GCACCCGCGC
GTGGGGAAGA TGATCTTGTA CGCCGCTCTG TTTGGCGTTC TCGATCCGAT TCTCACCGTC
GCGTGCGCTG CGGCGTATCG TCCGCCCTTC ATCATCTCCG CCGACGGTCG AAAATCGGGC
GACGCCAGTC GCGCGGCGTT TTCCAACGAA GCCGGCGGCG GGAGCGATCA CTTGGCGGTG
ACCAAGGCGT ACATGGCGTG GGAGCAAGTT CAGCGCGATG GGCGTCAAAA TGAAAGGTAC
TTTTTGAACG CGAATTCTTT GTCGCCGTCG ACGCTGCACA TGATCAAGGG CATGCGACAG
CAATTAATCA CGGCGTTGAT TCAGCGCGGC ATCATTTCAG ATTTGCGAAG CGCGAGCGCA
AACTCGTCAT CCGGCGCGCT TGTGCGCGCG GTGCTCGCCG TGGGCATGTA CCCTTTGGTG
GGACGATTTT TACCAAAGTG CAAAGCGCCG ACGTTGGCAA CGCTTCGCGG CGAGCGCGTG
CGCGTGCACG CGTTTAGCGT CAACGGCAAA CTCGACGTGA GCGCGCTCGG CGAGCTCAAC
GAATCGGGTG AAAAAATTGC CACCTTGGCG TGCTTCGACG AACTCATTCG AGGCCCTCAC
GCGGTGCAAG TGCGCGAGTG CACGTTGGTC GCCGCCGCGG CGATCGTGTT CGTGTGCTCC
ACGCTCACGG TGAAACCAGA CGTGCCGCAA ATCGATCCCG AAACCGGCGA GGCGCGCGCG
AGAGACGGTC CGCCGTCGGC GTTGTTGGTC GTGGACAATT GGTTGAGATT TCGCGTGCCC
TTGCGCGCGG TGGCGCAGAT CACGGTATTG CGCTTACGTT TGCACAAAGC GTTCGCCATG
CGCGTCGAGC GACCGAAAGA CGCGCTACCG GCGGATATGC GAGGCGCCGT GGACGCCATC
GCGCGCGTGC TGAGCGACGC CGACGCCGCG TTCATCGAGT CCTCGAGTTT CGCTCGCAGT
TTCGCAGGCT TCGGCGGCGG GCGCGGCGGC GGCGGGCGCG GCGATGGAGG TCGCGGCGGT
CGCGCGCGCG GCGGTCGCGC GCGCGGCGGC GCGAGAATCC CGGCGCCTCG ACGATAG
 
Protein sequence
MRGRREGEEA TNARAARLRA KIASDASLAA IQTKREQLPV REFKDAILNA VRANQVVLVA 
GSTGCGKTTQ VPQYVLDDAW ANGRGASIVC TQPRRISAMT VSERIANERG ESIGQSTVGY
QIRLESRVSA DCSLLFCTSG VLLRRLTSEA SDKLCESLTH IIIDELHERD LFADFLTIIL
KGVIPKHPHL KLVLMSATMR EDLFSEYFGG CPVISVPGYT HPVNEYHLED ILPMIGWGGV
HHTSKKASGG GGGEPRVRAP TSGASVDVMR EAIMRAFLED TDESFDWLMQ CARETDSASG
LSHVNVAHST GATALMAAAG KGRQMEVSQL LGLGASPAMR STDGSNAADW ADKFGHVELA
DALRSVDDEN EDAGSHEQSA LLLSDYQLSV DPDEVDVDLI HNLIVWIMKE RAIDEGSEGA
ILVFLPGWDE ISKLRDSLTA DYNVCHSASV LPLHSMVAPA DQRKVFQRPP KGLRKIVLST
NIAETAVTID DVVFVIDSGR LKEKSYDAYS AVSTLQAAWI SQASAKQRRG RAGRVRPGEC
YRVYSTSRYD SFAQYQLPEM QRSPLEELCL QVRVLAESGA GVVDDGPGST AGFLARAVEP
PVAQATDNAV QLLKDIGALT EEERLTRLGR HLGELPLHPR VGKMILYAAL FGVLDPILTV
ACAAAYRPPF IISADGRKSG DASRAAFSNE AGGGSDHLAV TKAYMAWEQV QRDGRQNERY
FLNANSLSPS TLHMIKGMRQ QLITALIQRG IISDLRSASA NSSSGALVRA VLAVGMYPLV
GRFLPKCKAP TLATLRGERV RVHAFSVNGK LDVSALGELN ESGEKIATLA CFDELIRGPH
AVQVRECTLV AAAAIVFVCS TLTVKPDVPQ IDPETGEARA RDGPPSALLV VDNWLRFRVP
LRAVAQITVL RLRLHKAFAM RVERPKDALP ADMRGAVDAI ARVLSDADAA FIESSSFARS
FAGFGGGRGG GGRGDGGRGG RARGGRARGG ARIPAPRR