Gene OSTLU_38583 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_38583 
Symbol 
ID5001815 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp205573 
End bp207837 
Gene Length2265 bp 
Protein Length754 aa 
Translation table 
GC content53% 
IMG OID640417236 
Productpredicted protein 
Protein accessionXP_001417951 
Protein GI145346965 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1197] Transcription-repair coupling factor (superfamily II helicase) 
TIGRFAM ID[TIGR00580] transcription-repair coupling factor (mfd) 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.449683 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.114344 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCGAG ATAAGTTTAA CGACGGCGGC GCGATTTCGG CATGGGACAC GTTTAGAACG 
AGCATGGATA GCTCTGAAGC CGATGAACAG GTGGATGACA TGCTTGAAAG CGCGGCGGAA
GTAGAAGAAA AGAATAGACG CGTGCGAGAC GGTTATGTGG AAGATGCGTT CGGGGTGAAC
AACGCGATTG ATCCTTTTAA ACTCGTCACG GGTGAGTACG TGGTGCATCG AAAGTACGGC
ATAGGTCAGT TTCTCGGCAT GAAGGTATTA GCCGTGGAGT CAGCCAATGA AGGTACGCAA
AACAAACCGT TCTTGTTCCT GAAGTATCAG GATGCGACGG CGAAGATTAG TCCAGAGGCG
TCGAGGCGAT TGCTCTACCG CTTTTGTTCT CCTGGAGGAT TGGTGAAACC ACCAAAGCTC
AACAAGCTCA ACGACAAATC GACTTGGGAT TTGAGGGAAA GGAAGACCGA GGCAACGATT
CGTCGTCTAG TGGTGAACCA AATGGTGGTG TATCTCCAAA GGTTGCAATG TGTGCGAGAG
CCGTACCCGT TGCCTGACCC CGAGAGGGCG AAGCAGTTTG ACGCGTCGTT CCCGTTTACG
CTCACGCCAG ATCAAACGAG CGCGATTCAA GAAATCACCG AAGATTTGCA GCAAGACGCT
CCGATGGATC GATTAGTTAT TGGTGACGTC GGTTTTGGGA AGACGGAAGT CGCCATGCGC
GCGATGTTTC ATGTGGCAAG CAGCGGAGGG GGCGTATTCA TGATGGCGCC GACCACCGTC
TTAGCGAAGC AGCACGCGGC AAACCTCGCC GTTCGATTTC GCCCGTTGGG TATAAACGTT
GAATTAGTCA CTAGGCACAT CCAAGCCGCA AAGCAAAACA CAATCTTCGA TGATTTTAGG
GACGGTAAAG TGCAAATCAT CGTCGGTACG CATAAGCTGG TGAACTTGGA GCAAGAGTAT
TACAAGCAGC TCAGATTACT CGTTATAGAC GAAGAACAAA GATTCGGTGT CAAGCACAAG
GACCAGATAA GTGCGTTGAA AGCTGAAGTC GATGTTCTCA CGTTGTCGGC GACGCCAATC
CCGCGCACGC TGCACATGGC CATGTCGGGA TTCCGCGACG CGTCGCTGGT GCAGACGCCG
CCACCGGAGC GCCGTCCAAT TAACACGGTG CTCGCGCCGC AGAACGACGA CGACATCAGA
AAGGCGATCG AGTACGAAAT CTCGCGGAAT GGGCAGATAT ATTACATCGT ACCGCGGATC
AATATGATGC GCGACGCATG CGATCGACTG TTGCGCCTTT TCCCGAATTT ACAAATCATG
ACGGCGCACG GACAAATGGA CGGCGAAGCC ATCGACGACG CCATGGAGTC ATTTTCAAAC
GGTTCGGCGG ACGTACTGAT CGCGACGACG ATCGTCGAGT CTGGTTTGGA CATTCCCAAC
TGTAACACGA TCATCATCGA AAATGTGCAG TTTTTTGGGC TCGCTTCGTT GTATCAGCTT
CGCGGTCGCG TTGGTCGGGC CGGTCGCCAG GCGTACGCTT ACATGTTCTA CTCCGCAGAC
GAGAGTGAGC TGACGACGGG CGCGCAGGAG CGCTTGGCCG CGCTGGAGGA ATGCTGCGGA
TTGGGCGAAG GGTTCCGTCT GTCAGAGCGA GATATGGGCA TTCGAGGTGT CGGCACGATG
TTTGGCGAAA AGCAAAGCGG AGACGTCGAT AGTGTCGGAG CCGATTTATA CCTGGAGCTT
CTCTACAAAC AGCTGCAACG CATCGATAAT CTAAGAATCA AGACGATTGA TGCCGATGAC
GTTCGAGTCG GTGCCGCTGG TTATGAATTC GGGATCACGC CGTTCTACAT CGCCACCACG
GAGGCGAGCG ACGAAGTCAA GGCGACGATT GACTCAATCA CCGCGCACGA ACAAGTGCAC
GACGTCCTCG CGCTGATGCG TGATACGTTT GGTGAACCTG ACGAATTCAG CCTGTCATGC
GTCTTTGCCA GGGAAATGCG CATACTGGCT GGTGATCTCG GCATTCAAGG AATTTTGCTC
GACAGTCCCA CCGCTCCCAT CATCGATTTG ATCACGGATG CGTCGATCAT GGTGAAAGAA
CTTCTCGTCG AAGGTATTAG CGATGCGTAC GACGTGGAAA TCATCGACAC AGGTATCCGG
CTCAAGACAA TGACTGATAT GACGATGCAC GGCAAGGTGA TGTACACGGT TAAAATCTTG
CGCCAAATCA CTGGCTCCAT CCCATCCTTC GTGAAGTACT TGTAG
 
Protein sequence
MVRDKFNDGG AISAWDTFRT SMDSSEADEQ VDDMLESAAE VEEKNRRVRD GYVEDAFGVN 
NAIDPFKLVT GEYVVHRKYG IGQFLGMKVL AVESANEGTQ NKPFLFLKYQ DATAKISPEA
SRRLLYRFCS PGGLVKPPKL NKLNDKSTWD LRERKTEATI RRLVVNQMVV YLQRLQCVRE
PYPLPDPERA KQFDASFPFT LTPDQTSAIQ EITEDLQQDA PMDRLVIGDV GFGKTEVAMR
AMFHVASSGG GVFMMAPTTV LAKQHAANLA VRFRPLGINV ELVTRHIQAA KQNTIFDDFR
DGKVQIIVGT HKLVNLEQEY YKQLRLLVID EEQRFGVKHK DQISALKAEV DVLTLSATPI
PRTLHMAMSG FRDASLVQTP PPERRPINTV LAPQNDDDIR KAIEYEISRN GQIYYIVPRI
NMMRDACDRL LRLFPNLQIM TAHGQMDGEA IDDAMESFSN GSADVLIATT IVESGLDIPN
CNTIIIENVQ FFGLASLYQL RGRVGRAGRQ AYAYMFYSAD ESELTTGAQE RLAALEECCG
LGEGFRLSER DMGIRGVGTM FGEKQSGDVD SVGADLYLEL LYKQLQRIDN LRIKTIDADD
VRVGAAGYEF GITPFYIATT EASDEVKATI DSITAHEQVH DVLALMRDTF GEPDEFSLSC
VFAREMRILA GDLGIQGILL DSPTAPIIDL ITDASIMVKE LLVEGISDAY DVEIIDTGIR
LKTMTDMTMH GKVMYTVKIL RQITGSIPSF VKYL