Gene OSTLU_94724 details

Gene Information

Locus tag: OSTLU_94724
Symbol:
ID: 5003802
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009363
Strand:
Start bp: 411356
End bp: 413296
Gene Length: 1941 bp
Protein Length: 633 aa
Translation table:
GC content: 61%
IMG OID: 640419223
Product: predicted protein
Protein accession: XP_001419800
Protein GI: 145350831
COG category: [L] Replication, recombination and repair
COG ID: [COG5260] DNA polymerase sigma
TIGRFAM ID:
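
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene span holds only coding sequence, one stop codon, and intron bases. The function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 411356
END_BP = 413296
GENE_LENGTH_BP = 1941
PROTEIN_LENGTH_AA = 633


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def non_coding_bases(gene_len: int, protein_len: int, stop_codons: int = 1) -> int:
    """Bases in the gene span not used by codons.

    Assumes the span contains only CDS, one stop codon, and introns.
    """
    return gene_len - 3 * (protein_len + stop_codons)


print(span_length(START_BP, END_BP))                         # 1941
print(non_coding_bases(GENE_LENGTH_BP, PROTEIN_LENGTH_AA))   # 39
```

Under these assumptions the span matches the listed gene length, and the 39 bases left over are consistent with the record marking the gene as spliced.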


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.127729
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000571267
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGATCAGCA CGCGGTTCGA GGGCGTGCGG GTGGCGCCGT TTGGGTCGTA CGTGAGCGCG 
TTTCACTCGG CGGGGAGCGA TATAGATATT TCGCTGCAGA TAGACAAGAA TGGACCGTGG
TACGATGAAA AGGAAGAAGC GCAGGCGCGC CGATCGCAGC GGGGCGGGGT GCGCGCGCGT
CGACAACAGC GCCAGGGTCG AACGAAACGC GCGCAATTGT TGCGCAAAGT CGCCTCGGAG
TTGCGGTATC GCAATTACCG CGACGTCCAA CTCATCTCCA AGGCTCGGGT GCCTTTGATC
AAGTTCAAAG ACCCGCAGAC GGGCGTCGCG TGCGATGTGT GCATCGAGAA CGACGGGGTG
TACAAGAGCG CCGTGCTCGG CGTCGTCGCG GATATCGATC AACGTTATCG CGACTTGGTG
TTCTTGATAA AGCTTTGGGC CAAGCATTAC GATGTGAACA ACGCCATGGA GGGATCGTTC
AACTCGTACT CGTTGTGTTT GCTTTGTATG CATCATTTGC AGCGCCGACC AGTGCCAATT
CTGCCGCCGA CGATGCTGCT CACGCTTCCT CGTCCCGATT TGGTGGAATC GGAAAAGCGC
GAACTCGAGG AGCATTTGAA AAGTGAGGAC GACCAGTTCG ATACTTGGAA AGTTAGTAAA
GCTCGCGTCG TGAGTGATGC ATCGAGGGAC ATCGCGGCGG TAAAGTACCG CGCCGATAAG
TTCGCGGGTT TCGGAAAGGA AAACACCGAG ACGCTCGCGG AACTCTTTGT GAGCTTCTTC
GCGCACTTGT GCGCCATCAA AGATTTGTTT CGGAACGCGG TGAACGCGTC CACGTATCAT
GGTACGTTCA TCGTCGGTAG CTCTTGGCAA GCGTTCAAAT ATCCACTCGG TGTGGAAGAT
CCGTTTGCCG CCGGCGACAA CGTCGCTCGA GCGGTTCAAA TGCGCACGAG AGATTACGTG
TTGAACGCTT TTCCTGCGGC GTGTGCAGAT ATATCCAAGA TGCTGCACGC CACGGACAAC
GTACAGTTCA TGCGCTCGTT ACTGTGCTTG CTCGGTGATA AGAGCGTACC ATCCGAAGTC
TTGGCGCGTC TCAGACCGAC GCTCCCCGGC ATGGGCGGCG CGCCGCAGCC GCCGGGGTTG
CCGGGAGCAC CGCGACCTCC TCAAGGCCCA CCTGTGATGC TTCAACAGCC AGCAAAGTCG
CTCAATGAAC ACACGTTGGA TATGCTCGGC AGACAAGTCG CGCCAGGAGC GTCGGCGGAG
GAGATTTTGG CGATGTTGAC GCGTCAACGA CAGGTGCAAG CGGAGGCGCA GCGAGACCAG
TCGCAACCGA GCGAGCAGCA GATGTTGTTG CTGCGACGGC AGCAAGAGCT TTTGCGCATG
GAACAAGCAA AAATCCAGCA GCACATGCAA CAGGGACAGC CGCCACCGCA GCCAGGTCGC
ACGACGCAGA TCCCGGTGGC GAGTTTGTTT GGGCAGCCGC AGCAACAGCC GCAGCAACAG
CGCGGCCTAC CGCCCGGTTT CGGTCCGACT TCGCAGCTGC AGATGCCACC ACCGCAGCCA
CAGATGCCGA TGGCAACGGC GCTACCTTCG TTTGGCGCAC CCCCTCCGGC CAACGGTGGC
TTTTCGAACG GCGGCCTCGG CGGCGGCGTC TTCTCGAGCA TCGCCTCCGG CGGCGGCGGC
TTATTTTTCG ACGCCCCGGC GCGCCAATCC AATCCGCCGC CGCCGACCGC GCACGCCGTC
GACGAAATCT CCCAGCATTT TGCCACCGGC ATGTCCATGT TCGGCTCAAA CGTCCACGCA
CCTCCACCCG ATCTCGGCGT CCGCTCGCCG CCCGATCTCG CCGCCGGCGG TTCGCCGCCG
GCGTCGCGCG TCGTCCCTCA GTCCGAGCTT CCTCGCACTC GCAGCGGCGT CGCCATCCCC
AAACCGCGCA ACGCGCGTTA G
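
The gene sequence is printed in blocks of ten bases, so it needs to be cleaned before its length or GC content can be compared with the table values. The following is a minimal sketch of one way to do that; the helper names are hypothetical, and the sample is only the first line of the sequence, so its GC percentage is not expected to equal the 61% reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, used here only as a short sample.
sample = "ATGATCAGCA CGCGGTTCGA GGGCGTGCGG GTGGCGCCGT TTGGGTCGTA CGTGAGCGCG"
seq = clean_sequence(sample)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Pasting the full gene sequence into `sample` should give a length that can be checked against the Gene Length field and a percentage that can be checked against the GC content field.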
 
Protein sequence
MISTRFEGVR VAPFGSYVSA FHSAGSDIDI SLQIDKNGPW YDEKEEAQAR RSQRGGVRAR 
RQQRQGRTKR AQLLRKVASE LRYRNYRDVQ LISKARVPLI KFKDPQTGVA CDVCIENDGV
YKSAVLGVVA DIDQRYRDLV FLIKLWAKHY DVNNAMEGSF NSYSLCLLCM HHLQRRPVPI
LPPTMLLTLP RPDLVESEKR ELEEHLKSED DQFDTWKVSK ARVVSDASRD IAAENTETLA
ELFVSFFAHL CAIKDLFRNA VNASTYHGTF IVGSSWQAFK YPLGVEDPFA AGDNVARAVQ
MRTRDYVLNA FPAACADISK MLHATDNVQF MRSLLCLLGD KSVPSEVLAR LRPTLPGMGG
APQPPGLPGA PRPPQGPPVM LQQPAKSLNE HTLDMLGRQV APGASAEEIL AMLTRQRQVQ
AEAQRDQSQP SEQQMLLLRR QQELLRMEQA KIQQHMQQGQ PPPQPGRTTQ IPVASLFGQP
QQQPQQQRGL PPGFGPTSQL QMPPPQPQMP MATALPSFGA PPPANGGFSN GGLGGGVFSS
IASGGGGLFF DAPARQSNPP PPTAHAVDEI SQHFATGMSM FGSNVHAPPP DLGVRSPPDL
AAGGSPPASR VVPQSELPRT RSGVAIPKPR NAR
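
The start of the protein sequence can be checked against the start of the gene sequence by translating codons. The sketch below assumes the standard genetic code, since the Translation table field is empty in this record, and its names are hypothetical. Because the record marks the gene as spliced, a straight translation of the whole gene sequence is not expected to match the protein beyond the first intron, so only the first 60 bases are compared here.

```python
# Standard genetic code (an assumption), laid out in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; unknown codons become 'X'."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[pos:pos + 3], "X") for pos in range(0, usable, 3)
    )


# First line of the gene sequence and first 20 residues of the protein.
dna_start = "ATGATCAGCA CGCGGTTCGA GGGCGTGCGG GTGGCGCCGT TTGGGTCGTA CGTGAGCGCG"
protein_start = "MISTRFEGVRVAPFGSYVSA"
print(translate(dna_start) == protein_start)  # True
```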