Gene OSTLU_31476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31476 
Symbol 
ID5002058 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp21117 
End bp23721 
Gene Length2605 bp 
Protein Length781 aa 
Translation table 
GC content59% 
IMG OID640417479 
Productpredicted protein 
Protein accessionXP_001417645 
Protein GI145346336 
COG category[L] Replication, recombination and repair 
COG ID[COG5260] DNA polymerase sigma 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGTGCCCGAG GCCGGCGGCG ACGCTCGCGA GCGCGCGCGA ACGCGCACGA ACGCTGCGTA 
AGAAATCGTC TCGAGCGTTG CGTCGCGCGC ACCGAGCGTC GCGCGCGGGC ACGCGCGCGC
GCACCGCGAA CGCCGCGAAC GTGGACCCGC GCGCCCATCG GGTGGCGTTC GAGCGCGCGG
GAACTGTGAT ATCATGTCGC ACGCGCGCGC GAGCGACGGC GCGAACGGGA CGTCGACGCT
CGCGAAGGAT GCAGATGAAA TCCACGCATC GAACGGTACC GACGCACGCG CGACGTCAAG
CGCGATCGAG CGTACGACGG GCGATGGGCG CGAGGTTGGA CACGAGAGCT GGGAGGTTGT
GAGCGGGAAG AAATCTTCAA AAACGTCTGG TAAAACGAGC GCGGAAAGCG CGGGAACGAG
CGAGGATGGG AAGAAACGAA GAAATAGTGG AAGCGGCTCT CGCGCTGGGG GGGCGCAGAG
GGCGGATAAG GTGAACGCGG GTAAAGGAAG TCCACCAGAC GTTCAAGCGG CGAGCGGCGA
GGCCTCGAGC GGCGGCGCCG AACCGTCGGG ATGGGCCGCC ATCGTCACCG GAAGACTTAG
CCCGAGCGAA GTGCCGGCGT CGTCGGTGGT GTCAAGTGAG ACCACGGGCA CACCAGCCGT
TGAGCGCCTC AACTTGTCGC GAGAGAAGGA TGAGGCGATG CGCGAAGTCG CGGCGCAAGA
CGACTCTACA GATGGTTCGC CGCGAACCGC GCGCGAGACG CACATCATAG ACGGAGCTCG
CGAAAACGCA CAAGCTGTCG GTGCTGGTGC GAAGGCATTA CGTGGCTGGG CTTCTATTTT
AGGTGGCAGT AGCGGATTCC CCGAGGAAAA TCCTCGAAGC AAGTCGCCGA ACGCGACCCA
AGAGACGGCG GTAGCGGCAA CGGAGGTGGC GCATGAGCCG ACTAAGGCCA CGGCGGCGCC
TAGCGCTCCG GCGTGGGGTG GATGGGGAGC TCAAGCACCG CCAAAGGTTG ACTTGAAGGC
GACGATGGAA GAGGACGCTG CGGCGGCTGC GGCCGCTCCT CCTCCTCCTC CGCCTTCGAA
GAAGGAGTCG AAAAAGACGG AGGACAAGCC AGGCAAACAA AATGGGCGCA AGAAGGATTC
TAAGAAGGAC AAGTCTTCTA GCAAGCAACG TCAGCAAGAT ACGTATCGCA AGGATAACAA
TAAACCGGTC CGAAGCTCGG TGCCGGTGAC CAAGGTTTCA GAACCATCCG TCGACGCGTC
TGCTGACCGT CCCAGAGCCC CTATGACGTC TGAAGTTGAA GATTTGCTCG TCGCAAAACT
GGGAAGTGAG CTGCTTGACT GCGCGCACAT GCAATTTTTC ACTCCCACGG CGCATTTGCT
GAAGAAGGAG AGACGTGCGG TGGACATCGT CATTCGAGCC ATGAGCGCCA TCGTTCAGAC
GCTGTTTCCG GCGACCGGTC TCGATGTTTT TGGTTCGTAC CCTACGCGCG CATGGGTGCC
AGGGTCGAGT AATTTGGACT TGTCGCTGGA TCTCCCTCCC GAGGCGATGA TCGCTTCCCA
TCCGGAGCGG CGAATGGAGG CTTTAAATAC GCTCGCCATG GCTTTGCGTA CGAATCCTTG
GGTGCTCGAC GTCACCGTCG TCCCGAGCTC GCACAGACCT TTGCTGCGCA TGACGACGCA
TACTGCGTTC TTTCAGGCAA TGCCGCAGCA ACTGCCGAGT AAAACATCGG CGTCGATTGC
GGCAGCTATC GCCGCCGTCA CACCGCCGCC GGTAACCTCT GGGGACGGAA CGCCGCCTCT
GCCACCTGGC CCGCGTCCCG GCGCGCCGCC ATTCGGTATT CCAGGCTTGG GTCAAAACGG
TCTAGGCTTG CCGCTCGAAG TACACATTTC TCTGAAAGAT GCGAATCACA AGGGGCTGTC
GTCGATGCAA TTTGTGCAGG CTGCAGAAGA GCAACACGGC GCTTTGGCTC CGTTGGTGTG
CGTGCAAAAG GCAGTTTTAG CGAGTAAAGG TTTGCGAGGC GTCTACCGCG GTGGTCTGGG
AAGCTACGCG CTCACTTTGA TGGCCCTCAC CGCGATTCAA CTTCGCAATA GCCAAGAGAG
CGAAACGGAG GATGAAACCG TCGTGAGAGT GAGCACGTCC AAGGACGAGG CAAAAAAGTC
GGACGAAGAC GAGGCCAATT CGCGAGACGC CCTCATTCTC GGCCGAGCCA TGTTGAACTT
CCTCAAACTG TACGGATTCG AAACGGATCT TTCCAAGGAC ATTATTTCAG TGCACAGCGG
TGGTGACGGC GTATGGGGCG TTCTTTCAGA AGCCGCTCAA TTTCCGGCGC CGCTCGGTTC
TGGTTTACGC GTCAAGGATC CATTAGATGG ATCAAATAAT GCTGGAGCGG GATGTTTCGG
CATTGCGGGT GTGCAAGCAG TATTCAGAGA GCAGTTGGAA ACGCTCCGAA AGGCTGCGGA
GAATGGCTTT GAGAGTAATG TCCCACTCTT GATGCAGCTC TTCACGCTAG GTGGATCGCA
AAAAGTCTTT GTCGTTTGAG TAGCACGTTG ATTGAAAAAC ACGGCGCGCG AATAATGCAA
TATTATTACA TGAACAGCTA AAATC
 
Protein sequence
MSHARASDGA NGTSTLAKDA DEIHASNGTD ARATSSAIER TTGDGREVGH ESWEVVSGKK 
SSKTSGKTSA ESAGTSEDGK KRRNSGSGSR AGGAQRADKV NAGKGSPPDV QAASGEASSG
GAEPSGWAAI VTGRLSPSEV PASSVVSSET TGTPAVERLN LSREKDEAMR EVAAQDDSTD
GSPRTARETH IIDGARENAQ AVGAGAKALR GWASILGGSS GFPEENPRSK SPNATQETAV
AATEVAHEPT KATAAPSAPA WGGWGAQAPP KVDLKATMEE DAAAAAAAPP PPPPSKKESK
KTEDKPGKQN GRKKDSKKDK SSSKQRQQDT YRKDNNKPVR SSVPVTKVSE PSVDASADRP
RAPMTSEVED LLVAKLGSEL LDCAHMQFFT PTAHLLKKER RAVDIVIRAM SAIVQTLFPA
TGLDVFGSYP TRAWVPGSSN LDLSLDLPPE AMIASHPERR MEALNTLAMA LRTNPWVLDV
TVVPSSHRPL LRMTTHTAFF QAMPQQLPSK TSASIAAAIA AVTPPPVTSG DGTPPLPPGP
RPGAPPFGIP GLGQNGLGLP LEVHISLKDA NHKGLSSMQF VQAAEEQHGA LAPLVCVQKA
VLASKGLRGV YRGGLGSYAL TLMALTAIQL RNSQESETED ETVVRVSTSK DEAKKSDEDE
ANSRDALILG RAMLNFLKLY GFETDLSKDI ISVHSGGDGV WGVLSEAAQF PAPLGSGLRV
KDPLDGSNNA GAGCFGIAGV QAVFREQLET LRKAAENGFE SNVPLLMQLF TLGGSQKVFV
V