Gene P9303_26481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_26481 
SymbolholB 
ID4776761 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2336713 
End bp2337693 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content56% 
IMG OID640088171 
ProductDNA polymerase III subunit delta' 
Protein accessionYP_001018643 
Protein GI124024336 
COG category[L] Replication, recombination and repair 
COG ID[COG2812] DNA polymerase III, gamma/tau subunits 
TIGRFAM ID[TIGR00678] DNA polymerase III, delta' subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.937616 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTTG AGATTGAGAC GCGTGGGCTG TTTGAAGATT TGATCGCTCA GCCGTTGGCA 
GTTGCATTGC TGGAGGCAGC GTTGAGTCAG GGTCGACTGG CTCCGGCATA TCTCTTTTCT
GGTCCAGATG GAGTGGGTCG CAGTCTGGCG GCATTGCGTT TTCTCGAGGG CGTCATCAGC
AGTGGTAAGC CGGCCCTGCG TGAACGCCGT CGTCTAGAGG CGTTCAACCA TCCTGATCTG
CTTTGGGTCG AACCGACTTA TCAGCATCAA GGCCGCCTGG TCCCTAAGTC CCAGGCAGAG
GAAGAAGGCG TGAGCCGCCG TTCACCCCCA CAGGTGCGCC TCGAACAGAT TCGGGGGGTG
ACAAGATTCC TAGGACGACG ACCTGTTGAG GCCCCAAGGG GCATGGTGGT GATTGAAGCG
GCGGACTCTA TGCCCGAAGC TGCTGCCAAT GCTTTGCTTA AAACGCTGGA AGAACCAGGT
CATGGCTTGT TGATTCTGCT TTCGGCCGCT TCTGAGCGCT TGCTCACGAC AATTCGTTCT
CGATGCCAAC AGATTCTTTT TGCTCGTCTT GAAGGAGCAG ACATGCAGAC TGTTCTGGCC
AGGACGAGCA CAGCGGAGAT GCGGTCGTCC TTGTTGGCAT TGGATCAGCC GGAACTGGTT
GCGATGGCCG CTGGTTCTCC TGGGGCGCTG TTGCAGCATT TTTGGCTCTG GCAGGCAGTT
CCAGAGGAGT TTTGGCTACG CCTTGAAGAG CGTCCCCAAA AGCCAATGGA AGCCTTAGCT
CTGGCGAGGG ACCTTACTGA GGCTCTTAAT GGGGAGCAGC AGCTTTGGCT GATTGATTGG
TGGCAACAAC ATTTTTGGAT CCAACGGCCT GATCCAAGAC CACTGAAACG TTTGGAAAGG
CTGCGTTCAC ACCTGCTGGG GTTTGTGCAA CCCAGGCTGG CCTGGGAAGT GGCTTTGCTG
GAACTGATTC CTTGCGTTTA A
 
Protein sequence
MSLEIETRGL FEDLIAQPLA VALLEAALSQ GRLAPAYLFS GPDGVGRSLA ALRFLEGVIS 
SGKPALRERR RLEAFNHPDL LWVEPTYQHQ GRLVPKSQAE EEGVSRRSPP QVRLEQIRGV
TRFLGRRPVE APRGMVVIEA ADSMPEAAAN ALLKTLEEPG HGLLILLSAA SERLLTTIRS
RCQQILFARL EGADMQTVLA RTSTAEMRSS LLALDQPELV AMAAGSPGAL LQHFWLWQAV
PEEFWLRLEE RPQKPMEALA LARDLTEALN GEQQLWLIDW WQQHFWIQRP DPRPLKRLER
LRSHLLGFVQ PRLAWEVALL ELIPCV