Gene P9303_23501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_23501 
SymbolcobJ 
ID4778144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2068804 
End bp2070588 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content58% 
IMG OID640087871 
Productbifunctional cbiH protein and precorrin-3B C17-methyltransferase 
Protein accessionYP_001018350 
Protein GI124024043 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1010] Precorrin-3B methylase
[COG2073] Cobalamin biosynthesis protein CbiG 
TIGRFAM ID[TIGR01466] precorrin-3B C17-methyltransferase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.659645 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCAAAGGC TTGAAAGACG CCGCCAGGTG GATCAACTGG CCCTTACCCC GCGAGCAGCA 
AACACTCTCG ACAAAACACC TCAAAGCCTT GTAGTTGGCC CCGCCAGCGA AGTGCTGAGC
AGGAACTGGC AGCCAAACGG GGTCTTGCTC ATCGTGGGTG CTATCGGAGC AGTGACTCGC
TTAATCGCTC CATTACTCAT CGGGAAGGAC AAAGATCCCG CCGTCATCGT CCTTGATGCC
CATGGCAAAC ACGTCGTGCC ATTGCTGGGC GGTCACCAGG CGGGCGCAGA ACAACTTGCA
ATGGAACTGG CCGCAGAGCT AGGAGGTCAT GCCGTACTCA CCGGAGACGC CAACAGCCAA
AACCGCTTGG CCCTCGATAG TTTTGGCGAA GGCTGGGGCT GGCGACGTAG CGGCAGCCGC
GACAACTGGA ATCAGTTGAT GCTGACCCAA GCCCAAGGTG GGAAGCTTTT GCTTCTGCAG
TGTTCTGGCG CGACGTTTTG GCAAACAACT GCTGCTGCAC AAGAAGTTTT TGATGCCAAT
GCCAATGCCA ATGGTGATAG AGCTCGGCCA GCCCAGCTGA ACATCGGCCC ACAAAGCAGC
ATGTCTTGCA GCTGGCATCC CGCCAGTCTA TGGATTGGAA TTGGCTGCGA ACGCGATACA
AGCCTGAGCC TGCTTGAGCG CGCCGTGGCA GCTGCCCTGG CAGAGGCAGG CCTTGCCCAA
GAGGCAGTAG CAGGCCTTGC CAGCATTGAC CGAAAGGCGA ATGAAACCGC TCTGCTCGCT
CTCGCTCAAG CAAAGGACTG GCCAGTGCGC TTCTTCAACG CTGATGCCCT CGCAGAAGTT
GAAGTACCCA CCCCTTCTGC TGCGGTAGCA AAAGCTATGG GCACCTATTC CGTGGCCGAA
GCAGCAGCAC TACTCGCAGC TAGCGACAAA GGCACTTTGC TGCAACCCAA ACAGATTCAC
CACGCCCAAA ACGCTGAACA TGGTGCTGCC ACTATCGCCA TCAGCGAAGC CAATCAACCT
TTTGCACCCC AAAGAGGCGA ATTGCACCTG ATCGGAAGCG GTCCTGGAGA TTTGGCCTTT
CTCACCCACG ATGCCCGAGC AGCCCTAGCC CGCAGTGCCA TATGGGTTGG ATACGGCCTC
TACCTCGATC TACTGGAACC GTTGCGGCGC CCAGACCAAG CCCGTCTGGA CGGCCAACTC
ACTCGCGAAC GGGATCGCTG CCTTAAGGCC CTGGAGCTAG CCGAACAGGG AGCACGAGTC
GCCTTGATCT CTTCAGGGGA CAGTGGCATC TACGGCATGG CCGGCCTAGC CCTGGAACTG
TGGCTCACAA AAACCCCTAG TGATCGGCCA GACTTTCAGG TGCATCCTGG CCTCTCAGCA
CTGCAATTAG CTGCCGCCAA GGTAGGTGCC CCTCTGATGC ACGATTTCTG CAGCGTGAGT
CTCAGTGATC GCCTCACTCC CTGGTCAAAG ATTGAAACCC GCCTGAAAAG TGCAGCCAGC
GGAGACTTCG TGGTAGCCCT TTACAACCCA CGCTCTCAAG AACGCGACTG GCAGCTGACC
CGTGCCCTTG AACTCTTGCT GGAGCATCGA GCCCCCAGCA CGCCAGTGGT CGTAGCACGT
CAACTTGGAC GAGCCCAGGA AGACATACAG CTCCACACAC TTCAGACGAT GCCAGTCAAA
GAGGTGGACA TGCTCACCAT TGTGCTAATT GGCAATAGCA GCAGTCGTTT ACAAGACAAT
CATGTGGTGA CACCAAGGGG ATACCCCGGA GCCGAACTCG CTTGA
 
Protein sequence
MQRLERRRQV DQLALTPRAA NTLDKTPQSL VVGPASEVLS RNWQPNGVLL IVGAIGAVTR 
LIAPLLIGKD KDPAVIVLDA HGKHVVPLLG GHQAGAEQLA MELAAELGGH AVLTGDANSQ
NRLALDSFGE GWGWRRSGSR DNWNQLMLTQ AQGGKLLLLQ CSGATFWQTT AAAQEVFDAN
ANANGDRARP AQLNIGPQSS MSCSWHPASL WIGIGCERDT SLSLLERAVA AALAEAGLAQ
EAVAGLASID RKANETALLA LAQAKDWPVR FFNADALAEV EVPTPSAAVA KAMGTYSVAE
AAALLAASDK GTLLQPKQIH HAQNAEHGAA TIAISEANQP FAPQRGELHL IGSGPGDLAF
LTHDARAALA RSAIWVGYGL YLDLLEPLRR PDQARLDGQL TRERDRCLKA LELAEQGARV
ALISSGDSGI YGMAGLALEL WLTKTPSDRP DFQVHPGLSA LQLAAAKVGA PLMHDFCSVS
LSDRLTPWSK IETRLKSAAS GDFVVALYNP RSQERDWQLT RALELLLEHR APSTPVVVAR
QLGRAQEDIQ LHTLQTMPVK EVDMLTIVLI GNSSSRLQDN HVVTPRGYPG AELA