Gene Information |
Locus tag | P9303_23501 |
Symbol | cobJ |
ID | 4778144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2068804 |
End bp | 2070588 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640087871 |
Product | bifunctional cbiH protein and precorrin-3B C17-methyltransferase |
Protein accession | YP_001018350 |
Protein GI | 124024043 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1010] Precorrin-3B methylase [COG2073] Cobalamin biosynthesis protein CbiG |
TIGRFAM ID | [TIGR01466] precorrin-3B C17-methyltransferase |
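The coordinate and length fields above can be cross-checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2068804, 2070588)
print(length_bp)                  # 1785
print(protein_length(length_bp))  # 594
```

Under these assumptions the results match the Gene Length (1785 bp) and Protein Length (594 aa) fields.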
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.659645 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCAAAGGC TTGAAAGACG CCGCCAGGTG GATCAACTGG CCCTTACCCC GCGAGCAGCA AACACTCTCG ACAAAACACC TCAAAGCCTT GTAGTTGGCC CCGCCAGCGA AGTGCTGAGC AGGAACTGGC AGCCAAACGG GGTCTTGCTC ATCGTGGGTG CTATCGGAGC AGTGACTCGC TTAATCGCTC CATTACTCAT CGGGAAGGAC AAAGATCCCG CCGTCATCGT CCTTGATGCC CATGGCAAAC ACGTCGTGCC ATTGCTGGGC GGTCACCAGG CGGGCGCAGA ACAACTTGCA ATGGAACTGG CCGCAGAGCT AGGAGGTCAT GCCGTACTCA CCGGAGACGC CAACAGCCAA AACCGCTTGG CCCTCGATAG TTTTGGCGAA GGCTGGGGCT GGCGACGTAG CGGCAGCCGC GACAACTGGA ATCAGTTGAT GCTGACCCAA GCCCAAGGTG GGAAGCTTTT GCTTCTGCAG TGTTCTGGCG CGACGTTTTG GCAAACAACT GCTGCTGCAC AAGAAGTTTT TGATGCCAAT GCCAATGCCA ATGGTGATAG AGCTCGGCCA GCCCAGCTGA ACATCGGCCC ACAAAGCAGC ATGTCTTGCA GCTGGCATCC CGCCAGTCTA TGGATTGGAA TTGGCTGCGA ACGCGATACA AGCCTGAGCC TGCTTGAGCG CGCCGTGGCA GCTGCCCTGG CAGAGGCAGG CCTTGCCCAA GAGGCAGTAG CAGGCCTTGC CAGCATTGAC CGAAAGGCGA ATGAAACCGC TCTGCTCGCT CTCGCTCAAG CAAAGGACTG GCCAGTGCGC TTCTTCAACG CTGATGCCCT CGCAGAAGTT GAAGTACCCA CCCCTTCTGC TGCGGTAGCA AAAGCTATGG GCACCTATTC CGTGGCCGAA GCAGCAGCAC TACTCGCAGC TAGCGACAAA GGCACTTTGC TGCAACCCAA ACAGATTCAC CACGCCCAAA ACGCTGAACA TGGTGCTGCC ACTATCGCCA TCAGCGAAGC CAATCAACCT TTTGCACCCC AAAGAGGCGA ATTGCACCTG ATCGGAAGCG GTCCTGGAGA TTTGGCCTTT CTCACCCACG ATGCCCGAGC AGCCCTAGCC CGCAGTGCCA TATGGGTTGG ATACGGCCTC TACCTCGATC TACTGGAACC GTTGCGGCGC CCAGACCAAG CCCGTCTGGA CGGCCAACTC ACTCGCGAAC GGGATCGCTG CCTTAAGGCC CTGGAGCTAG CCGAACAGGG AGCACGAGTC GCCTTGATCT CTTCAGGGGA CAGTGGCATC TACGGCATGG CCGGCCTAGC CCTGGAACTG TGGCTCACAA AAACCCCTAG TGATCGGCCA GACTTTCAGG TGCATCCTGG CCTCTCAGCA CTGCAATTAG CTGCCGCCAA GGTAGGTGCC CCTCTGATGC ACGATTTCTG CAGCGTGAGT CTCAGTGATC GCCTCACTCC CTGGTCAAAG ATTGAAACCC GCCTGAAAAG TGCAGCCAGC GGAGACTTCG TGGTAGCCCT TTACAACCCA CGCTCTCAAG AACGCGACTG GCAGCTGACC CGTGCCCTTG AACTCTTGCT GGAGCATCGA GCCCCCAGCA CGCCAGTGGT CGTAGCACGT CAACTTGGAC GAGCCCAGGA AGACATACAG CTCCACACAC TTCAGACGAT GCCAGTCAAA GAGGTGGACA TGCTCACCAT TGTGCTAATT GGCAATAGCA GCAGTCGTTT ACAAGACAAT CATGTGGTGA CACCAAGGGG ATACCCCGGA GCCGAACTCG CTTGA
|
Protein sequence | MQRLERRRQV DQLALTPRAA NTLDKTPQSL VVGPASEVLS RNWQPNGVLL IVGAIGAVTR LIAPLLIGKD KDPAVIVLDA HGKHVVPLLG GHQAGAEQLA MELAAELGGH AVLTGDANSQ NRLALDSFGE GWGWRRSGSR DNWNQLMLTQ AQGGKLLLLQ CSGATFWQTT AAAQEVFDAN ANANGDRARP AQLNIGPQSS MSCSWHPASL WIGIGCERDT SLSLLERAVA AALAEAGLAQ EAVAGLASID RKANETALLA LAQAKDWPVR FFNADALAEV EVPTPSAAVA KAMGTYSVAE AAALLAASDK GTLLQPKQIH HAQNAEHGAA TIAISEANQP FAPQRGELHL IGSGPGDLAF LTHDARAALA RSAIWVGYGL YLDLLEPLRR PDQARLDGQL TRERDRCLKA LELAEQGARV ALISSGDSGI YGMAGLALEL WLTKTPSDRP DFQVHPGLSA LQLAAAKVGA PLMHDFCSVS LSDRLTPWSK IETRLKSAAS GDFVVALYNP RSQERDWQLT RALELLLEHR APSTPVVVAR QLGRAQEDIQ LHTLQTMPVK EVDMLTIVLI GNSSSRLQDN HVVTPRGYPG AELA
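The sequences above are printed in space-separated blocks of 10 characters, so the spaces have to be stripped before the Gene Length or GC content fields can be recomputed. The following is a minimal sketch of that step, with hypothetical helper names; it is run here on only the first two blocks of the gene sequence, so the printed GC value describes those 20 bases and not the whole gene. Note that the gene begins with TTG rather than ATG; translation table 11 allows TTG as an initiation codon, which is consistent with the protein starting with M.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Only the first two blocks of the gene sequence, to keep the example short.
first_blocks = "TTGCAAAGGC TTGAAAGACG"
seq = clean_sequence(first_blocks)
print(len(seq))         # 20
print(seq[:3])          # TTG
print(gc_percent(seq))  # 45.0
```

Applying the same two functions to the full Gene sequence field would give values to compare against the Gene Length and GC content fields.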