Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0543 |
Symbol | |
ID | 3909582 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 607462 |
End bp | 609357 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637882431 |
Product | cobalt chelatase, pCobT subunit |
Protein accession | YP_484165 |
Protein GI | 86747669 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4547] Cobalamin biosynthesis protein CobT (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole phosphoribosyltransferase) |
TIGRFAM ID | [TIGR01651] cobaltochelatase, CobT subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.23003 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACAT CCAACACCAA ATTCCGCACC GGATCGAAGG AAGCGCCGAC CGAGCCGTTC AAGCGGGCGG TGACGTCGTG CCTGCGGGCG ATCGCCAAGA CGCCGGAGCT CGAAGTCAGC TTCGCGGCGG AGCGCCCCGG CCTGTCGCCC GGCAAGGCGC GGCTGCCGGA GCCGGCGCGA AAGATGAGCA AGCGCGACGC CGCGATCGTG CGCGGCCATG CCGATTCGAT CGCGCTGAAG CTCGCTTGCC ACGATCCGAA AGTGCATCGC AAGCTGATGC CGGGCAATCC GCAAGCGCGC GGCGTATTCG AAGCCGTCGA GCAGGCCCGC GTCGAAGCGC TCGGCGCTCG GCGGATGGCG GGGGTCGCCA AGAACCTCAC CGCGATGCTC GACGATCACT TCCACCGCGG TAAGTTCGAC GAGATCACCG ACCGTGCCGA CGCGCCGTTG TCGGACGCGC TGGCGATGCT GGTGCGCGAG CGCCTCACCG GGCTGGCGCC GCCGGCGGCG GCGAAGAAGC TGGTCGACCT GTGGCGGCCG GTGCTGGAAG ACAAGATCGG GCCGAAGCTC GACGAGCTCG AACGCTTCGC CGAGAACCAG GCGAAGTTCG GCGACGCCCT CCACGACCTG CTCGACGTGC TCGAACTCGG CGACGATCGC GACGCCGATT CGGACGAGGA AGAGAACCAG GACGACAAGC AGGAAGGCGA GAACGATCAG TCCGGCGCCG AAGGCACGCC GGAGAGCGAA GCCGCCCAGG AGATGAGCGC CGACCAGGCC GAGGCGACCA GCGACGACCT CAGCGACAGC GCGATGGAAA GCGCGCAGGC TTCGGCCTCC GATGCCTTCG ACGATTCCGA GACCGGCGAG GACGACACGC CGGGCGAGGC GACGCGGCCG AACAATCGCG GCGCCAACGA GCCGCGCGGG CCGGAATATC ACGCCTTCGC GCCGAAATTC GACGAGGTCG TCGCCGCCGA GGACCTGTGC GACCACGACG AGCTGGAGCG GCTGCGCAGC TATCTCGACA AGCAGCTCGC GCATCTGCAG GGCATCGTCG CGCGGCTCGC CAACCGGCTG CAGCGCCGCC TGATGGCGCA GCAGAACCGC GCCTGGGATT TCGACCTCGA GGAAGGCATT CTCGATCCGG CCCGGCTGTC GCGTGTCGTC ACCGACCCGT TCCATCCGCT GTCGTTCATG AGCGAGAAGG AAGCCACCTT CCGCGACACC GTGGTGACGC TGCTGCTGGA CAATTCCGGC TCGATGCGCG GCCGCCCGAT CACCGTGGCG GCGACCTGCG CCGACATTCT GGCGCGGACG CTGGAGCGCT GCGGCGTCAA GGTCGAGATT CTCGGCTTCA CGACGCGCGC CTGGAAGGGC GGTCAATCGC GTGAGGCGTG GCTCGCGGCC GGCAAGCCGG CGTCGCCGGG CCGGCTCAAC GATCTGCGCC ACATCATCTA CAAATCCGCC GACGCGCCGT GGCGCCGGGC CCGCAGGAAT CTCGGGCTGA TGATGCGCGA GGGGCTGCTC AAGGAGAACA TCGACGGCGA GGCGCTGGAC TGGGCGCACA AGCGGCTGCT CGGCAGAAGC GAACAGCGCA AGATCCTGAT GATGATTTCT GACGGCGCGC CGGTCGACGA TTCGACGCTG TCGGTCAATC CCGGCAATTA TCTCGAGCGG CATCTGCGCC ACATCATCGA GGAGATCGAG ACCCGCTCGC CGGTCGAGCT GATCGCGATC GGCATCGGCC ACGACGTCAC CCGCTATTAT CGTCGCGCGG TGACCATCGT CGACGCCGAG GAGCTCGGCG GCGCGATCAC CGAGAAGCTC GCCGAACTGT TCAGCGAAAC CGCGGACGTG CCCGCGCCGG GTCGCCGTCG TCGGCTGCAT TCGTGA
|
Protein sequence | MTTSNTKFRT GSKEAPTEPF KRAVTSCLRA IAKTPELEVS FAAERPGLSP GKARLPEPAR KMSKRDAAIV RGHADSIALK LACHDPKVHR KLMPGNPQAR GVFEAVEQAR VEALGARRMA GVAKNLTAML DDHFHRGKFD EITDRADAPL SDALAMLVRE RLTGLAPPAA AKKLVDLWRP VLEDKIGPKL DELERFAENQ AKFGDALHDL LDVLELGDDR DADSDEEENQ DDKQEGENDQ SGAEGTPESE AAQEMSADQA EATSDDLSDS AMESAQASAS DAFDDSETGE DDTPGEATRP NNRGANEPRG PEYHAFAPKF DEVVAAEDLC DHDELERLRS YLDKQLAHLQ GIVARLANRL QRRLMAQQNR AWDFDLEEGI LDPARLSRVV TDPFHPLSFM SEKEATFRDT VVTLLLDNSG SMRGRPITVA ATCADILART LERCGVKVEI LGFTTRAWKG GQSREAWLAA GKPASPGRLN DLRHIIYKSA DAPWRRARRN LGLMMREGLL KENIDGEALD WAHKRLLGRS EQRKILMMIS DGAPVDDSTL SVNPGNYLER HLRHIIEEIE TRSPVELIAI GIGHDVTRYY RRAVTIVDAE ELGGAITEKL AELFSETADV PAPGRRRRLH S
|
| |