Gene Rsph17029_0684 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_0684 
Symbol 
ID4895093 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp688290 
End bp690149 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content69% 
IMG OID640111268 
Productcobaltochelatase 
Protein accessionYP_001042569 
Protein GI126461455 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4547] Cobalamin biosynthesis protein CobT (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole phosphoribosyltransferase) 
TIGRFAM ID[TIGR01651] cobaltochelatase, CobT subunit 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGC CCAGCGACAA CCCCGCCGAT CCGTTCAAGA AGGCCCTTGC CGAGGCGACG 
AAGACCCTCG CCTGCGATCC GGAGATGACG GTGACCTTCA CGGTCGATCC CGCGGGGATG
AACCGCGAGG GGATGCGGCT GCCGCAGGTC AGCCGCCGGA TGACGCGCGA CGAGGTGATG
CTCGCCCGCG GCACGGCCGA TGCCTTCGCG CTGCGCCGCC GCTATCACGA CGATGCCCTG
TCGGCCCGCT ACCTGCCGCA GGGCCAGATG GCGCGCGAGA TCTACGAGGC GATGGAGACG
GCGCGCTGCG AGGCGGTGGG CGCGCGCACC ATGCCCGGCA CCGCCGGCAA CATCGACGCG
CGGATCGGAC ACGAGGCCGA GCGGAAGGGC TATGCCCAGA TCACCCAGGC CGCGGATGCG
CCGCTCGCCA CGGCGGCGGG CTATCTCGTG CGCCATCTGG CGACAGGACG CACCCTGCCC
GGCGGCGCCG ACAATGTGAT GGAGCTCTGG CGCGGCTTCA TCGAGCAGCA GGCGGGCGAC
ACGCTTCAGA ACCTCGACGA GGTGCTGTCC GATCAGGCGG CCTTCGCCCG GCTCGCGCGC
AAGGTGATCT CGGATCTGGG CTACGGCGAC CAGCTCGGCG ACGATCCCGA TACCGACGAG
CAGGACGAGG CGGGCGAGGA CGCGGAGACC GACGAGGACG ACTCCCAGAG CCAGGGCGAA
GACGAGTCCG AGGAGTCGCC CGAGGCGCAA CCCGAGCGCA GCCAGGAAGA GCAGCAGGAT
CCGACCGAGG CGCAGGTCTC GATGGACGAC CAGCCCGATT CGGATCAGGG CGAGGAGATG
GAGCTGCCCG AGGGCGAGGC GCCGCTCGAG CCCCCGCCGC CTGCAGCCTA TTCCGAGGCC
GATCCGGCCT ATGTCGTCTT CGCCACCGAG TTCGACGAGG AGATCCGCGC CGAGGATCTG
GCCGAACCGG CCGAGCTCGA GCGCCTGCGG GCCTATCTCG ACCAGCAGCT CGAGCCGCTG
AAGGGCGCGG TGAGCCGGCT TGCGAACAAG CTGCAGCGAA GGCTTCAGGC GCAGCAGAAC
CGCAGCTGGG AATTCGACCG CGAGGAAGGC ATCCTCGATG CCGGCCGCCT CGCGCGGGTG
GTGGCCAACC CGACCACGCC GCTCTCGTTC AAGGTCGAGA AGGACACCGA ATTCCGCGAC
ACCTGCGTGA CGCTGCTGCT GGACAATTCC GGCTCGATGC GCGGCCGGCC GATCTCGATC
GCCGCGATCT GCGCCGATGT GCTCGCGCGC ACGCTGGAAC GCTGTCAGGT CAAGGTCGAG
ATCCTGGGCT TCACCACGCG GGCCTGGAAG GGCGGGCAGA GCCGCGAGAA ATGGCTGGCG
GCGGGCCGTC CGCAGCAGCC CGGGCGGCTC AACGACCTGC GCCACATCGT CTACAAGGGC
GCCGACGCGC CCTGGCGGCG GGCGCGCGAG AATCTCGGCC TGATGATGAA GGAAGGGCTC
CTCAAGGAGA ACATCGACGG CGAGGCGCTG GAATGGGCGC ATCGTCGGCT GAGCAGCCGG
GCCGAGACGC GCAAGATCCT CATGGTGATC TCGGACGGGG CGCCGGTGGA CGATTCCACC
CTGTCGGTGA ACCCGGCGAG CTATCTTGAA AAACATCTCC GCGACGTGAT CGCGATGGTC
GAGCGCAAGC GGCAGGTGGA GCTCATCGCC ATCGGCATCG GCCATGACGT GACGCGCTAC
TATGCGCGCG CCGTGACGAT CACCGACGTG GAGCAGCTGG CCGGCGCCAT GACCGAGCAG
CTGGCATCGC TTTTCGACGC CGATCCGAAA AAGCGCATCG GCAAACGGCG GGTGGCCTGA
 
Protein sequence
MSKPSDNPAD PFKKALAEAT KTLACDPEMT VTFTVDPAGM NREGMRLPQV SRRMTRDEVM 
LARGTADAFA LRRRYHDDAL SARYLPQGQM AREIYEAMET ARCEAVGART MPGTAGNIDA
RIGHEAERKG YAQITQAADA PLATAAGYLV RHLATGRTLP GGADNVMELW RGFIEQQAGD
TLQNLDEVLS DQAAFARLAR KVISDLGYGD QLGDDPDTDE QDEAGEDAET DEDDSQSQGE
DESEESPEAQ PERSQEEQQD PTEAQVSMDD QPDSDQGEEM ELPEGEAPLE PPPPAAYSEA
DPAYVVFATE FDEEIRAEDL AEPAELERLR AYLDQQLEPL KGAVSRLANK LQRRLQAQQN
RSWEFDREEG ILDAGRLARV VANPTTPLSF KVEKDTEFRD TCVTLLLDNS GSMRGRPISI
AAICADVLAR TLERCQVKVE ILGFTTRAWK GGQSREKWLA AGRPQQPGRL NDLRHIVYKG
ADAPWRRARE NLGLMMKEGL LKENIDGEAL EWAHRRLSSR AETRKILMVI SDGAPVDDST
LSVNPASYLE KHLRDVIAMV ERKRQVELIA IGIGHDVTRY YARAVTITDV EQLAGAMTEQ
LASLFDADPK KRIGKRRVA