Gene Rsph17025_3073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_3073 
Symbol 
ID5083160 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp3143641 
End bp3145500 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content69% 
IMG OID640484645 
Productcobaltochelatase 
Protein accessionYP_001169262 
Protein GI146279103 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4547] Cobalamin biosynthesis protein CobT (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole phosphoribosyltransferase) 
TIGRFAM ID[TIGR01651] cobaltochelatase, CobT subunit 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGC CCACCGACAA TCCCGCCGAT CCGTTCAAGA AGGCCCTCGC CGAGGCGACC 
AAGACGCTCG CCGACGACCC GGACATGACG GTGACCTTCT CGGTCGATCC CGCGGGGATG
AACCGCGAGG GGATGCGCCT GCCGCAGGTC AGCCGCCGGA TGACGCGCGA CGAGGTGATG
CTCGCCCGCG GCACGGCCGA CGCCTTCGCG CTGCGCCGCC GGTATCATGA TGATGCAATC
TCGGCCCGCT ACCTGCCGCA GGGCCAGATG GCGCGCGAGA TCTACGAGGC GATGGAGGCG
GCCCGTTGCG AGGCGGTGGG CGCCCGCACC ATGCCGGGCA CCGCGGGCAA CATCGACGCC
CGGATCGGCC ACGAGGCCGA GCGCAAGGGC TATGGCCAGA TCACGCAGGC TGCGGATGCG
CCGCTCGCCA CGGCGGCGGG CTATCTCGTG CGCCATCTGG CGACGGGGCG GACCCTGCCC
GGCGGCGCCG ACAATGTGAT GGAGCTGTGG CGCGGCTTCA TCGAGCAGCA GGCGGGCGGG
ACGCTCCAGA ACCTCGACGA GGTGCTGGCC GATCAGGCGG CCTTTGCGCG ACTGGCCCGC
AAGGTGATCT CCGATCTGGG CTACGGCGAC CAACTCGGCG ACGATCCGGA CACGGACGAG
CAGGACGACG CGGGCGAGGA CGCGGAGACC GACGAGGACG AGTCGCAGAG CCAGGGCGAG
GACGACTCCG ACGAGTCGCA GGAAGCGCAG CCCGAGCGCA GCCAGGAGGA ACAGCAGGAT
CCCAGTCAGG CGCAGGTCTC GATGGAGGAC CAGCCCGACG CCGATCAGGG CGAAGAGGTG
GAGATGCCCG ACGGCGAGGC GCCGCTCGAG CCGCCTCCCC CTGCCCCGCA CTCCGAGGCC
GACCCCGCCT ATACGGTCTT CGCCACCGAC TTCGACGAGG AGATCCGCGC CGAGGATCTG
GCCGAGCCCG CCGAACTCGA GCGGCTGCGC GCCTATCTCG ACCAGCAGCT CGAACCGCTG
AAGGGCGCGG TCAGCCGCCT TGCCAACAAG CTCCAGCGCC GGTTGCAGGC GCAGCAGAAC
CGCAGCTGGG AGTTCGACCG CGAGGAGGGC ATCCTCGACG CCGGCCGCCT TGCGCGGGTG
GTGGCCAACC CCACCACGCC GCTCTCCTTC AAGGTCGAGA AGGACACCGA GTTCCGCGAC
ACCTGCGTGA CGCTGCTGCT GGACAACTCG GGCTCGATGC GCGGCCGGCC GATCTCGATC
GCCGCGATCT GCGCCGATGT GCTGGCGCGG ACGCTGGAAC GCTGTCAGGT CAAGGTCGAG
ATCCTCGGCT TCACCACCCG CGCCTGGAAG GGCGGGCAGA GCCGCGAGAA GTGGCTCGCC
GCCGCAAGGC CGCAGCAGCC CGGTCGGCTC AACGATCTGC GCCACATCAT CTACAAGGGC
GCCGACGCGC CCTGGCGGCG CGCGCGGGAA AATCTCGGAT TGATGATGAA GGAAGGGCTC
CTCAAGGAGA ACATCGACGG CGAGGCGCTG GAATGGGCGC ACCGCCGCCT GAGCCACCGC
CCCGAGGCGC GCAAGATCCT GATGGTGATC TCGGACGGGG CGCCGGTGGA TGATTCCACC
CTGTCGGTGA ACCCGGCGAG CTATCTTGAA AAACACCTCC GCGACGTGAT CTCGATGGTC
GAGCGCAAGC GGCAGGTAGA GCTTATCGCC ATCGGCATCG GCCATGACGT GACGCGCTAC
TATGCCCACG CCGTCACGAT CACGGATGTG GAGCAACTGG CGGGCGCCAT GACCGAGCAG
CTCGCCTCCC TTTTCGACGC CGATCCGAAG AAGCGCATCG GCAAACGGCG GGTGGCCTGA
 
Protein sequence
MSKPTDNPAD PFKKALAEAT KTLADDPDMT VTFSVDPAGM NREGMRLPQV SRRMTRDEVM 
LARGTADAFA LRRRYHDDAI SARYLPQGQM AREIYEAMEA ARCEAVGART MPGTAGNIDA
RIGHEAERKG YGQITQAADA PLATAAGYLV RHLATGRTLP GGADNVMELW RGFIEQQAGG
TLQNLDEVLA DQAAFARLAR KVISDLGYGD QLGDDPDTDE QDDAGEDAET DEDESQSQGE
DDSDESQEAQ PERSQEEQQD PSQAQVSMED QPDADQGEEV EMPDGEAPLE PPPPAPHSEA
DPAYTVFATD FDEEIRAEDL AEPAELERLR AYLDQQLEPL KGAVSRLANK LQRRLQAQQN
RSWEFDREEG ILDAGRLARV VANPTTPLSF KVEKDTEFRD TCVTLLLDNS GSMRGRPISI
AAICADVLAR TLERCQVKVE ILGFTTRAWK GGQSREKWLA AARPQQPGRL NDLRHIIYKG
ADAPWRRARE NLGLMMKEGL LKENIDGEAL EWAHRRLSHR PEARKILMVI SDGAPVDDST
LSVNPASYLE KHLRDVISMV ERKRQVELIA IGIGHDVTRY YAHAVTITDV EQLAGAMTEQ
LASLFDADPK KRIGKRRVA