Gene Rleg2_3590 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3590 
Symbol 
ID6982351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp3715209 
End bp3717110 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content65% 
IMG OID643398315 
Productcobalt chelatase, pCobT subunit 
Protein accessionYP_002283083 
Protein GI209551166 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4547] Cobalamin biosynthesis protein CobT (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole phosphoribosyltransferase) 
TIGRFAM ID[TIGR01651] cobaltochelatase, CobT subunit 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.94588 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.603807 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCTC GCGGTGACAA TTCGAAAGCA AAGCCCGGCG CGCCCGTCGA CGTCGAGCCA 
TTGCGCCGGG CGATATCCGG CTGCGTGCGC TCGGTCGCCG GCGACGGCGA TGTCGAGGTG
ACCTTCGCCA ATGAACGGCC TGGCATGACC GGCGAGCGCA TCCGGCTGCC GGAGCTTTCC
AAGCGGCCGA CGGCGCATGA GCTGGCGGTC ACCCGCGGGC TCGGCGATTC CATGGCGCTG
CGCCTTGCCT GCCATGACGA GAAGATGCAT GCGACGATGG CGCCGCAGGG TTCGGATGCC
CGGGCGATCT TCGATGTCGT CGAGCAGGCG CGCGTCGAAT CGATCGGCGC GCTGCGCATG
GAGGGCATGG CGACCAACCT GCGCTCCATG ACCGAAGAGA AATATTCCAA GGCGAACTTC
ACCGGCATCG AGCGCCAGGA AGACGCACCG GTCGGCGAAG CCGTCGCGAT GATGGTGCGC
GAGAAGCTCA CCGGTCAGCG CCCGCCTGAA ACCGCCGGCA AGGTGCTCGA CCTCTGGCGC
GGCTTCATCG AGGAAAAGGC GGGGGCCGAA CTCAACAATC TGTCGGGTGC GATCAACGAC
CAGCAGGCCT TCGCCAAGGT CATCCGCAAC ATGCTGTCGG CCATGGAAAT GGCCGAGGAA
TACGGCGATG ACGACAACGA CGCCGACAAT GACGACCAGT CGGATCAGGA AGACCAGCCG
AGCGGCGACG AGCAGGATCA GGACGAGGTC GACGAGGATG CCGGCACCGA TGCCGCCCCG
GTCGAAGACA GCGAAGTCGC CGACGAGCAG ATGGAGGACG GCGAGACCGA AGGCGCCGAA
ATCTCCGACG ACGACATGAT GGAAGAGGGC GAGGACGATT CGGAAACGCC GGGCGAGACC
CGCCGTCCGA ACACGCCTTT CTCAGATTTC AACGAGAAGG TCGATTATCA CGTCTTTACC
GAAGAGTTCG ACGAGATCAT CACCGCCGAG GAACTCTGCG ACGCCGCCGA ACTGGAGCGC
CTGCGCGCCT TCCTCGACAA GCAGCTGGCA CACCTGCAGG GCGCGGTCGG CCGCCTCGCC
AACCGGCTGC AGCGCCGGCT GATGGCGCAG CAGAACCGCT CCTGGGATTT CGATCTGGAA
GAGGGTTATC TCGATCCGGC CCGGCTGCAG CGCATCATCA TCGATCCGAT GCAGGCGCTG
TCCTTCAAGA TGGAGCGCGA CACGCAGTTC CGCGACACGG TCGTCACCTT GCTGATCGAC
AATTCCGGCT CGATGCGCGG CCGGCCGATC ACGGTTGCCG CCACCTGCGC CGATATCCTC
GCCCGCACGC TGGAGCGCTG CGGCGTCAAG GTCGAGATCC TCGGTTTTAC CACCAAGGCC
TGGAAGGGCG GGCAGGCGCG GGAAAGCTGG CTTGCCGGCG GCAAACCGCA GACGCCCGGC
CGCCTCAACG ACCTGCGCCA CATCATCTAC AAATCGGCCG ACGCGCCGTG GCGTCGGGCA
CGCGCCAATC TCGGGCTGAT GATGCGCGAG GGCCTGCTCA AGGAAAATAT CGACGGCGAG
GCGCTGATCT GGGCGCATAA CCGCCTGCTC GCACGCCGCG AGCAGCGCCG CATCCTGATG
ATGATCTCGG ACGGCGCGCC AGTCGACGAT TCGACGCTGT CGGTCAATCC GGGCAATTAT
CTCGAGCGGC ACCTGCGCGC CGTCATCGAA CAGATCGAGA CACGCTCGCC GGTGGAATTG
CTGGCAATCG GCATCGGTCA CGACGTGACG CGCTACTATC GCCGCGCCGT GACGATCGTC
GATGCCGACG AACTTGCCGG CGCGATGACC GAGCAGCTCG CCTCGCTGTT CGAAGATCAA
TCCACCCAGC CGCGTGGCGG CCGGCTCCGT CGTGCCGGCT GA
 
Protein sequence
MAARGDNSKA KPGAPVDVEP LRRAISGCVR SVAGDGDVEV TFANERPGMT GERIRLPELS 
KRPTAHELAV TRGLGDSMAL RLACHDEKMH ATMAPQGSDA RAIFDVVEQA RVESIGALRM
EGMATNLRSM TEEKYSKANF TGIERQEDAP VGEAVAMMVR EKLTGQRPPE TAGKVLDLWR
GFIEEKAGAE LNNLSGAIND QQAFAKVIRN MLSAMEMAEE YGDDDNDADN DDQSDQEDQP
SGDEQDQDEV DEDAGTDAAP VEDSEVADEQ MEDGETEGAE ISDDDMMEEG EDDSETPGET
RRPNTPFSDF NEKVDYHVFT EEFDEIITAE ELCDAAELER LRAFLDKQLA HLQGAVGRLA
NRLQRRLMAQ QNRSWDFDLE EGYLDPARLQ RIIIDPMQAL SFKMERDTQF RDTVVTLLID
NSGSMRGRPI TVAATCADIL ARTLERCGVK VEILGFTTKA WKGGQARESW LAGGKPQTPG
RLNDLRHIIY KSADAPWRRA RANLGLMMRE GLLKENIDGE ALIWAHNRLL ARREQRRILM
MISDGAPVDD STLSVNPGNY LERHLRAVIE QIETRSPVEL LAIGIGHDVT RYYRRAVTIV
DADELAGAMT EQLASLFEDQ STQPRGGRLR RAG