Gene Rleg_3883 details


Gene Information

Locus tag: Rleg_3883
Symbol:
ID: 8015864
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 3951619
End bp: 3953520
Gene Length: 1902 bp
Protein Length: 633 aa
Translation table: 11
GC content: 65%
IMG OID: 644826453
Product: cobalt chelatase, pCobT subunit
Protein accession: YP_002977665
Protein GI: 241206569
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG4547] Cobalamin biosynthesis protein CobT (nicotinate-mononucleotide:5,6-dimethylbenzimidazole phosphoribosyltransferase)
TIGRFAM ID: [TIGR01651] cobaltochelatase, CobT subunit
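
The start and end coordinates, gene length, and protein length in this record are mutually consistent. The arithmetic can be sketched in Python as follows, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(3951619, 3953520)
print(length)                  # 1902
print(protein_length(length))  # 633
```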


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0136743
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGCTC GCGGTGACAA TTCGAAAGCA AAGCCCGGCG CGCCTGTCGA CGTCGAGCCA 
TTGCGCCGGG CGATAACCGG CTGCGTGCGC TCGATCGCCG GTGACGGCGA CGTCGAGGTG
ACCTTCGCCA ATGAGCGGCC GGGGATGACC GGCGAGCGCA TTCGGCTGCC GGAACTTTCC
AAACGGCCGA CGGCGCATGA GCTGGCGGTC ACCCGTGGTC TCGGCGATTC GATGGCGCTG
CGGCTCGCCT GCCATGACGA GAAGATGCAT GCGACGATGG CGCCGCAAGG CTCGGACGCC
CGGGCGATCT TCGATGTAGT CGAACAGGCG CGCGTCGAAT CGATCGGCGC GCTGCGCATG
GAGGGCATGG CATCGAACCT GCGCTCCATG ACGGAAGAGA AATATTCCAA GGCGAATTTG
ACCGGCATCG AGCGCCAGGA AGACGCACCG GTCGGCGAAG CGGTTGCGAT GATGGTGCGC
GAGAAGCTGA CTGGCCAGCG CCCGCCAGCC TCCGCCGGCA AGGTGCTGGA CCTCTGGCGC
GACTTTATCG AGGACAAGGC AGGGTCCGAA CTCGACAATC TGTCGAGCGC GATCAACGAC
CAGCAGGCCT TCGCCAAGGT CATCCGCAAC ATGCTGTCGG CGATGGAAAT GGCCGAGGAA
TACGGCGACG ATGACAGCGA CGCCGACAAT GACGACCAGT CGGAGCAGGA AGACCAGCCG
AGCGGCGACG AACAGGACCA GGACGAGGTC GACGAGGATG CCGGTACCGA TGCCGCCCCC
GTCGAGGACA GTGAGGTCGC CGACGAGCAG ATGGAGGACG GCGAGACCGA AGGCGCCGAA
ATCTCCGACG ACGACATGAT GGAAGAGGGC GAGGACGATT CGGAAACGCC GGGCGAGACC
CGCCGGCCGA ATACGCCTTT TGCCGATTTC AACGAGAAGG TCGATTATCA CGTCTTTACC
GAAGAGTTCG ACGAGATCAT CACCGCCGAG GAACTTTGCG ACGCCGCCGA GCTGGAGCGC
CTGCGCGCCT TCCTCGACAA GCAGCTGGCG CATCTCCAGG GCGCGGTCGG CCGTCTCGCC
AACCGGCTGC AGCGCCGCCT GATGGCGCAG CAGAACCGCT CCTGGGATTT CGACCTGGAA
GAGGGTTATC TCGATCCGGC CCGGCTGCAG CGCATCATCA TCGATCCGAT GCAGGCGCTC
TCCTTCAAGA TGGAGCGCGA CACCCAGTTC CGCGATACTG TCGTTACCCT GCTCATCGAC
AATTCCGGCT CGATGCGCGG CCGGCCGATC ACGGTTGCCG CCACTTGCGC TGATATTCTC
GCCCGCACGC TGGAGCGCTG CGGCGTCAAG GTCGAGATCC TGGGCTTTAC GACCAAGGCC
TGGAAGGGCG GGCAGGCGCG TGAAAGCTGG CTTGCCGGCG GCAAGCCGCA GACGCCTGGC
CGCCTCAACG ACCTGCGTCA TATCATCTAC AAGTCGGCCG ACGCGCCGTG GCGGCGCGCG
CGCGCCAATC TCGGGCTGAT GATGCGCGAG GGCCTGCTCA AGGAAAATAT CGACGGCGAG
GCGCTGATCT GGGCGCATAA TCGCCTGCTC GCGCGCCGCG AGCAGCGCCG CATCCTGATG
ATGATCTCGG ACGGAGCGCC AGTGGACGAT TCGACGCTGT CGGTCAATCC GGGCAATTAT
CTGGAGCGGC ACCTGCGTGC CGTCATCGAG CAGATCGAGA CACGCTCGCC TGTCGAGTTG
CTGGCGATCG GCATCGGCCA TGACGTGACG CGCTACTATC GCCGCGCCGT GACGATCGTC
GATGCGGACG AGCTTGCCGG TGCGATGACC GAGCAGCTTG CCTCGCTTTT CGAAGACCAA
TCCGTGCAGC CGCGCGGCGG CCGGATACGC CGTGCCGGCT GA
 
Protein sequence
MAARGDNSKA KPGAPVDVEP LRRAITGCVR SIAGDGDVEV TFANERPGMT GERIRLPELS 
KRPTAHELAV TRGLGDSMAL RLACHDEKMH ATMAPQGSDA RAIFDVVEQA RVESIGALRM
EGMASNLRSM TEEKYSKANL TGIERQEDAP VGEAVAMMVR EKLTGQRPPA SAGKVLDLWR
DFIEDKAGSE LDNLSSAIND QQAFAKVIRN MLSAMEMAEE YGDDDSDADN DDQSEQEDQP
SGDEQDQDEV DEDAGTDAAP VEDSEVADEQ MEDGETEGAE ISDDDMMEEG EDDSETPGET
RRPNTPFADF NEKVDYHVFT EEFDEIITAE ELCDAAELER LRAFLDKQLA HLQGAVGRLA
NRLQRRLMAQ QNRSWDFDLE EGYLDPARLQ RIIIDPMQAL SFKMERDTQF RDTVVTLLID
NSGSMRGRPI TVAATCADIL ARTLERCGVK VEILGFTTKA WKGGQARESW LAGGKPQTPG
RLNDLRHIIY KSADAPWRRA RANLGLMMRE GLLKENIDGE ALIWAHNRLL ARREQRRILM
MISDGAPVDD STLSVNPGNY LERHLRAVIE QIETRSPVEL LAIGIGHDVT RYYRRAVTIV
DADELAGAMT EQLASLFEDQ SVQPRGGRIR RAG
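
The sequences above are printed in space-separated blocks of ten characters. Below is a minimal Python sketch for stripping that layout, estimating GC content, and translating the gene, using only the standard library. The helper names are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the sketch builds the standard codon table and assumes the sequence is given 5' to 3' on the coding strand.

```python
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Codon -> amino acid map, with codons enumerated in TCAG order.
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGGCAGCTC GCGGTGACAA TTCGAAAGCA")
print(translate(first_blocks))  # MAARGDNSKA
```

Passing the full gene sequence through `clean_sequence` and `translate` should give a protein that can be compared with the listed protein sequence, and `gc_content` gives a value to compare with the reported GC content.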