Gene Rleg_3884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3884 
Symbol 
ID8014705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3953533 
End bp3954525 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content61% 
IMG OID644826454 
Productcobalt chelatase, pCobS small subunit 
Protein accessionYP_002977666 
Protein GI241206570 
COG category[R] General function prediction only 
COG ID[COG0714] MoxR-like ATPases 
TIGRFAM ID[TIGR01650] cobaltochelatase, CobS subunit 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.922452 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0087584 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCAAGA TCGACCTGGA TATATCAGAA CTGCCCGATA CCACCGTTTC GGTCCGCGAG 
GCCTTCGGCA TCGATTCCGA CATTCGCGTT CCCGCCTACA GCAAGGGCGA CGCCTATGTT
CCGGACCTCG ACACCGACTA CCTGTTCGAC CGCGACACGA CGCTCGCCAT TCTCGCAGGC
TTCGCCCATA ACCGCCGCGT GATGATTTCC GGCTATCACG GCACGGGCAA GTCCTCGCAT
ATCGAGCAGG TGGCGGCGCG GCTCAACTGG CCTTGCGTGC GCATCAACCT CGATAGCCAT
GTCAGCCGTA TCGATCTCGT CGGCAAGGAT GCGATCGTCG TCAAGGACGG GCTGCAGGTC
ACCGAATTCA AAGACGGCAT CCTGCCCTGG GCCTATCAGC ACAATGTCGC GCTGGTCTTC
GACGAATATG ATGCCGGCCG CCCCGATGTG ATGTTCGTGA TTCAGCGCGT ACTCGAATCC
TCCGGGCGCC TGACGCTGCT CGATCAGAGC CGCGTCATTC GGCCGCACCC GGCCTTCCGT
CTGTTTGCGA CTGCGAACAC GATCGGCCTC GGCGACACGA CCGGCCTCTA TCACGGCACG
CAGCAGATCA ACCAGGCGCA GATGGACCGC TGGTCGATCG TCACCACGCT GAACTACCTG
CCGCATGATC ACGAAGTGAA TATCGTCGCC GCCAAGGTGA AGAGCTTCGG CAAGGACAAG
AACGGCCGTG AGACGGTTTC GAAGATGGTG CGCGTCGCCG ACCTGACGCG TGCCGCCTTC
ATGAACGGCG ATCTCTCGAC CGTCATGAGC CCGCGTACGG TTATCACCTG GGCCGAAAAC
GCCGAAATCT TCGGCGATCT CGCCTTCGCC TTCCGCGTCA CCTTCCTCAA CAAGTGCGAC
GAGCTGGAGC GTCCGTTGGT CGCCGAGCAT TATCAGCGCG CCTTCGGCGT CGAGCTGAAG
GAAAGTGCCG CCAACATCGT TCTCGGGGCT TGA
 
Protein sequence
MSKIDLDISE LPDTTVSVRE AFGIDSDIRV PAYSKGDAYV PDLDTDYLFD RDTTLAILAG 
FAHNRRVMIS GYHGTGKSSH IEQVAARLNW PCVRINLDSH VSRIDLVGKD AIVVKDGLQV
TEFKDGILPW AYQHNVALVF DEYDAGRPDV MFVIQRVLES SGRLTLLDQS RVIRPHPAFR
LFATANTIGL GDTTGLYHGT QQINQAQMDR WSIVTTLNYL PHDHEVNIVA AKVKSFGKDK
NGRETVSKMV RVADLTRAAF MNGDLSTVMS PRTVITWAEN AEIFGDLAFA FRVTFLNKCD
ELERPLVAEH YQRAFGVELK ESAANIVLGA