Gene Rleg_2363 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_2363 
SymbolcobD 
ID8013353 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp2365506 
End bp2366486 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content66% 
IMG OID644824945 
Productcobalamin biosynthesis protein 
Protein accessionYP_002976175 
Protein GI241205079 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1270] Cobalamin biosynthesis protein CobD/CbiB 
TIGRFAM ID[TIGR00380] cobalamin biosynthesis protein CobD 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.764656 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0402118 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATCG ACCAAAACCT TCTCGTGCTG CTTTTGGCGC TGCTGCTCGA CCGGATCGCC 
GGCGATCCAC AATGGCTGTG GTTGCGGGTG CCGCATCCCG TCGTCATGTT CGGCGCGGCG
ATCTCCTATG CCGACCGGCA GCTCAATCCC GCAAGCCTCA CGGGGTCGCA ACGCCGGATG
AACGGCGTCG CTGCCATCCT GGCGCTGCTT CTTTTGGCGC TGGCCGCAGG CTTCGTGTTC
AACCGGTTCT TCGCGCTGTT CGGCCTTGTC GGCATCTTGC TGGAGACCGG GCTGGTGGCG
ATCTTCCTGG CGCAGAAAAG CCTTGCCGAT CACGTCGCGG CCGTCGCCGT CGCGCTACGC
GACGAGGGGC TTGCCGGCGG GCGGACCGCC GTTTCCCGCA TCGTCGGGCG CGATCCCGAG
ACGCTGGACG AGCCTGCCGT CTGCCGCGCG GCGATCGAAA GCCTTGCCGA GAATTTCTCC
GACGGCGTCG TCGCACCGGC GCTCTGGTAT GCAGTCTTCG GCCTGCCGGG GCTTTTCGCC
TACAAGATGC TGAACACGGC GGATTCGATG ATCGGCCATA AGTCGGAAAA ATACATCGAC
TTCGGCTGGG CGGCCGCTCG GCTCGACGAT GTCGCCAACT GGCCGGCCGC GCGCCTCTCC
ATCCTGCTGA TTGCCGCCGG AGCCTGGATC CGGCGGGGAA CAAGCGCCGG CCGTGAGGCG
ATCCGCGTGG CGATGCGCGA CGGGGCCTTG CACCGTTCGC CGAACTCCGG CAGGCCGGAG
GCGGCCATGG CAGGCGCGCT GAACGTCCAG CTCGCCGGCC CGCGCATCTA TGGCGGCGTC
ATCGTGCGTG AACCGATGAT CAACGACGCC GGCCGCGACG TGGCGACCTC GGGCGACATC
GAGGACGGCG TATCGGTGTT TTATGCCAGC TGCATGGTGC TCGCCGGTGT GACGTTCGGG
CTTTTCTTGT GTTTTCTGTA G
 
Protein sequence
MTIDQNLLVL LLALLLDRIA GDPQWLWLRV PHPVVMFGAA ISYADRQLNP ASLTGSQRRM 
NGVAAILALL LLALAAGFVF NRFFALFGLV GILLETGLVA IFLAQKSLAD HVAAVAVALR
DEGLAGGRTA VSRIVGRDPE TLDEPAVCRA AIESLAENFS DGVVAPALWY AVFGLPGLFA
YKMLNTADSM IGHKSEKYID FGWAAARLDD VANWPAARLS ILLIAAGAWI RRGTSAGREA
IRVAMRDGAL HRSPNSGRPE AAMAGALNVQ LAGPRIYGGV IVREPMINDA GRDVATSGDI
EDGVSVFYAS CMVLAGVTFG LFLCFL