Gene Smed_2559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2559 
Symbol 
ID5323427 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2656021 
End bp2657916 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content64% 
IMG OID640791502 
Productcobalt chelatase, pCobT subunit 
Protein accessionYP_001328224 
Protein GI150397757 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4547] Cobalamin biosynthesis protein CobT (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole phosphoribosyltransferase) 
TIGRFAM ID[TIGR01651] cobaltochelatase, CobT subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.921544 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.478832 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTTCGA ATTCCAAGGC GAAACCGAAT ACGAGGGAAA ACGCCGCCGA ACCGTTCAAG 
CGGGCGCTCT CCGGCTGCGT CAGGTCCATC GCCGGCGATG CGGAGCTTGA GGTCGCTTTT
GCCAATGAGC GGCCGGGAAT GACAGCCGAG CGCATTCGGC TGCCGGAACT CTCCAAACGC
CCGACCCGTC AGGAGCTGGC AGTGGCGCGG GGCCTTGGCG ATTCCATGGC GCTGCGCAAG
GCCTGCCACG ATGCTCGAGT CCATGCGACG ATGTCGCCGC AGGGGGCGGA CGCCCGAGCG
ATCTTCGATG CGGTCGAGCA GGCGCGCGTC GAGGCGATCG GCGCTCTCAG GATGCCGGGT
GTGGCCGACA ATCTCTCCTC CATGCTCCAG GAGAAATACG CCAAGGCGAA TTTCTCGGCA
ATCAATGCCC AGGCGGATGC GCCGCTCGAG GAGGCGGTGG CGCTGCTCGT GCGCGAGAAG
CTGACCGGCG AGAAGCCGCC GGAATCGGCC GGCCAGGTCC TCGACCTCTG GCGTGAGTTC
ATTGAGCAGA AGGCCGGCAG CGACATGCGC AACCTTGCCG GTACGGTCAA CGATCAGCAG
GCTTTTGCCC GCGTCGTGCG CGACATGCTG TCTTCGATGG ACGTTGCCGA GAAATACGGC
GACGACGATA GCGAGCCGGA CGAACAGGAG AGCGAGACCG ACGAGGACCA GCCGCGCAGT
CAGGAGCAGG ACGAAAACGC CAGCGACGAA GAGCAGGGCT CCGATGCCGC CCCTGCCGAG
GAAAACCAGT CTGCCGAGGA GCAGATGGAA GATGGCGAAA TGGACGGCGC GGAAATTTCC
GACGACGACC TTCAGGACGA AGGCGACGAG GACAGCGAAA CGCCGGGCGA GGTCAAGCGG
CCGAACCATC CCTTCGCGGA TTTCAACGAG AAGGTGGATT ATTCCGTCTA TACCCGGGAT
TTCGACGAAA CGATTGCCTC GGAAGAGCTC TGCGACGAGG CGGAGCTCGA CAGGCTGCGC
GCTTTTCTCG ACAAGCAGCT CGCCCATCTG CAGGGTGCAG TGGGGCGCCT CGCCAATCGC
CTGCAGCGCC GCCTGATGGC GCAGCAGAAC CGGTCCTGGG AGTTCGATCT CGAGGAGGGG
TATCTCGACA CGGCCCGGCT GCAGCGCATC ATCATCGATC CTATGCAGCC GCTTTCCTTC
AAGAGGGAGA AGGATACCAA TTTCCGCGAC ACGGTCGTCA CGCTGCTGAT AGACAATTCC
GGCTCCATGC GCGGGCGTCC GATCACGGTA GCAGCCACCT GCGCCGACAT TCTTGCGCGA
ACGCTCGAGC GTTGCGGCGT GAAGGTCGAA ATCCTCGGAT TCACGACAAA GGCCTGGAAG
GGCGGACAGT CGCGCGAGAA ATGGCTTGCC GGCGGCAAGC CCCAGTCGCC GGGCCGCCTC
AACGACCTCA GGCACATCAT CTATAAGTCC GCTGACGCGC CATGGCGCCG CGCACGGCGC
AATCTCGGTC TGATGATGCG CGAGGGACTG CTCAAGGAGA ACATCGACGG CGAGGCGCTG
ATGTGGGCGC ATGACCGGCT CCTGGCACGG TCCGAGCAAA GACGCATTCT CATGATGATC
TCGGATGGCG CGCCGGTCGA CGATTCGACC CTTTCGGTGA ACCCCGGAAA TTACCTCGAA
CGCCATCTGC GCGCCGTTAT CGAGCAGATC GAAACGCGCT CTCCGGTCGA GTTGCTTGCG
ATCGGCATCG GGCACGACGT CACACGCTAT TATCGCCGCG CCGTGACGAT CGTCGATGCG
GACGAACTCG CGGGAGCGAT GACGGAGCAG CTTGCAGCCC TCTTCGAGGA CGAGAGTACG
CGCCGCCGCC CGGGCGGGGC GCGTCGAGCC GGTTAA
 
Protein sequence
MSSNSKAKPN TRENAAEPFK RALSGCVRSI AGDAELEVAF ANERPGMTAE RIRLPELSKR 
PTRQELAVAR GLGDSMALRK ACHDARVHAT MSPQGADARA IFDAVEQARV EAIGALRMPG
VADNLSSMLQ EKYAKANFSA INAQADAPLE EAVALLVREK LTGEKPPESA GQVLDLWREF
IEQKAGSDMR NLAGTVNDQQ AFARVVRDML SSMDVAEKYG DDDSEPDEQE SETDEDQPRS
QEQDENASDE EQGSDAAPAE ENQSAEEQME DGEMDGAEIS DDDLQDEGDE DSETPGEVKR
PNHPFADFNE KVDYSVYTRD FDETIASEEL CDEAELDRLR AFLDKQLAHL QGAVGRLANR
LQRRLMAQQN RSWEFDLEEG YLDTARLQRI IIDPMQPLSF KREKDTNFRD TVVTLLIDNS
GSMRGRPITV AATCADILAR TLERCGVKVE ILGFTTKAWK GGQSREKWLA GGKPQSPGRL
NDLRHIIYKS ADAPWRRARR NLGLMMREGL LKENIDGEAL MWAHDRLLAR SEQRRILMMI
SDGAPVDDST LSVNPGNYLE RHLRAVIEQI ETRSPVELLA IGIGHDVTRY YRRAVTIVDA
DELAGAMTEQ LAALFEDEST RRRPGGARRA G