Gene Sare_4003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4003 
Symbol 
ID5704889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4554675 
End bp4556732 
Gene Length2058 bp 
Protein Length685 aa 
Translation table11 
GC content71% 
IMG OID641273428 
Productmethylmalonyl-CoA mutase, large subunit 
Protein accessionYP_001538784 
Protein GI159039531 
COG category[I] Lipid transport and metabolism 
COG ID[COG1884] Methylmalonyl-CoA mutase, N-terminal domain/subunit
[COG2185] Methylmalonyl-CoA mutase, C-terminal domain/subunit (cobalamin-binding) 
TIGRFAM ID[TIGR00640] methylmalonyl-CoA mutase C-terminal domain
[TIGR00641] methylmalonyl-CoA mutase N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.446619 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000372138 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGACGAGA AGGCCCCCCC AGGTCGGCTA CCCGAACGGG ACCGTCCGTG GGTGATGCGA 
ACCTACGCCG GGCACAGCTC GGCCGCCGCG ACCAATGCGC TCTTCCGCCG CAACCTGGGA
AAGGGGCAGA CCGGGCTCTC GGTCGCCTTC GACCTGCCCA CCCAGACCGG GTACGACCCC
GATCACGAAC TGGCTGTCGG GGAGGTCGGC CGGGTGGGCG TGCCGGTGGC GCACCTCGGC
GACCTGCGGG CTCTCTTCCA GGGCATTCCG CTGGCCGACA TGAACACCTC GATGACCATC
AACGCCCCGG CGATGTGGCT GCTCGGCCTC TACAGCACCG TCGCCTCCGA GCAGGGGGCC
GAACTCCACA GTTGTGCCGG CACAACCCAG AACGACATCA TCAAGGAGTA CCTGTCCCGC
GGAACGTATA TCTTCCCGCC GGCGGCGTCG CTGCGGCTGA CCGCCGACCT CATCGCGTAC
ACGCTGCGCG AGATGCCCCG GTGGAACCCG GTCAACATCT GCTCGTACCA CCTCCAGGAG
GCGGGGGCCA CGCCGGTCCA GGAGGTCGGC TTCGCGCTCG CCACCGCGGT CGCCGTCCTC
GACGCCGTCC GTGACTCCGG CCAGGTGCCC GCCGAGCGAA TGGGGGACGT GGTCCAGCGA
ATCAGCTTCT TCGTCAACGC CGGAGTCCGG TTCGTCGAGG AGGTCGCCAA GATGCGCGCC
TTCGGTGCGC TCTGGGACGA GATCACCCGC GAACGGTACG GCGTGACGAA CCCGAAACAG
CGCCGCTTTC GCTACGGCGT CCAGGTCAAC TCCCTGGGCC TGACCGAGGC GCAGCCGGAG
AACAACATCC AGCGCATCGT CCTGGAGATG CTCGGCGTGA CCTTCTCCCG AGATGCCCGG
GCTCGGGCGG TGCAGCTCCC CGCCTGGAAC GAGGCGCTCG GCCTACCCCG GCCGTGGGAT
CAGCAGTGGT CGCTGCGGAT GCAGCAGGTG CTCGCGTACG AGTCGGACCT GCTGGAGTAC
CCCGACCTGT TCGACGGCTC GCACGTGATG ACGGCACTGG TCGACAGCAT CGTCTCCGGA
GCTCGGGTCG AGCTGGACAA GGTGCTGGAG ATGGGGGGCG TGGTCGCCGC TGTCGAGACC
GGGTACCTGA AGAGCGCCCT GGTCGCCTCG CTCGCCGACC GGCGTCGTCG AATGGAGGCC
GGCACGGACG TCGTGGTCGG CGTCAACCGG TTCACCGAAA CCGAGCCGTC CCCGCTGACC
GCGGCCGGCG CCGAGGCGAT CGAACAGGTC GATCCGGCGG TGGAGGCTGC CGCCGCGGAC
GCGGTGCGGA ACTGGCGGGC CAGCCGGGAC ACGGCAGCCA CGGACGCCGC ACTCGCCCGG
CTGCGCGCGG ACGCCGCGTC CACCACGAAC CTGATGCCGG CCACGCTGGC CTGCGTCCGC
GCAGGTGTCA CCACCGGTGA GTGGGCGGGC GCGTTGCGGC AGGTGTTCGG GGAGTACCGG
GCCCCCACCG GCCTGTCCGG GTCGGCTGGC TCCGGCGGGG ACGTCACCCT GTCGGCGGTC
CGGGCCCGGG TGGCTGCCAC CGCCCGACAG TTGGGCAGCG GACGGCTGCG GCTGCTGGTC
GGCAAGCCCG GTCTGGACGG GCACTCCAAC GGCGCCGAGC AGATCGCGGT TCGGGCCCGT
GACGCGGGCT TCGAGGTCGT CTACCAAGGC ATCCGGCTGA CCGCTGGGCA GATCGTGGCC
GCCGCCGTCG AGGAGGACGT CGACCTGGTC GGTCTGTCGG TTCTCTCCGG TTCGCACCTT
GCGGCCGTGC CGGCCGTCCT GGACGGTCTG CGTGCCGCCG GCCGCGGCGA CATGCCCGTC
GTCGTCGGCG GCATCATTCC TGAGACGGAC ACGCAGACCC TTCGGGACGC CGGAGTCGTC
CGGGTCTTCA CGCCGAAGGA TTTCGCGCTC ACCGACGTCA TCGACGAGTT GGTGACCGTG
GTTCGCCACG CCAACGGGCT GCCGGAGGAG GACGGGCCGT CGCCGGCACC GGGCTTCGAA
AAGGCGCACC GGCGCTGA
 
Protein sequence
MDEKAPPGRL PERDRPWVMR TYAGHSSAAA TNALFRRNLG KGQTGLSVAF DLPTQTGYDP 
DHELAVGEVG RVGVPVAHLG DLRALFQGIP LADMNTSMTI NAPAMWLLGL YSTVASEQGA
ELHSCAGTTQ NDIIKEYLSR GTYIFPPAAS LRLTADLIAY TLREMPRWNP VNICSYHLQE
AGATPVQEVG FALATAVAVL DAVRDSGQVP AERMGDVVQR ISFFVNAGVR FVEEVAKMRA
FGALWDEITR ERYGVTNPKQ RRFRYGVQVN SLGLTEAQPE NNIQRIVLEM LGVTFSRDAR
ARAVQLPAWN EALGLPRPWD QQWSLRMQQV LAYESDLLEY PDLFDGSHVM TALVDSIVSG
ARVELDKVLE MGGVVAAVET GYLKSALVAS LADRRRRMEA GTDVVVGVNR FTETEPSPLT
AAGAEAIEQV DPAVEAAAAD AVRNWRASRD TAATDAALAR LRADAASTTN LMPATLACVR
AGVTTGEWAG ALRQVFGEYR APTGLSGSAG SGGDVTLSAV RARVAATARQ LGSGRLRLLV
GKPGLDGHSN GAEQIAVRAR DAGFEVVYQG IRLTAGQIVA AAVEEDVDLV GLSVLSGSHL
AAVPAVLDGL RAAGRGDMPV VVGGIIPETD TQTLRDAGVV RVFTPKDFAL TDVIDELVTV
VRHANGLPEE DGPSPAPGFE KAHRR