Gene Smed_3785 details


Gene Information

Locus tag: Smed_3785
Symbol:
ID: 5317953
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 231748
End bp: 233016
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 62%
IMG OID: 640775598
Product: protein of unknown function DUF900 hydrolase family protein
Protein accession: YP_001312531
Protein GI: 150375935
COG category: [S] Function unknown
COG ID: [COG4782] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
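
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
start_bp = 231748
end_bp = 233016

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints: 1269 422
```

Under these assumptions the result agrees with the reported Gene Length (1269 bp) and Protein Length (422 aa).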


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.658591
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGTAATG TAAACTCGTG CGGAGGCAGG CTCCGTGTCG CGAAAACCGG AAGCACGACC 
GTGTCGGCGC TTGTCGCCCT CGTTCTTCTG GCGGGCTGCG GAGGCCACGC GAAAGGCGTG
ATGGCTCCTG TGGCTCTGGC GCAGCCGTCG GCCACCTCGC AGGTCGACAT GCTGGTTGCG
ACGACCCGTG AACCCTCGGG AGACGCGGCG ACATTGTTCT CGGGAGAGCG CAGCCCGACA
CTGTCTATGA CCGATGTTGC GGTTTCGATT CCGCCGGACT CGCGCCGCAA GCCCGGTACG
GTGCAGTGGC CGCGGAAGCT TCCTCCGAAT CCGGAAACCG ACTTCGCTGT CACGCGGGTG
CGCAAGCTGG CATCGAATGA CGAAGCGCGC GACTGGTTCC AGGTTCACAA TGAGGGCGGC
CACGTGTTGC TCTTCGTGCA TGGTTTCAAC AACCGCTATG AGGATGCTGT TTTCCGGCTG
GCGCAGATTG TTCACGATTC GGGCGCCCAG GCGACCCCGA TCCTGTTCAC CTGGCCATCG
CGGGCACGGC TGTTCGACTA TAATTACGAC AAGGAGAGCA CCAATTACTC GCGCACTGCA
CTCGAGGATA CGCTGCGCAC GCTGGCGTCC GCGCCGCGCG TCAAGGACAT CACCATCCTT
GCCCATTCTA TGGGAACCTG GCTGACGATG GAGTCGCTGC GCCAGATGGG GATCCGCGAC
GGCGGCATCG CGCCAAAGAT CGAGAACGTG ATTCTCGCTT CGCCCGACAT CGATCTCGAC
GTCTTCGCCA AGCAATGGGT CGATATGGGC AAGGCACGCC CGAAGTTTAC GATCTTCGTC
TCACAGGACG ACCGGGCGCT CGCGGTATCG CGGCTGATCT CCGGCGACGT GTCTCGACTC
GGCGCGATCG ATCCGACCGC CGAGCCCTAC CGTACACAGT TGGAGACTGC CGGCATCACG
GCGATCGATC TCACCAAGGT CCAGACGGAT GACGGCCTGC ATCATGGAAA ATTCGCCGAA
AGCCCGGAGA TCGTGCAACT GATCGGGCAG CGGATCATCA AAGGGCAGAC GCTGACCGAT
TCCGACATCT CGCTCGGCGA AGGCATCACT GCCGTCGTCG CAGGCACGGC CAAGAATGTC
GGTACCGTTG CGGCGGCCAC GATCACCGCG CCGGTCACCA TCATCGAGCA GCGTGGAACG
CCGCGCAAGA AGGTCAATCT GGAGGAGACG CTGACGAGCA GCGAGAATGC GGGAAACACG
GCCCGTTGA
 
Protein sequence
MGNVNSCGGR LRVAKTGSTT VSALVALVLL AGCGGHAKGV MAPVALAQPS ATSQVDMLVA 
TTREPSGDAA TLFSGERSPT LSMTDVAVSI PPDSRRKPGT VQWPRKLPPN PETDFAVTRV
RKLASNDEAR DWFQVHNEGG HVLLFVHGFN NRYEDAVFRL AQIVHDSGAQ ATPILFTWPS
RARLFDYNYD KESTNYSRTA LEDTLRTLAS APRVKDITIL AHSMGTWLTM ESLRQMGIRD
GGIAPKIENV ILASPDIDLD VFAKQWVDMG KARPKFTIFV SQDDRALAVS RLISGDVSRL
GAIDPTAEPY RTQLETAGIT AIDLTKVQTD DGLHHGKFAE SPEIVQLIGQ RIIKGQTLTD
SDISLGEGIT AVVAGTAKNV GTVAAATITA PVTIIEQRGT PRKKVNLEET LTSSENAGNT
AR
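
The relationship between the two sequences above can be illustrated with a short translation sketch. This is not the database's own method; it is a hedged example with several assumptions. The helper name `translate_cds` is hypothetical. The amino-acid assignments are those of the standard genetic code, which translation table 11 shares. The set of start codons is an assumption based on the initiation codons commonly listed for table 11. A start codon in the first position is read as M, which is why the gene begins with GTG while the protein begins with M.

```python
# Build the codon table in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Assumption: initiation codons commonly listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        # An alternative start codon in the first position is read as M.
        if i == 0 and codon in START_CODONS:
            amino_acid = "M"
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence as shown above.
first_line = "GTGGGTAATG TAAACTCGTG CGGAGGCAGG CTCCGTGTCG CGAAAACCGG AAGCACGACC"
print(translate_cds(first_line))  # prints: MGNVNSCGGRLRVAKTGSTT
```

The output matches the first 20 residues of the protein sequence. Translating the last line of the gene sequence, `GCCCGTTGA`, gives `AR`, which matches the end of the protein, with TGA read as the stop codon.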