Gene Smed_6106 details


Gene Information

Locus tag: Smed_6106
Symbol:
ID: 5320408
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 1038167
End bp: 1039384
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 60%
IMG OID: 640777746
Product: hypothetical protein
Protein accession: YP_001314678
Protein GI: 150378083
COG category: [S] Function unknown
COG ID: [COG1944] Uncharacterized conserved protein
TIGRFAM ID: [TIGR00702] uncharacterized domain
    [TIGR03266] putative methanogenesis marker protein 1
    [TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family
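
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1038167
end_bp = 1039384

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1218
print(protein_length)  # 405
```

Under those assumptions the computed values agree with the Gene Length (1218 bp) and Protein Length (405 aa) fields.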


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0977639
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGCTG TTGAGGAATA CGCACAAGGA ACCCAGCGCA CCTATAATCC GGAAGAGACA 
TTGCGGAGAA TTGCGCCAGC GATGCGCACC TGCGGCATCA GCCGCGTTCT CGACGTCACC
CATCTCGATC GAATCGGCAT CCCCACCTAC AATGCCGTGC GCCCCAATGG AATGATCTTG
TCCGTGTCCA ATGGGAAAGG GTGGACGAAG GCCGCCGCAT CCGTATCCGC AATCATGGAG
TCGATCGAGG TCGAGCACGC CGAATATCCG GATACCTCGG CCTGGCATCT GGCTCAAAGC
GCGAAGGTAC TGCGAAACCG GGGATATTCG GTCGTCGATG CGCCAACGCT CATCAGCGAG
TGCCTCTGGC CCTCCGATAC CTATGGCGGT CTCTACTACT CAGATGATCT TCGCCTCGAT
TGGGTCGAGG GACGCGAGAT CATCGAGTCT CGGCCAGTCC TGCTGCCGGC CAGCACCATC
TATGTACGCG CGCCTTACGT GCATTATTTC ACCAGCAATG GATTGGCGAG CGGCAACACC
TGGGAAGAAG CAACGCTTCA CGGCATCTGC GAGTTGATCG AACGCGATTC CACGGCGCGT
CTCCTCGGTC GTCCCGAGGG CATGACCACA TCACGGTTGC TTCGGATCGA GCCAAAATCG
ATGCCTGAAC ATCTTGGGCA CTTCTCGGAG AAAGTGGCGC AGGCCGGAAT AGAGCTTTTC
ATGTTCGCTC TTCCGAGCGC CATCGATATC CATACATTCT GGGCTGTGTT CCATTGCCCG
GGCGAGCCCA GTTTCATGCT GGCCACCTCG GCGGGCTTCG GGTGCCATAC ATCGCCGCAG
ATCGCCGCGT CTCGCGCCCT GACCGAAGCT GCCCAGTCGC GCCTGACCTA TATTCACGGG
GCGCGGGAGG ATCTGGGGAT AGACCACGTC AATCGTCCAC TGACCTGCGC GGAGACAGAA
GCGCGATTGG CCTTGCAGGC GCGGACATTT GCCAAGTTTA GGCAGATCCC GACCGTCACC
TGGGATGAAC TTCTGGCCGT GGCGCCACAT CGTGCCCGCG GGCGTACGAT CCCGGAAAGC
CTCTCCATGG TGCTGAGAAT GTTGAAGGAG GCCGGACACG GTCAGGTCTA TGTCCACGAT
CTGACGAAGC GCGGTCTAGA CCTGGCCGTG ACGAAGGCCT TCGTGCCGGG GCTCAAGGTC
AGCGCGAAGA TGATTTGA
 
Protein sequence
MNAVEEYAQG TQRTYNPEET LRRIAPAMRT CGISRVLDVT HLDRIGIPTY NAVRPNGMIL 
SVSNGKGWTK AAASVSAIME SIEVEHAEYP DTSAWHLAQS AKVLRNRGYS VVDAPTLISE
CLWPSDTYGG LYYSDDLRLD WVEGREIIES RPVLLPASTI YVRAPYVHYF TSNGLASGNT
WEEATLHGIC ELIERDSTAR LLGRPEGMTT SRLLRIEPKS MPEHLGHFSE KVAQAGIELF
MFALPSAIDI HTFWAVFHCP GEPSFMLATS AGFGCHTSPQ IAASRALTEA AQSRLTYIHG
AREDLGIDHV NRPLTCAETE ARLALQARTF AKFRQIPTVT WDELLAVAPH RARGRTIPES
LSMVLRMLKE AGHGQVYVHD LTKRGLDLAV TKAFVPGLKV SAKMI
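
The two sequences are printed in space-separated blocks of ten characters, so they need to be cleaned before further use. The sketch below shows one way to strip the spacing, compute a GC fraction, and translate DNA to protein. It assumes the amino-acid assignments of translation table 11, which match the standard genetic code; table 11 also permits alternative start codons, but this gene begins with ATG, so that difference is not handled here. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and do not come from the database.

```python
# Codon table built from the standard amino-acid string in TCAG order.
# Assumption: table 11 shares these amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_block):
    """Remove the spaces and line breaks used for display; upper-case the result."""
    return "".join(seq_block.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a DNA string."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_block = "ATGAATGCTG TTGAGGAATA CGCACAAGGA"
last_block = "AGCGCGAAGA TGATTTGA"

print(translate(first_block))  # MNAVEEYAQG
print(translate(last_block))   # SAKMI
```

The two translated fragments match the beginning and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.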