Gene Smed_1646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1646 
Symbol 
ID5322504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1739655 
End bp1741526 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content62% 
IMG OID640790586 
Producthypothetical protein 
Protein accessionYP_001327318 
Protein GI150396851 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.327516 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.459319 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGTC CCGACATTCC CGTCACGATC TCCGGTGATC CGAAGGGCTT CGAGTCCGCG 
CTTGTCCGGG TGCGGGCACT CTCGAAGTCG ACGGCAACTG ACGTCGTTGC ATCCTTCGGC
CGGATCAAGA ACCTCGTGGC CGGCGGCGCC GGTCTCGTGA CCGGGCTTGT CTCCGCCGCC
AGCGTCACCG CATTGCGCGA TGCAGCGGGC GCGATTGCCT CGATCGGCGA CGAGGCGCGT
CGGGCCGGCC TCGACGTCAA GAGCTTCCAG GAGCTGAAGT TCGTCGCCGA GCAGAACCGT
GTCGGCGTCG ACGCGCTGAC CGACGGCATC AAGGAATTGA ACCTTCGGGC CGACGAATTC
ATCGTCACCG GAGGCGGATC GGCGGCCGAG GCTTTCCAGC GCCTCGGCTA CTCGGCCGAG
GACCTGAAGC AGAAGCTCGA AGATCCGGCT GATCTCTTCA CCGAGATCAT CGGTCGCCTG
GGCGAGCTCG ACAAGGCGGC ACAGATCCGC ATCATGGACG AGATCTTCGG CGGCGCGGGC
GGCGAACAGT TCGTGCAGCT GATCGAGGCG GGTGAAGCGG GCATCCGCGA CACCATCAGG
GCCGCGAACG ACCTGGGCAT CGTTGTTGAC GAGCAGATGA TCCAGAAGGC TGCAGACGTC
GACCGCAAGT TCAACATGCT TGCGACGACG GTCGGCACGA AGTTGAAATC CGCCATCGTT
TCGGCGGCTG ACAGCCTAGC GGAATTTATC GACGGCTTCC GTGATTTCCA AAACCAGATG
AACAGCACGC TTCAGGGCAG GCAAGCCAAA ATCGGCGAGC GTCAGCTCGA GATCGAGAAT
GAAATCCTCA AGAAGAAGGA GGCGCAGGCT CGACAGGACG AGAAGCTCTC CGATGTCGCC
AGGAAGCTTG GTTTTGAAAA CAGTAAGAAC GCCAACCTTG CCGGCTACAC CGGGCAGATA
GAAGCCCTGA AGGAAGAGAG CCGGAAACTC GCCGAAGAAG AGGCGAAGAT CGTCAATATC
CTGAGCGATC GCCTCCAGCC GATGAACCGC CCGGCCGAGA GGACCTGGAC GCCGATCGAT
ACTGAAGAAA AAGGCGGCGG CCGGTCCAAG AAAGTCTCGG AAGCCGGGAA AGAAAAGAAG
GCGATCGACG ACGTGATCGC GTCGTTGCGT GAGGAGTTGG CGATCATCGG CCTCACCGAC
ATCGAGCGGG AGCGGACAAT TGCGCTGCGC GAGGCGGGTG TCGAGGCGAC CTCGAAGGAA
GGCCAGCAGA TCTCGGCGCT CATCGACGAA AAGTACCACC AGCTCGCAGC TGAGGAGGCC
TTGGCCGAGC AGTATGAGCG CAGCGAAGAA GCGGCCGAGC GAATGGGGCA GGTCCTCGAT
GATCAGCTCA TGCGCATCGT CGACGGCAGT TTCGATGCGA AGGAGGCGAT TGCGGCGCTG
CTCACCGAGA TCATCAATGT CCAGACGAAC GGGAAGGGGC TCTTCGGTTC GCTGTTCAGC
TCCATTTTCG GCGGTGGTAG CGGTTTTGGC TCCAACTTCG TGCCGACCAC AACGCTCGGT
GACTTCCTCG GCTATGGCGG TGCGCGCGCT GGCGGCGGTG ATGTTTCTCC CGGGCGCATC
TACCGGGTGA ACGAATATGA GGACGAGTTC TTTGCTCCGA CCAGCCACGG CCGGATCATC
GCGCCGAGCA AGCTGTCCGG CGCGGCGGCA GACAGAGAAG GCGGCGGCGG GCGCACCGTC
GTTGAGATCG TACTGAGCAA GGATTTGTTG GCCAGCATCC TCGAGCAGAC CGGCAATCAG
ACCGTGCGCA TCGTGCGCAG CAACGAGGAA GCCCGGACGA ACTATCGCCT GAATGGCGGG
GAAGATTTCT GA
 
Protein sequence
MSRPDIPVTI SGDPKGFESA LVRVRALSKS TATDVVASFG RIKNLVAGGA GLVTGLVSAA 
SVTALRDAAG AIASIGDEAR RAGLDVKSFQ ELKFVAEQNR VGVDALTDGI KELNLRADEF
IVTGGGSAAE AFQRLGYSAE DLKQKLEDPA DLFTEIIGRL GELDKAAQIR IMDEIFGGAG
GEQFVQLIEA GEAGIRDTIR AANDLGIVVD EQMIQKAADV DRKFNMLATT VGTKLKSAIV
SAADSLAEFI DGFRDFQNQM NSTLQGRQAK IGERQLEIEN EILKKKEAQA RQDEKLSDVA
RKLGFENSKN ANLAGYTGQI EALKEESRKL AEEEAKIVNI LSDRLQPMNR PAERTWTPID
TEEKGGGRSK KVSEAGKEKK AIDDVIASLR EELAIIGLTD IERERTIALR EAGVEATSKE
GQQISALIDE KYHQLAAEEA LAEQYERSEE AAERMGQVLD DQLMRIVDGS FDAKEAIAAL
LTEIINVQTN GKGLFGSLFS SIFGGGSGFG SNFVPTTTLG DFLGYGGARA GGGDVSPGRI
YRVNEYEDEF FAPTSHGRII APSKLSGAAA DREGGGGRTV VEIVLSKDLL ASILEQTGNQ
TVRIVRSNEE ARTNYRLNGG EDF