Gene Smed_2143 details


Gene Information

Locus tag: Smed_2143
Symbol:
ID: 5323003
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 2211507
End bp: 2213453
Gene Length: 1947 bp
Protein Length: 648 aa
Translation table: 11
GC content: 59%
IMG OID: 640791081
Product: hypothetical protein
Protein accession: YP_001327811
Protein GI: 150397344
COG category: [S] Function unknown
COG ID: [COG3533] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
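
The start and end coordinates, gene length, and protein length in this record are mutually consistent, which a short sketch can confirm. It assumes 1-based inclusive coordinates and that the CDS length includes one terminal stop codon; the record states neither explicitly, and the function name `cds_lengths` is illustrative only.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS ends
    with a single stop codon that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # subtract the stop codon
    return gene_len, protein_len


# Coordinates taken from the record above.
gene_len, protein_len = cds_lengths(2211507, 2213453)
print(gene_len, protein_len)  # 1947 648
```

Under these assumptions the result matches the listed Gene Length (1947 bp) and Protein Length (648 aa).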


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.56985
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCTTGC CCCAAAACAA GGCCGCCGAA GTCAAGCCAC GCGCCTTCGA GCGTTTTGTG 
CCTGTCGACC ACACTCGCGT AGCCTTTGAT GGCGGGTTCT GGCAGAGTTG GTCGGAAACC
GTCCGAAGCG TCACTATACC CACTCAGCAC AGGCGCCTCG AAGAGGAGGG CTTCCTCGAA
GTCCTGGATT TCGAAAAGCC GCCGTCACCA CTCGTCCGCC CTATCCAGCC CAGCGGACTG
TCGATGCAAC ATTTCTTCGA CTCTGACTTC GGCAAATGGA TTGAGGCTGC GAGCTATACG
CTCAAGAACC ATCCTGATCC GGACATTGAA GCCAAGATCG ATGCGATCGT GGAAAGGCTG
GAGCACGGAC AGATGCCAGA TGGCTATCTG AACAGCTGGT TCATCCGGCG CGAACCGGAC
AAGCGCTGGA CCAACCTGCG CGACCTGCAT GAAATGTATT CGATGGGCCA TCTGATCGAG
GGAGCCGTGG CCTATTTCGA GGCTACCGGA AAACGGCGGT TCCTGGACGT GATGATCCGT
GCCGTCGATC ACATCATCGA CACTTTCGGG ACGGAGCCGG GCAAACTGCG CGGCTACGAT
GCCCATGAGG AAGTCGAACT GGCGCTTGTG AAGCTTTATC GCTTAACCGG CGACCCCAGG
CACCTGAAAC TCGCTACCTA TTTCGTCGAC GAGCGCGGCC GAATGCCGTC CTACTTCGAC
GAGGAAACGC GCCGGCGGGG GGAGAATCCG GCCGATTATG TCTACGGGAC CTATGCCTAC
AGTCAGGCGC ACATGCCCGT CCGCAATCAG ACGCAAGTCG TTGGCCATGC CGTGCGAGCT
ATGTATCTCT TCTCAGCGAT GGCGGACCTA GCCTATGAAA ATGACGATCC TAGCCTAAAG
CACGCCTGCG ACCGCCTGTT CGACAATCTG ATAGGCCGTC AGCTTTACAT AACCGGAGGT
CTCGGGCCAT CCGCATCCAA CGAAGGCTTC ACGCGCGAAT ATGATCTGCC GAACACGACG
GCCTATGCGG AGACATGTGC CGCGGTCGCG CTTGGTCTGT GGAGCCATCG CATGGCGCAG
CTTGACCTGG ACAGCAAGTT CACCGACGCC CTGGAAACAA TTCTATTCAA CGGCGCGCTT
TCTGGAATTT CGCGAGACGG TGAGCACTAT TTCTACGAGA ACGTGCTCGA AAGCCACGGC
CAGCATCGCC GCTGGAAATG GCATTACTGC CCATGCTGCC CGACGAACAT CGCCCGCTTC
ATAACGTCGC TGGGCCAGTA CTTCTATTCT GCAAAGCGGG ACGAAATCGC TGTCCACCTC
TACGGTGCCA ACACAGCCGA GCTGGAAATC CAGGGCCAAT TCGTGCGACT TCGGCAAGAA
ACCAGCTATC CGTGGGATAA GGATGTTCTT CTCGCCCTTG GTCTGGTTGC GCCGACCCGG
CTCACCTTCA GGCTGCGAAT CCCTGGCTGG TGCCGTAACG CCCGGTTGTG GGTAAACGGA
GAGCAAATGG ACCTCGGCGC ATCGCTTGAA AAGGGCTATG CGGTCGTGAA CCGCGAATGG
GTCGACGGGG ATGAAATCCG TCTGACTTTC GAGATGCCAG TGGAGCGCCT CTACGCCCAT
CCAGCAGTAG GGGAGGACGC GCAGCGTGTC GCTCTTAAGC GCGGTCCGGT CGTCTATTGC
GTCGAGGAGA CGGACATTGG CACGGAACCC CAGCGCCTGA GAATCTCAGC GGACACCAAC
CTCACCCCGC GCTTCGACGA AACCCTGCTT GGCGGTGCCG TCGTGCTTGA GGGAGAAGCA
TTGGAAGCCG ATGCCGAGGA TTGGGGGCCA ACGCTCTATT GCAACAGGCC ACCTTCCTTG
AAGGGAAGAA CGTTCAAGGC GATACCCTAT CACCTCTGGG CCAATCGTGA CGAGGGCGCA
ATGCAGGTCT GGCTGACGGA GAAGTAG
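
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs cleaning before any computation. The following is a minimal sketch of how the blocks could be joined and a GC fraction and basic CDS sanity check computed. The helper names are hypothetical, and the check accepts only an ATG start even though translation table 11 also permits alternative start codons.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Simplified check: whole codons, ATG start, standard stop codon at the end.

    Assumption: only ATG is accepted as a start codon here.
    """
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in {"TAA", "TAG", "TGA"}
    )


# First two blocks of the gene sequence, used only as a small demonstration.
fragment = clean_sequence("ATGTCCTTGC CCCAAAACAA")
print(len(fragment), gc_fraction(fragment))
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` would give a value to compare against the GC content listed in the record.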
 
Protein sequence
MSLPQNKAAE VKPRAFERFV PVDHTRVAFD GGFWQSWSET VRSVTIPTQH RRLEEEGFLE 
VLDFEKPPSP LVRPIQPSGL SMQHFFDSDF GKWIEAASYT LKNHPDPDIE AKIDAIVERL
EHGQMPDGYL NSWFIRREPD KRWTNLRDLH EMYSMGHLIE GAVAYFEATG KRRFLDVMIR
AVDHIIDTFG TEPGKLRGYD AHEEVELALV KLYRLTGDPR HLKLATYFVD ERGRMPSYFD
EETRRRGENP ADYVYGTYAY SQAHMPVRNQ TQVVGHAVRA MYLFSAMADL AYENDDPSLK
HACDRLFDNL IGRQLYITGG LGPSASNEGF TREYDLPNTT AYAETCAAVA LGLWSHRMAQ
LDLDSKFTDA LETILFNGAL SGISRDGEHY FYENVLESHG QHRRWKWHYC PCCPTNIARF
ITSLGQYFYS AKRDEIAVHL YGANTAELEI QGQFVRLRQE TSYPWDKDVL LALGLVAPTR
LTFRLRIPGW CRNARLWVNG EQMDLGASLE KGYAVVNREW VDGDEIRLTF EMPVERLYAH
PAVGEDAQRV ALKRGPVVYC VEETDIGTEP QRLRISADTN LTPRFDETLL GGAVVLEGEA
LEADAEDWGP TLYCNRPPSL KGRTFKAIPY HLWANRDEGA MQVWLTEK