Gene Smed_6520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_6520 
Symbol 
ID5320823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009622 
Strand
Start bp209954 
End bp211174 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content56% 
IMG OID640778069 
Productresponse regulator receiver protein 
Protein accessionYP_001315001 
Protein GI150378407 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0775] Nucleoside phosphorylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.143497 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.120873 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAAGTCT TGATAGTCGA GGACGACAAT CAAAAGTACA ATCGCGTTCA TGCCGTGCTC 
GAGCAGGCCG GCGTTGCCGG CAGCGACATC ACCCATGTGA TCGCCGCGGC TCCGGCATAT
GAACTTTTGC GCCAGACCCT GTTCGATCTG ATGCTTCTGG ATGTCAACAT CCCTCGTCGG
CTCGGCGATA GGAAGCCACA ACGCGGCGGC GGGCTGGAAT TACTGAAGGA CCTTGGACGC
GAGAGCGACC TTCGACGGCC GACATACATC GTGGGTTTGA CAGCTTACGA AGACGTCGTT
GCCGAATTCG GTTCCGCCTT CGAGGATCAG CTTTGGTCGC TTGTGCACTA CAAGGAGTCC
TCCGACCAGT GGATCGCACA GCTGCTGGTG AAGGTGAATT ATATCCAGGC GGCCAACCGA
TCGCGCAACT TTAGTGACGG CGAGACATAT GGCTGCGATC TGGCCATCAT CACGGCCTTA
GATACTGTCG AATTCGACGC AGTTCAGTCG CTCCCGTTAA GCTGGGAGCC TCTTCGCCTT
CAACACGATG AGACTAGGTA CCTTGCTGGC ACGCTCGCGA CATCGAGCGG TACAAAGAGC
GTCATCGCGG CGGCGGCCCC GAGAATGGGC ATTCCCGCCT CCGGAATCCT GAGCTCGAAG
ATCATTCACC AATTCCGCCC CCGTTTCATC GCGATGGTCG GAATCTGCGC TGGTCGCAAG
GATAAGGTGA GCTTGGGCGA CCTGATCGTC GCGGAACCGA CATGGGACTG GGGAAGTGGC
AAGATCAGCT CCGAAGAAGG TGAGCCTAAA TTTATGCCTT CTCCGCACCA ACTGGACATC
GATCCGGACA CTACGTCTCT GTTGAAAGCC ATGACGAAAG ACGCGGTGCT TTTGGCCGGC
ATCAAAAAAG CCTCCCGGGG AACCAAGCCC AAGACTGAAT TGTCAGCACA CATGGGGCCT
TTGGTTTCGG GAGCTGCTGT CGTGGCACAT AAGCCGACAT TCGATCAGCT GCTCGATCAG
CATCGCGGTA TCTTAGGAGT CGATATGGAG GCGTATGCCG TCGCCGCCGC TGCGATGGGC
AGCGCCAAAC CGCGTCCAAA ATTTCTCATA GTCAAAGGCG TCAGTGACTT TGCTGACGAA
CACAAGGACG ACGATTACCA GGAATTTGCA GCGTCGGTAA GCGCTAATTT CCTCTTAGTC
GCGGCCAAAG AGTTTCTTTA G
 
Protein sequence
MKVLIVEDDN QKYNRVHAVL EQAGVAGSDI THVIAAAPAY ELLRQTLFDL MLLDVNIPRR 
LGDRKPQRGG GLELLKDLGR ESDLRRPTYI VGLTAYEDVV AEFGSAFEDQ LWSLVHYKES
SDQWIAQLLV KVNYIQAANR SRNFSDGETY GCDLAIITAL DTVEFDAVQS LPLSWEPLRL
QHDETRYLAG TLATSSGTKS VIAAAAPRMG IPASGILSSK IIHQFRPRFI AMVGICAGRK
DKVSLGDLIV AEPTWDWGSG KISSEEGEPK FMPSPHQLDI DPDTTSLLKA MTKDAVLLAG
IKKASRGTKP KTELSAHMGP LVSGAAVVAH KPTFDQLLDQ HRGILGVDME AYAVAAAAMG
SAKPRPKFLI VKGVSDFADE HKDDDYQEFA ASVSANFLLV AAKEFL