Gene Smed_4517 details

Gene Information

Locus tag: Smed_4517
Symbol:
ID: 5318080
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1001730
End bp: 1003130
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 64%
IMG OID: 640776318
Product: hypothetical protein
Protein accession: YP_001313250
Protein GI: 150376654
COG category: [S] Function unknown
COG ID: [COG4529] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
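
The coordinate and length fields above agree with each other, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1001730, 1003130)
print(length_bp)                  # 1401
print(protein_length(length_bp))  # 466
```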


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.198903
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGTTC GTGGATTGTT TGCCCGCAGG TCAGCAATGG CCGCCTTGTC GAAGCCGGAA 
ATCGCGATCG TTGGCCGCGG TTTTTCGGGC ATCATGATGG CGATCGCGTT GATGAAATCG
ATGCGGGTAT CTTTCCACCT CACCATGTAT GATCCGCATC CATCGATCAG CGGGGGCCAG
GTATTTTCCG CCGCTCAGCG CTGCGAGATA CTCAACAGCC GCGTCCGCGA TCTTTCGGTC
GCGGCCGGCC AGCCGGATGA TTTCAACGAT TGGCTCTGCG CCAATGATGA GTTCCGAACG
GCGGTTCCGG CGGCCATTCC CGGCTTCCGG CAGATTTTCG TGCCGAAAGG CATCTTCAGC
GATTACGTGC ACCAGCGCTT CTCCGAGGCG CTTGCCCGGC GCTCCGACAT CACGGTGAGG
TTCTCACACG AGCCGGTCAC AGGCCTGCGA AAGCTCGCAA GTGGTCGCTT CAGCCTTGTG
CGGCACGGTG GCAACGACGA GAGCGACATT GTCATCCTCG CTACGGGCTT CGGCATGCGC
CCGCGGGACC TCGAGGTTTC CGAGGAGGAG CGGCCGCTTG TGCGCACTCG GCGCCTCGTC
GATCCACGCC ACGCGGTGCT GCTCGGTAGC GGGATACGTG TGGTCGACCA GTTGTTCCAG
ATGCGCGACA ACGGCTATGC GGGAAAGGTC ACGCTCATTT CGCGGCACGG TTTCCTGCCG
CAGGCGCACA CGCAGCGCGC GGCATCGCCG AGCTTTCCCG TCGATCCGCT GCCGCAAGGC
CTGGGCCGTA TCGTGCGCTT CGTGCGTCAG GCCTGCGCCG AGGCCGAAGC GAACGGGCAG
GGATGGCAAT CTGCGATGAA CGGCCTCCGC CGCCGCGCTC GCTCTCTCTG GCAATCGCTC
TCCGCACAGG AGAAGCGGCA GTTCAACCGT CACCTGCGTG CAATTTACGA CAGCCACAGG
AACCGCCTGC CAGCGGCCGT TCACGCGCGG CTGCAGCAGG AACTTGGCGA GGGTCGGACC
GTGCTTCGCC GCGGCCGGGC GGGTCGACGC CTGCCCGAAG GTATCCTCGT GCGATGGGCC
GGCCAGGATA CCGAGGAACT GCTGAGGGCT GATCAGGTGA TCGAATGTCG CTGTTCAGCT
CCGGACCTCG GAACGCCGTT GCTTCGGAGC CTTATTGCGG GCGGGCTTGC CCAACCAGAC
GAACTCGAGC TCGGCATTGC TGTCGCCCCG ACGGGCGAAG TCTTGAGCTC GAGCGGACAC
ACGCCGAACC TCTTCGCCAT CGGTCCGTTG GGATTGGGAA GCCTTCCCGA CATCGACCTC
GTACCGGAAA TCGTCACGCA GACCTATGCG GCATCACGGC TGATAGCGAC AGGAAAGCGC
ATGACGCTGA AAGCTGGATA G
 
Protein sequence
MTVRGLFARR SAMAALSKPE IAIVGRGFSG IMMAIALMKS MRVSFHLTMY DPHPSISGGQ 
VFSAAQRCEI LNSRVRDLSV AAGQPDDFND WLCANDEFRT AVPAAIPGFR QIFVPKGIFS
DYVHQRFSEA LARRSDITVR FSHEPVTGLR KLASGRFSLV RHGGNDESDI VILATGFGMR
PRDLEVSEEE RPLVRTRRLV DPRHAVLLGS GIRVVDQLFQ MRDNGYAGKV TLISRHGFLP
QAHTQRAASP SFPVDPLPQG LGRIVRFVRQ ACAEAEANGQ GWQSAMNGLR RRARSLWQSL
SAQEKRQFNR HLRAIYDSHR NRLPAAVHAR LQQELGEGRT VLRRGRAGRR LPEGILVRWA
GQDTEELLRA DQVIECRCSA PDLGTPLLRS LIAGGLAQPD ELELGIAVAP TGEVLSSSGH
TPNLFAIGPL GLGSLPDIDL VPEIVTQTYA ASRLIATGKR MTLKAG
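
The GC content field could be recomputed from the gene sequence as laid out above. The following is a minimal sketch, assuming the sequence is pasted as text in space-separated 10-base blocks across several lines. The helper name `gc_percent` is hypothetical, and the short input is a toy string rather than the gene itself.

```python
def gc_percent(blocked_sequence: str) -> float:
    """Percent G+C in a nucleotide sequence; whitespace between blocks is ignored."""
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Toy input in the same blocked layout (not the Smed_4517 sequence).
print(gc_percent("ATGC GGCC"))  # 75.0
```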