Gene Smed_4906 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4906 
Symbol 
ID5317883 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1414790 
End bp1416673 
Gene Length1884 bp 
Protein Length627 aa 
Translation table11 
GC content61% 
IMG OID640776690 
Producthypothetical protein 
Protein accessionYP_001313622 
Protein GI150377026 
COG category[S] Function unknown 
COG ID[COG3930] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR02421] conserved hypothetical protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.699143 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTAG CGCGCGCATC GATCCAGGCC GAAGCGAACC ATCCCGATTG GCTGGCGGAG 
GCGCTTGCGG CGATCCATGC AGGGAAAGCC GTACGCCAGG AATTCGCAGG CGGCGGGCGC
CTTCACATCG ATAGGGCCCT GCCCTTTCTC TGCGTGCACC TTGCCGGCGA TGCCGCACCT
GTTGCCCGCG AAATCGTCCG TGCAAACGCC TCCCACCTGC TCGCGCCGGA TGCTGCCACG
GCGAAGCCCC TCGTCACAGC CATTGGAGAA TTGCTCGAGC GTCGCTTCGG CGCCTTCATG
GTTCTCGATA TCGGCGAACT GGCAAAAGAC GAGCTCCTGA CCGACAAGTC TCCTTTCCTG
CCGCCCTTCA AGGTCGATGT TATGGCGGAC CGCCGGCGAA ATATCCAACC CGCAGCAAAG
GCCTTTATCG ACGCCGTCGA GGCGACCCTG GCGCAGTTCC GCACGCCGCG TGTCGAGCGT
CACGAATGGC AGGCGTATCG GGAAGAAGAA GCACTCGATC TGCCATTCCC TTCGATGCGG
GTTCGTTTCG CGCCTATCTA TCGGCAGCCT GACTCCGGTC AGGTTTTCCC GGAACTACGC
GAGCGGCTGA TCGCCACCAT TTTCGACGCC GCCCTGCACG CCTTTGCTGC CTTCGCGAAG
GCGACGGGCA GCCTCGAATT TCCCTCTCAC CGCGCGCTCG GCAGAAGAGC TTTCATCGAT
GCCGTGGAAC GAACGGACCG CAGCATCGAT GAGGTCGCCT CGAGTTTCGA TTTTCTCCTC
GCCGTCACGC CCATAAACGC GGAAGCCGCC TGGAGCGAGT TTGCGGCTGG CGGCTTCGCG
CGTTCGCCGC GATTCCTCTA CCGGCCGTTG GTCATCGAAA TCGAGAACGC GAAGAAGATG
CTTTTCTCGA TCCCCTTCGA TCATCTCGAA GACCCGGTGC TTTACCAACT CTATCGTGAA
AAGCAGCAGG AGCTGGATCT GCAGCTATCG ATGCTCTCCG CCCGCGAGAC GAAGAAATTC
GTCGAATATA GTCGCGCCCT GTACGGCCCG GTCGAGCCGA ATCTGCTGCG AGCAGCCGAA
AACATCCTGG ACCAAAGCCG AAATCTCGCC GGTTACCCGG CGGCACCCGC AGCCGAACTG
CGCGCCGACA GCCATGTCAT CGAGAAGAGG GCGCGCCAGA TGATCGACGC CTATGCACAG
CGCTATGCCG GATTCGAAGC GCTTGTGGAG ATACGCGACG ATCTACCCGC CGGCTTGCTG
GTTTCAGGTA AACGACTGTT CATCGCGCGA AGCACCAGCA TGGCGCCCGA TCGCGTCGAA
CCGATCCTCA GCCACGAGAT TGGCGTTCAC CTCCTGACCT ACTTTAACGG CTCGGCGCAG
GGCCTTAGAC TGTTTCGCTC GGGTCTCGCC GGATATGAAG GGATGCAGGA GGGCCTTGCC
GTCTTTTCGG AGTATCTTTC CGGGGGGATG ACGGGCGAGC GATTGCGTCT CCTTGCCGCC
CGTGTGACCG CCTGCGCCGC AATGCTTGAG GGCGCGTCCC TGCCGGAAGT CTATCATCGG
CTGGTAGATG ATCACGGCTT CCGAGAAGCA GACGCTTTCA ACGTTGTCGT GCGCATCTAT
CGCGGCGGAG GTCTCGCCAA GGATGCGATC TACCTGCGAG GGCTTCTGCA ACTGCTCGAT
CACCTGGCAG GTGGAGGCGC ACTCGAGCCC TTCTGGATGG GCAAGATCGC CGCCTCTCAT
TTCGATGTCA TGCAGGAACT GCACGCCCGT GGGCTGCTCC GCGGCCCGTC GGTCCTCCCG
CTTTTTCTCG ATAGTCCAGA AGCGTCGTCG CGGCTTGCCA GAACGTGCGA GCGGATGGCT
CCACTCGATT TGCTGCAGCA ATAG
 
Protein sequence
MKLARASIQA EANHPDWLAE ALAAIHAGKA VRQEFAGGGR LHIDRALPFL CVHLAGDAAP 
VAREIVRANA SHLLAPDAAT AKPLVTAIGE LLERRFGAFM VLDIGELAKD ELLTDKSPFL
PPFKVDVMAD RRRNIQPAAK AFIDAVEATL AQFRTPRVER HEWQAYREEE ALDLPFPSMR
VRFAPIYRQP DSGQVFPELR ERLIATIFDA ALHAFAAFAK ATGSLEFPSH RALGRRAFID
AVERTDRSID EVASSFDFLL AVTPINAEAA WSEFAAGGFA RSPRFLYRPL VIEIENAKKM
LFSIPFDHLE DPVLYQLYRE KQQELDLQLS MLSARETKKF VEYSRALYGP VEPNLLRAAE
NILDQSRNLA GYPAAPAAEL RADSHVIEKR ARQMIDAYAQ RYAGFEALVE IRDDLPAGLL
VSGKRLFIAR STSMAPDRVE PILSHEIGVH LLTYFNGSAQ GLRLFRSGLA GYEGMQEGLA
VFSEYLSGGM TGERLRLLAA RVTACAAMLE GASLPEVYHR LVDDHGFREA DAFNVVVRIY
RGGGLAKDAI YLRGLLQLLD HLAGGGALEP FWMGKIAASH FDVMQELHAR GLLRGPSVLP
LFLDSPEASS RLARTCERMA PLDLLQQ