Gene Smed_4102 details


Gene Information

Locus tag: Smed_4102
Symbol:
ID: 5318991
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 564366
End bp: 565871
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 63%
IMG OID: 640775909
Product: hypothetical protein
Protein accession: YP_001312842
Protein GI: 150376246
COG category: [S] Function unknown
COG ID: [COG3333] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
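
The coordinate and length fields in this table can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the reported gene length) and that the final codon is a stop codon that does not appear in the protein. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields.
start_bp = 564366
end_bp = 565871

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A complete CDS should be a whole number of codons (3 bases each).
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is the stop codon and is not translated.
protein_length = codons - 1

print(gene_length)     # 1506
print(remainder)       # 0
print(protein_length)  # 501
```

Under these assumptions the result agrees with the reported Gene Length (1506 bp) and Protein Length (501 aa).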


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.574512
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACTCT TCAGCAATCT TGCCCTCGGC TTCGCGACCG CCGGCTCACC GGCGAACCTT 
TTCTTCTGCC TGATCGGCGT ACTCCTCGGG ACCCTGATCG GCGTGCTGCC CGGGATCGGA
GCCACGGCGA CCATCGCGAT GCTGCTGCCG ATCACCTTCC AGCTTGAACC GGTCTCATCG
TTGATCATGC TCGCCGGGAT TTATTACGGC GCCCAATATG GCGGCTCGAC GACCGCTATC
CTGATCAACA TGCCCGGGGA GTCGTCTTCC GCCGTGACCG CCATTGACGG CTACCAGATG
GCGCGCAAGG GCCGCGCCGG CGCGGCGCTG GCGATTGCGG CGCTCGGTTC GTTCTTCGCC
GGTACCGTCT CGACATTCCT CGTCGCGCTC TTCGCGCCGC CGCTGACCGA GATCGCGCTC
GAATTCGGCG CGGCTGAATA TTTCTCGCTG ATGATCGTCG GCCTCGTTTC CTCGATCGCG
CTTGCGCACG GATCGGTTGT CAAGGCGCTT GCCATGGTGG CCCTCGGTCT TCTTCTCGGC
CTTGTTGGAA CCGACATCTA CACCGGCACG CCGCGCTTCA CCCTGGGCAT CCGCGAATAT
TCCGACGGAC TCAATTTCGT GGCGCTGGCG GTCGGCGTCT TCGGTATCGC CGAAATCCTC
CGCAATCTGG AAAGCGAGAA AACGCGCGAA GTGCTTATGG CCAAGGTAAC CGACTTGATG
CCGACGCGGG AAGACTTCAG GCAGATGGTC GCACCGGTGC TGCGCGGCAC CGCGATCGGC
TCGGCGCTCG GCGTGCTCCC GGGGGGCGGC GCCATTCTCG CCGCCTTCGC GTCCTACACG
GTGGAAAAGC GCCTCTCCGA CCGTCCGGAG GAATTCGGCC GCGGCGCAGT CGCCGGCGTC
GCCGGACCGG AAAGCGCCAA TAATGCCGGT GCGCAGACCT CGTTCATTCC GCTGCTCACC
CTCGGCATTC CGGCAAACCC CGTCATGGCG CTGATGATCG GTGCAATGAT CATCCAGGGC
ATCGTTCCGG GACCGAACGT CGCGACGGAG CAACCCGCGC TCTTCTGGGG CATCATCGCC
TCGATGTGGA TCGGCAATCT GATGCTGGTG GTCCTCAACC TACCGCTGAT CGGCCTTTGG
GTGAAGCTCC TGACAATCCC TTATTTCGTG CTCTTCCCAA TCATCATGGC CTTCTGTTCG
ATCGGCGTCT ACAGCGTCAA TTCCAACGTT TACGACCTCT ACGCCGTCGC CTTCTTCGGC
CTTGTGGGCT ACCTGCTGCT CAAGCTGCGC TGCGAGCCGG CGCCGCTCCT CCTCGGCTTT
GTTCTCGGAC CGCTGCTCGA GGAAAACCTC AGGCGCGCCA TGATCCTTTC GCGCGGCGAC
GCGTCCACCT TCGTCACCCG GCCGATCAGC GCAACCCTAC TCCTCCTTGC CGCCGCCGTG
CTCGTCATTG TCTTCCTGCC GAGCGTCAAG AAGAAGCGCG AACAGGTCTT CGTCGAGGAG
GATTGA
 
Protein sequence
MELFSNLALG FATAGSPANL FFCLIGVLLG TLIGVLPGIG ATATIAMLLP ITFQLEPVSS 
LIMLAGIYYG AQYGGSTTAI LINMPGESSS AVTAIDGYQM ARKGRAGAAL AIAALGSFFA
GTVSTFLVAL FAPPLTEIAL EFGAAEYFSL MIVGLVSSIA LAHGSVVKAL AMVALGLLLG
LVGTDIYTGT PRFTLGIREY SDGLNFVALA VGVFGIAEIL RNLESEKTRE VLMAKVTDLM
PTREDFRQMV APVLRGTAIG SALGVLPGGG AILAAFASYT VEKRLSDRPE EFGRGAVAGV
AGPESANNAG AQTSFIPLLT LGIPANPVMA LMIGAMIIQG IVPGPNVATE QPALFWGIIA
SMWIGNLMLV VLNLPLIGLW VKLLTIPYFV LFPIIMAFCS IGVYSVNSNV YDLYAVAFFG
LVGYLLLKLR CEPAPLLLGF VLGPLLEENL RRAMILSRGD ASTFVTRPIS ATLLLLAAAV
LVIVFLPSVK KKREQVFVEE D
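
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, translate the DNA, and compute GC content. It is a minimal illustration, not the method used to produce this record. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for ordinary codons (table 11 differs in which codons may act as start codons).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    dna = clean(seq)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 6 bases of the gene sequence listed above.
print(translate("ATGGAACTCT TCAGCAATCT TGCCCTCGGC"))  # MELFSNLALG
print(translate("GATTGA"))                             # D
```

The two excerpts reproduce the first ten residues and the final residue of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives a value that can be compared against the GC content field.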