Gene Smed_3761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3761 
Symbol 
ID5319053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp204582 
End bp205883 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content64% 
IMG OID640775574 
Producthypothetical protein 
Protein accessionYP_001312507 
Protein GI150375911 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0660829 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCTCG GCGCATCCCA ACGTTTGAAT GACGGCGTTG CGGCCGTGGA AACCATGACG 
CGTTTCGCGC TTGCAGTATT GGCGCTTGCC TCGGGCGTCT ACACCTATCT CGGCGTCCGC
AGTATCCTTG ATGGTTCGCC GACGGCAGTG TTCTTCGCAG CGATCATCTA CTCGGCCTCG
GTCTCCGTCG GCATCTACGC CTTCTGGTCC TACATGGCCC GCTTCTATCC GCATGTGACC
ACCCATGCCG GCCGGGTCGC CATGCTGGGC GTGATGGCCC TCGGTGCTGC CATGATCATC
GCCATGTCCA GCTGGCTCAA CGCGGCGGCG CTGGCGGGGT CGGCCGCGCT CGAACAGCAC
CTCGCGGAGA CGGTGGAAGA CTATACTGCC GACCTCGACC AGGCGCATCA GAACGCACTT
GCAGCCCAGA GCCTGCTGCC CGATATCCAA CGCGCATCAG AACGCTTTGC TCAGCTTGCC
GCCTCGGAGC GGCAATCGGG CGCGCTCACC GGCACCACGG GTTCGGGAAG CGTGGTGCAG
CTTCTGTCGC AGATGTCGGC CCAGATGAAG GATCTGGAAA ACGGCATCAA TGCTTCTCGG
GAACAGGTCG CGATGCTGTT CAATCAGGGA CAGGAGCGGC TCGAGACCAT GCGGACGCTG
GTATCCGCGC CTGGTGCGGT CACGCCACGT GCCGATCAGT TCTCCTCCGA AGTGGTGGCG
CTTACGGGGG TGATAACATC GCTCGGACAG ACTTCGATCG CGCCTTCGAT CCGCCGCGCC
GCAGACGACC TGTCCCTCGG CTTCATCGCG CCGGTGGCCG ATGGCGGCGA TGCCGATCTC
GTCACCCGCC AGGACCAGGT GATGGAGACG GTGCGGGCTT CGGTGGCAGC GCAGTCCAAG
GTTCTCTCGG ATGCGGCAGA CGAAATACTG GGTCGGATGC CGGTGGCGGA GCGACGCTTC
GTTCCGCTTT CATCCGCCGA GGCGGTACTG CGCTACGCGG CCGATTTTAT TCCCGCCTGG
GCCGGTGCCA TTTCCATTGA CCTGCTGCCG GGCGTACTGG TCTTCATCCT CGCGACCGTG
CACGGGGCGA TCCGCAGGCA GGAGGAGAAA CTGCCCTTTG CCGAGCGCAT CACGGCCGCC
GAGCTTCTGC AGGCCCTGGA GGTCCAGCGC GCGGTGATGG CGAATGGGGG CCAGAACGGC
GAGGCAGGCG ATTCGGTGGA AATGGAGGGC GATGAGCCGA ACAACATCAC CAGCCTCGAC
CCGAGGGTGC GCGTAAAGGA CCGGTCGCAT GAGGATCGAT GA
 
Protein sequence
MALGASQRLN DGVAAVETMT RFALAVLALA SGVYTYLGVR SILDGSPTAV FFAAIIYSAS 
VSVGIYAFWS YMARFYPHVT THAGRVAMLG VMALGAAMII AMSSWLNAAA LAGSAALEQH
LAETVEDYTA DLDQAHQNAL AAQSLLPDIQ RASERFAQLA ASERQSGALT GTTGSGSVVQ
LLSQMSAQMK DLENGINASR EQVAMLFNQG QERLETMRTL VSAPGAVTPR ADQFSSEVVA
LTGVITSLGQ TSIAPSIRRA ADDLSLGFIA PVADGGDADL VTRQDQVMET VRASVAAQSK
VLSDAADEIL GRMPVAERRF VPLSSAEAVL RYAADFIPAW AGAISIDLLP GVLVFILATV
HGAIRRQEEK LPFAERITAA ELLQALEVQR AVMANGGQNG EAGDSVEMEG DEPNNITSLD
PRVRVKDRSH EDR