Gene Smed_3157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3157 
Symbol 
ID5324036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3315466 
End bp3316884 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content63% 
IMG OID640792105 
Producthypothetical protein 
Protein accessionYP_001328816 
Protein GI150398349 
COG category[S] Function unknown 
COG ID[COG5383] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAGAGA ATAGTTTCGT ATCCGCCGAC GATATCCGCT CGGCCTTTTC GGCTGCGATG 
TCGCGCATGT ATCGCGAGGA AGTGCCGGCT TACGGCACGC TGATGGAGCT CGTCGCGAGG
GTCAACGGCG AAACGCTGTC GGCCGACTCC GCATGGAAGG AGCGCCTGGA AGCGACCGAT
TCGCTCGACC GTATTTCCGA GGAGCGTCAC GGCGCGATCC GCCTCGGCAC GCCGGCGGAG
CTTTCGATGA TGCGCCGTGT CTTCGCGGTG ATGGGGATGT ATCCGGTCGG CTATTACGAT
CTGTCGACCG CGGGCGTGCC GGTGCATTCC ACCGCCTTCC GCCCGGTCGG AGACGCGGCG
CTGAAACGCA ATCCGTTTCG CGTCTTCACA TCTCTTCTGA GGCTCGACCT TATTGCCGAC
GAAGCCCTGC GCGCCGAGGC CGAAGCAATT CTTAAGGCGC GCCGGATCTT CACTCCGGGC
GCAATGGAGC TGACCGGACA GGCCGAGCGC GAGGGTGGCC TCGACAAGGC CGGGGCCGAG
CGTTTCGTCG CCGAAGTCAT CGAGACATTC CGCTGGCACG ACAGGGCCAA TGTCAGCGCC
GACATGTACC AGCGTCTCCA CGATGCGCAC CGCCTGATCG CCGACGTGGT CTCCTTCAAG
GGTCCGCACA TCAACCATTT GACGCCGCGC ACGCTCGACA TCGATCAGGT CCAGGCGCTG
ATGCCCGAAT ACGGCATTGC GCCGAAAGCC GTCGTCGAAG GACCGCCGAC GCGTAAATGC
CCGATCCTGC TTCGCCAGAC CTCCTTCAAG GCGCTCGAAG AGCCGGTTTC TTTCCGCGGC
GCCGATGGTG GCTGGAAGGC CGGTTCCCAT ACCGCGCGCT TCGGCGAAAT CGAACAGCGC
GGCATCGCAT TGACGCCGAA GGGCCGCGGT CTCTACGACC GGCTGCTCGA CGAATCGCGC
AAGATCGTGC GCCCGGCCGC CGATGGTTCG AACGCCGGAG AATACGGTGC TGCCCTGGCC
CAGGTATTCG AAGCCTTTCC GGACCATTGG GCGGAGATCC GTGCTGCCGG CCTCGGCTAT
TTCAGCTACT CCCTGACGGA AAAGGGCAGG CAGGCGAAGA TGTCGGTTTG CCTGGCGGAA
AGGGGCCGTC GCGACCTGGA CTCGCTGATC GCTGATGGCC TCGTTCAATT CGATCCTATC
GTCTATGAGG ACTTCCTTCC GGTCAGCGCC GCCGGTATCT TCCAGTCGAA TCTCGGCGAC
GGCGCGCAGC AGGACTTCGT TGCGAGCCCG AACCAGAAGC GCTTCGAGAC GGATCTCGGA
GTCGCGGTCC TCAACGAATT CGATCACTAT GCCGGCATCG AACAGGCATC GATCGAAGAC
TGCCTCCAGG CGCTCACCGC CGCCGTGGCA GCGGAGTGA
 
Protein sequence
MKENSFVSAD DIRSAFSAAM SRMYREEVPA YGTLMELVAR VNGETLSADS AWKERLEATD 
SLDRISEERH GAIRLGTPAE LSMMRRVFAV MGMYPVGYYD LSTAGVPVHS TAFRPVGDAA
LKRNPFRVFT SLLRLDLIAD EALRAEAEAI LKARRIFTPG AMELTGQAER EGGLDKAGAE
RFVAEVIETF RWHDRANVSA DMYQRLHDAH RLIADVVSFK GPHINHLTPR TLDIDQVQAL
MPEYGIAPKA VVEGPPTRKC PILLRQTSFK ALEEPVSFRG ADGGWKAGSH TARFGEIEQR
GIALTPKGRG LYDRLLDESR KIVRPAADGS NAGEYGAALA QVFEAFPDHW AEIRAAGLGY
FSYSLTEKGR QAKMSVCLAE RGRRDLDSLI ADGLVQFDPI VYEDFLPVSA AGIFQSNLGD
GAQQDFVASP NQKRFETDLG VAVLNEFDHY AGIEQASIED CLQALTAAVA AE