Gene Smed_5214 details

Gene Information

Locus tag: Smed_5214
Symbol:
ID: 5319516
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 176235
End bp: 177614
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 58%
IMG OID: 640776992
Product: hypothetical protein
Protein accession: YP_001313924
Protein GI: 150377329
COG category:
COG ID:
TIGRFAM ID:
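
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 176235
end_bp = 177614

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1380
print(protein_length)  # 459
```

Both results match the Gene Length (1380 bp) and Protein Length (459 aa) fields.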


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAAACC CGGTCGCATT GAACCGCATC GCCGAAAACA AGGTCTTTCG TAAGGGTGAT 
GTTTTCGTCC TCTTCGGCGA ACTATTCGGC CGCGGATATG CCACTGGCTT GCTCGATGAA
GCCCGGCGGG TTGGAATGGA GATCGTGGGC ATCACAGTCG GGCGGCGCGA CGAAAACAAC
GCGCTTCGGC CTCTGGACGC CGAGGAACTC TCTGCCGCCG AGGCCCGATT GGGCGGCAGA
ATCATCAACA TACCTCTCAT GGCTGGCTTC GATCTCGACG CCCCGGCAGG AGGCCCGACT
CCGACGGATC TCCTCGCATC CATGACGCTG GAGAGCTGGG AACATGACAC GCTCGATTGG
GACTACATCA AACAGTGCCG GGACATCGCT ACCGCACGGT TCACGAATTC GCTTTCGCAG
GTCATGGCCG TTCTCGACGG AATGATCGCC GACGGCCGCA ATGTCTTCTT CGCCCATACA
ATGGCCGGAG GCATTCCGAA AGCCAAGGTG TTCCTGGTTG TCGCCAATCG TATCTACAAG
GGACGTGGTC CCCGCCACAT GTCTTCGCAA GCGCTGCTCG ACAGCGACAT GGGCAAGCTT
ATCCTGCAGA ATTTCGACGA AGTCTCCGCA ATCACCTTTC GACATCTTAT CGACTTCAGC
GCGGCGATCC GTGAGCGCGT AGAGGCTTCG GGTGCTCAGG TCCGGTACAC GGCCTACGGT
TATCACGGAA CTGCAGTCCT GATTGACGGG CGCTATCGTT GGCAGACCTA CACCAACTAC
ACCCAGGGTT ACGCCAAGAT GCTGCTAGAA TCCATCGCGC AGGAAGCCTG GACAGCGGGC
GTCAAGGCAA CCGTCTATAA CTGCCCCGAG ATCCGGACCA ATTCGTCCGA CGTATTTACG
GGTATCGAGC TTCCCCTGAT ACCGTTGCTG CTTGCCCTGA GGAAAGAGAA CGGTGGGCAA
TGGGCAGAGG ACCAGTGGCA GGCATGCCAA CAGCTTCTGG CAGACGGCTT CACGATGAAG
GACGTTTTCC AGAAGATTAC CGAGATGCAG GCCAACGAGG TCATGCGTCC GTTTTACGAC
TTCTCGGCAT GGCCCATGGC AAACAGCCAG GCACAGTCCG ATCTGACCAT CGGCACGTCC
AACGAGATCA CGCAGATGCA TCGGGACAGC AAGGCCATGA TCAGCGACCT CTTGAGTGCT
CTCGTGGTGG AAGCGACCGG GCAGCTAATT TTTGGCGCGT CCTCAGACCC CTCCGGCCAC
ATCCAATGGC TCAACCACGA TATCGTGGCG CGGCGACTTA ACGCTTCCCA CCTGCAGTGG
AAGTCTGGCA CGCCAATGGT CGAACAAGGG GCGAAAGACC CTCATCTCGA GATCGCCTGA
 
Protein sequence
MENPVALNRI AENKVFRKGD VFVLFGELFG RGYATGLLDE ARRVGMEIVG ITVGRRDENN 
ALRPLDAEEL SAAEARLGGR IINIPLMAGF DLDAPAGGPT PTDLLASMTL ESWEHDTLDW
DYIKQCRDIA TARFTNSLSQ VMAVLDGMIA DGRNVFFAHT MAGGIPKAKV FLVVANRIYK
GRGPRHMSSQ ALLDSDMGKL ILQNFDEVSA ITFRHLIDFS AAIRERVEAS GAQVRYTAYG
YHGTAVLIDG RYRWQTYTNY TQGYAKMLLE SIAQEAWTAG VKATVYNCPE IRTNSSDVFT
GIELPLIPLL LALRKENGGQ WAEDQWQACQ QLLADGFTMK DVFQKITEMQ ANEVMRPFYD
FSAWPMANSQ AQSDLTIGTS NEITQMHRDS KAMISDLLSA LVVEATGQLI FGASSDPSGH
IQWLNHDIVA RRLNASHLQW KSGTPMVEQG AKDPHLEIA
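
The relationship between the two sequence blocks can be illustrated with a minimal translation sketch. It assumes the standard codon assignments, which translation table 11 shares (table 11 differs from the standard code only in which start codons it permits). The helper names `clean`, `translate` and `gc_fraction` are hypothetical, and only the first line and the last seven codons of the gene sequence are used as input here.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, and its last seven codons (in frame).
first_line = "ATGGAAAACC CGGTCGCATT GAACCGCATC GCCGAAAACA AGGTCTTTCG TAAGGGTGAT"
last_codons = "CCTCATCTCGAGATCGCCTGA"

print(translate(first_line))   # MENPVALNRIAENKVFRKGD
print(translate(last_codons))  # PHLEIA
```

The two outputs match the first 20 and last 6 residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field in the record.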