Gene Smed_5642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5642 
Symbol 
ID5319944 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp607955 
End bp609262 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content59% 
IMG OID640777378 
Producthypothetical protein 
Protein accessionYP_001314310 
Protein GI150377715 
COG category[S] Function unknown 
COG ID[COG5361] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACCCG ACCAGAGCAG GGAGGAGATC GCATACGCGG TAGGTCTGCA GGCCTATCTC 
TGGGGCTTCC CGCTGTACTA CTACAGCAGA AGCACTCCGA AGAGCGTCGA GGTCGGCGGA
ACCTTCATCA ATGATTTCCG CAAGTATTCG GAGTTGAAGA CTGCCAAAGA CAGGTTCGTC
GTCACGCCCA ACAACGTGAC CATCGATGCC TACGCAACGC TCGACCTCAC GGTCGAGCCG
GTCGTCATCT TCGTTCCCTC TCTTTCGCAA CCGCGCTGGT ATATCGTTCA GATCGGGGAT
TCCTTCGACG AGATCGTCAG AAACATCGGC GGAACCAAGG GTGCGGAACC CGGCGTCTAC
ATTGTCACCG GACCGGATTT CAGCGGGGAC GTTCCGGGGG ACATGATCCA GGTGAAGAGC
CGTACCAAGA TCGGCGTGGC CGCCGTCCGG ATTCTGGCGA ACGGGGAAGC AGACCTTCCA
AATGCTGTCG AGGCCCAGAA GGGTTTCCAC CTTATGCCCC TATCCGCCTA TCTGCGAGAC
GGGCTAGCAC ACAAGGCGGC CGATCCACGT CCGCAGATGA GGCTTTTCGA AAGCGATGCC
CCCGAGGGGA TCAGGTATTT CGACGAGCTC GGCGACGCGA TGACGAAACG TCTTCCCGCG
TCCGCCGACT CGCAGGATTT CCTCGTCTCA TCGTTCAAGC AGATCGGTTT GAGCGTCGGC
GGAGGCTTTC AGTGGAAATC GCTCGACGAG TCGACAAAGA AAGGTCTGGA ACGAGCGATC
AAGACGGGAG AGCAGATCGT CGACAGCAAA TGGGCGGCGA CCGGGGAAAT CACCAACGGC
TGGAAATACA CCTTCGCTGG CGGCAGGGCG GGATACGATC CCGGCCTTCG CGCGGCGCTC
GCCAAATACG AGGTCGGAGC CCAGCTTTCC GATCATGTCA TCTATCCCAA CACCAGCGTC
GACGACAAGG GCGAGCCCCT CAACGGCTCG AAGAGGTACG TCCTGCACTT TGATGCCGGA
AAACTTCCGC CTGTCTCCGT ATTCTGGAAC ATGGCGATGT ATGGTTCCGA CATGCTGTTC
GTCGAGAACG AGTTCAAGCG TTACAGCATT GGCAGCACGA CGGACGGGTT GAACAAGGAC
GCTGACGGCT CGCTGACGAT ACTCATTCAG AAGAACAAAC CAGCAGACAC TGCCAATTGG
CTGCCCGCTC CCGAGGGCGA CTTCAATTTG ACCATGCGCT TCTACGGTCC TGAGACGACG
GTTCTGGATG GCTCCTATCG GCTGCCGGCT GTCCGGAGCG TCGAATGA
 
Protein sequence
MSPDQSREEI AYAVGLQAYL WGFPLYYYSR STPKSVEVGG TFINDFRKYS ELKTAKDRFV 
VTPNNVTIDA YATLDLTVEP VVIFVPSLSQ PRWYIVQIGD SFDEIVRNIG GTKGAEPGVY
IVTGPDFSGD VPGDMIQVKS RTKIGVAAVR ILANGEADLP NAVEAQKGFH LMPLSAYLRD
GLAHKAADPR PQMRLFESDA PEGIRYFDEL GDAMTKRLPA SADSQDFLVS SFKQIGLSVG
GGFQWKSLDE STKKGLERAI KTGEQIVDSK WAATGEITNG WKYTFAGGRA GYDPGLRAAL
AKYEVGAQLS DHVIYPNTSV DDKGEPLNGS KRYVLHFDAG KLPPVSVFWN MAMYGSDMLF
VENEFKRYSI GSTTDGLNKD ADGSLTILIQ KNKPADTANW LPAPEGDFNL TMRFYGPETT
VLDGSYRLPA VRSVE