Gene Smed_4997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4997 
Symbol 
ID5318718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1511623 
End bp1512624 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content64% 
IMG OID640776779 
ProductKpsF/GutQ family protein 
Protein accessionYP_001313711 
Protein GI150377115 
COG category[M] Cell wall/membrane/envelope biogenesis
[T] Signal transduction mechanisms 
COG ID[COG0794] Predicted sugar phosphate isomerase involved in capsule formation
[COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains 
TIGRFAM ID[TIGR00393] KpsF/GutQ family protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0692534 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGCGA AAGCGGATGG AAATGCCGGT GTCACCGTGC TGGAGTCGAT CGGCAGGACG 
TTGGCGACGG CGACGAACGG CATCAGGGCG CTTGCCGACC ATCTGTCGAG CGACGAAACC
TTCGCGGACG CCCTTGTCAA TGCGGTCGAA CTGATGGGTG ATGGAGATGG CCGCGTCGTC
GTTTCGGGTG TCGGCAAGAG CGGTCACATC GGCCGCAAGA TCGCAGCCAC GCTCGCATCC
ACCGGCACCT CGGCCTATTT CGTCCATCCG ACCGAGGCGA GCCATGGCGA TCTCGGCATG
ATCACCGCGC AGGATGCATT GGTCCTGCTT TCCTGGTCGG GCGAGACGGC GGAACTCGCC
AACATGCTGA CCTATGCCAA GCGTTTCAAG GTGCCGATCA TTTCGATCTG TTCCAACCGC
GAGAGCACGC TTGCGCGCAA CTCCGAAGTC GCGCTCGTGC TGCCGAAGGT GCCGGAGGCT
TGTCCGCACG GTCTGGCGCC GACGACCTCG GCAATGCTTC AGCTCGCCAT CGGCGATGCA
CTGGCAATCG CGCTGCTGGA GCGGCGCGGC TTCTCTGCCG AGGACTTCAA GACCTTCCAT
CCGGGCGGCA AGCTGGGCGC GCAGCTGCGC CTCGTCCATG AGCTGGCGCA TGGCGCCGGG
CAGATGCCGT TGCTCCCTGT CGGTCGCCCG ATGAGCGAGG CGGTCATCGA GATGTCGGCC
AAGGGCTTCG GCGTCGTCGG CATCGTCGAT GAAAGCGGAA AGCTGGTCGG CGTCATCACC
GACGGCGATA TGCGCCGCCA CATGACGGCG GACCTCCTGG CGCAACCGGT CGAGGCCATA
ATGTCGCACA ACCCGCGTGT CCTCAGCCGC GACGTGCTGG CCAGTGCGGC CATGGAGTTT
ATGGAAGAAC ACAAGATCAC CGTGCTCTTC CTCGTCGGCG ATGCGGGCGC ACCGGTCGGC
ATCCTGCATA TTCACGATCT GCTGCGCGCC GGAGTCGCCT GA
 
Protein sequence
MQAKADGNAG VTVLESIGRT LATATNGIRA LADHLSSDET FADALVNAVE LMGDGDGRVV 
VSGVGKSGHI GRKIAATLAS TGTSAYFVHP TEASHGDLGM ITAQDALVLL SWSGETAELA
NMLTYAKRFK VPIISICSNR ESTLARNSEV ALVLPKVPEA CPHGLAPTTS AMLQLAIGDA
LAIALLERRG FSAEDFKTFH PGGKLGAQLR LVHELAHGAG QMPLLPVGRP MSEAVIEMSA
KGFGVVGIVD ESGKLVGVIT DGDMRRHMTA DLLAQPVEAI MSHNPRVLSR DVLASAAMEF
MEEHKITVLF LVGDAGAPVG ILHIHDLLRA GVA