Gene Smed_5015 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5015 
Symbol 
ID5318754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1537297 
End bp1538286 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content59% 
IMG OID640776797 
ProductKpsF/GutQ family protein 
Protein accessionYP_001313729 
Protein GI150377133 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0794] Predicted sugar phosphate isomerase involved in capsule formation 
TIGRFAM ID[TIGR00393] KpsF/GutQ family protein 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.336764 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGTGG TTGTGAGTGA CCCAATACTG GCGTCGATCA GTCGTACGAT CGCCACTGCC 
GCCGACGGAA TCCATGCGCT GGCGGCCTGT CTGGAGGAGA ATGCGGCCTT GCGCCGTAGT
TTTGTCGATG CCATCGAACT CGTCGCCTCC AAGCGTGGCC GGGTCGTTGT GGCGGGTGTT
GGCAAGAGCG GGCACATCGG ACGCAAGATC GCCGCGACGC TAGCTTCTAC CGGTACGTCC
GCCTATTTCG TGCATCCGAC GGAAGCAAGT CACGGCGATC TCGGCATGAT CACCGCCGAG
GATTTGCTCA TCCTCCTTTC ATGGTCCGGT GAGACGGTCG AACTCGGGAA CGTGCTCACT
TACGCCAAGC GCTTCAATGT CCCGGTCATT TCCGTCACGT CGAATGCCGA CAGCACAATC
GCTCGCAACT CCACGATCCC AGTGATTCTG CCGAAAGTGC CGGAGGCATG CCCACATGGT
CTAGCTCCGA CCACGTCTGC AATACTGCAG TTGGCGGTGG GGGATGCTTT TGCAATAGCC
TTGCTTGAGC GGAGGGGGTT TTCGGCCGAG GACTTCAAGA CGTTTCATCC GGGCGGCAAG
CTCGGTTCGC AGTTGCTGCT CGCCCATGAA CTGGCCCATT CGGGTGAGGC TGTGCCACTT
TTGCCGATCG GCAGTCCGAT GAGCGAAGCA GTCATTCAGA TGTCTTGCAA GGGTTTCGGG
GTCGTCGGCG TCGTTGGTGG CGACGGTGAG CTTGTCGGTG TCATTACCGA CGGCGATTTG
CGGCGCCACA TGTCACAAAA TCTTTTGCTC CTCACCGTCG AGACCGTGAT GTCGCACATG
CCTCGCGTGA TTACGCCTGG AATGCTGGCC AGTGCAGCCA TGGAAATGAT GCAATCGCAA
AAGATCACGG TGCTGTTTCT GGTCGATGAC GTTGGCCGGC CGTCGGGCAT CTTGCACGTC
CACGATCTTC TGCGTGCCGG CGTGGCCTAA
 
Protein sequence
MNVVVSDPIL ASISRTIATA ADGIHALAAC LEENAALRRS FVDAIELVAS KRGRVVVAGV 
GKSGHIGRKI AATLASTGTS AYFVHPTEAS HGDLGMITAE DLLILLSWSG ETVELGNVLT
YAKRFNVPVI SVTSNADSTI ARNSTIPVIL PKVPEACPHG LAPTTSAILQ LAVGDAFAIA
LLERRGFSAE DFKTFHPGGK LGSQLLLAHE LAHSGEAVPL LPIGSPMSEA VIQMSCKGFG
VVGVVGGDGE LVGVITDGDL RRHMSQNLLL LTVETVMSHM PRVITPGMLA SAAMEMMQSQ
KITVLFLVDD VGRPSGILHV HDLLRAGVA