Gene Smed_4452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4452 
Symbol 
ID5318604 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp936403 
End bp937482 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content62% 
IMG OID640776254 
Producthypothetical protein 
Protein accessionYP_001313187 
Protein GI150376591 
COG category[R] General function prediction only 
COG ID[COG1073] Hydrolases of the alpha/beta superfamily 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0556925 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGC ACGTTGACAC GGACGACGTT TGCAACACCA GCAGGCGCAA CCTGATGAAG 
GGAACAGGCC TCGCCGTCGC GGCGATAAGC ATGATCCCGG CTGTGACGGC AACAGGGGCG
TTCGCGCAGA GCCCTGCCTG GGACAAGGTG TTTCCGAAAA GCGAGAACGT CGATCACCAG
AAGGTTTCGT TCAAGAACCG CTACGGCATC ACGATCGCCG GCGACCTCTA CCTGCCGAAG
AACCGCGGCA GTCAACCCTT AGCAGCTCTT GCGGTCGCCG GCCCCTTTGG TGCCGTGAAG
GAACAATCTT CGGGATTGTA CGCTCAGACC ATGGCCGAAC GCGGCTTCGC GGCGCTGGCC
TTCGACCCTT CCTTTACCGG TGAAAGTGGT GGCGAGCCAC GCAACGTCGC TTCGCCGGAC
ATCAACACGG AAGACTTCAG TGCCGCGGTC GATTACCTGG GACTGCAGCC CACCATCGAC
CGCGAGCGGA TCGGCGTGAT CGGCATTTGC GGCTGGGGTG GCATGGCCCT GAACGCCGTC
GCCGCTGACA AGCGCGTCAA GGCGGTCGTG GCCAGCACCA TGTACGACAT GACCCGTCTG
ATGTCCAAAG GCTACAACGA CAGTGTCACG CAGGAGCAGC GGACGCAGAC GCTGGAGCAG
TTGAGCCGCC AGCGCTGGGC GGACGCGGAG AAGAACGGGC CAGCCTATCA GCCGCCCTAC
AATGTACTCA AGGGAGGCGA GGCTCAGTTC CTGGTCGACT ATCACGATTA CTACATGACG
CCCCGCGGCT ACCATCCGCG CGCTGTCAAC TCCGGCAATG CCTGGACGCA GACCACGCCC
CTGTCGTTCA TGAACATGCC GATCCTGACC TACATCGCCG AGATTTCCCC GCGCCCGCTT
CTGCTCATCC ACGGCGAGAA CGCCCATTCG CGATACTTCA GCGAAACAGC CTTTGCCGCC
GCAGCGGAGC CAAAGGAGCT GATGATCATC CCGAACGCCA ACCATACCGA TCTCTACGAC
CGCATGGACA AGATCCCGTT CGACCGGATC GCCAAGTTCT TCGGGCAGCA TCTGGCCTAG
 
Protein sequence
MSEHVDTDDV CNTSRRNLMK GTGLAVAAIS MIPAVTATGA FAQSPAWDKV FPKSENVDHQ 
KVSFKNRYGI TIAGDLYLPK NRGSQPLAAL AVAGPFGAVK EQSSGLYAQT MAERGFAALA
FDPSFTGESG GEPRNVASPD INTEDFSAAV DYLGLQPTID RERIGVIGIC GWGGMALNAV
AADKRVKAVV ASTMYDMTRL MSKGYNDSVT QEQRTQTLEQ LSRQRWADAE KNGPAYQPPY
NVLKGGEAQF LVDYHDYYMT PRGYHPRAVN SGNAWTQTTP LSFMNMPILT YIAEISPRPL
LLIHGENAHS RYFSETAFAA AAEPKELMII PNANHTDLYD RMDKIPFDRI AKFFGQHLA