Gene Smed_2623 details


Gene Information

Locus tag: Smed_2623
Symbol:
ID: 5323492
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 2721994
End bp: 2723931
Gene Length: 1938 bp
Protein Length: 645 aa
Translation table: 11
GC content: 63%
IMG OID: 640791567
Product: ATP-dependent metalloprotease FtsH
Protein accession: YP_001328288
Protein GI: 150397821
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID: [TIGR01241] ATP-dependent metalloprotease FtsH
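
The coordinate and length fields above are related by simple arithmetic, which the sketch below spells out. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; neither convention is stated in the record, although the listed values are consistent with both. The function name `check_lengths` is hypothetical.

```python
def check_lengths(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Return consistency checks for a non-spliced CDS record."""
    # Assumption: coordinates are 1-based and inclusive.
    span = end_bp - start_bp + 1
    codons = gene_len_bp // 3
    return {
        "span_matches_gene_length": span == gene_len_bp,
        "length_is_multiple_of_3": gene_len_bp % 3 == 0,
        # Assumption: the final codon is the stop codon and adds no residue.
        "protein_length_matches": codons - 1 == protein_len_aa,
    }


# Values copied from the record: 2723931 - 2721994 + 1 = 1938 bp,
# and 1938 / 3 = 646 codons = 645 residues + 1 stop codon.
checks = check_lengths(2721994, 2723931, 1938, 645)
print(checks)
```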


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCCTA ATTTTCGAAA TTTTGCCCTT TGGGCGATCA TAGCGCTTCT CCTGATAGCG 
CTGTTCAGCA TGTTCCAGCA GCCGACCGAA CGGGCAGGCT CGCGTGAAAT TCCGTTCTCG
CAATTCCTCA AGGACGTCGA TGCAAGCCGC GTAAAGGACG TGGTGATCAC CGGTTCGAAG
GTGATCGGCA GCTACACCGA AAGCGGAGCG ACCTTCCAGA CCTACGCCCC CGCGGTCGAC
ACGGCACTCA CGGAGCGGCT GGAAGCCAAG GACGTCACGG TCACGGTTCG TCCGGAGACG
GACGGCTCGT CGGGTTTCCT CAGCTATATC GGAACGCTCC TGCCGATGCT CCTGATCCTC
GGCGTCTGGC TGTTCTTCAT GCGCCAGATG CAAGGCGGCT CGCGCGGCGC CATGGGCTTC
GGCAAGTCCA AGGCCAAGCT TCTGACAGAA GCGCATGGCC GCGTAACCTT CGACGACGTC
GCCGGCGTCG ACGAGGCCAA GCAGGACCTT GAGGAGATCG TCGAATTTCT GCGCGATCCG
CAGAAGTTTC AGCGCCTCGG CGGCCGCATC CCGCGCGGCG TTCTGCTGGT CGGCCCCCCC
GGCACGGGTA AGACGCTGCT TGCCCGCTCG GTCGCCGGCG AGGCGAACGT GCCCTTCTTC
ACGATCTCGG GTTCGGACTT CGTCGAAATG TTCGTCGGCG TCGGTGCCTC GCGGGTCCGG
GACATGTTCG AGCAGGCGAA GAAGAACGCG CCCTGCATCA TCTTCATCGA CGAAATCGAC
GCGGTCGGCC GCCATCGCGG CGCCGGGCTT GGCGGCGGAA ACGACGAGCG CGAGCAGACG
CTGAACCAGT TGCTCGTCGA AATGGACGGC TTCGAGGCAA ATGAGGGCAT CATCCTCATT
GCCGCGACCA ACCGGCCGGA CGTGCTCGAC CCCGCGCTGC TGCGTCCGGG CCGCTTCGAC
CGCCAGGTCG TGGTGCCGAA CCCTGACATC AACGGCCGCG AGCGGATCCT CAAGGTGCAC
GTCCGAAACG TGCCGCTGGC GCCGAATGTC GACCTCAAGG TGCTTGCGCG CGGCACCCCG
GGCTTCTCCG GTGCCGATCT CATGAACCTC GTTAACGAGT CGGCGCTCAT GGCAGCGCGG
CGCAACAAGC GCGTCGTTAC CATGCAGGAG TTCGAGGACG CCAAGGACAA GATCATGATG
GGCGCGGAGC GCCGCTCTTC CGCCATGACC GAAGCCGAGA AGAAGCTGAC TGCCTATCAC
GAGGCGGGCC ATGCGATCCT CGCGCTCAAC GTTCCTTCGG CCGATCCGCT GCACAAGGCG
ACGATCATTC CCCGAGGCCG AGCGCTCGGC ATGGTGATGC AGCTGCCCGA AGGCGATCGC
TATTCGATGA GCTACAAGTG GATGATCTCG CGCCTCGCCA TCATGATGGG CGGACGTGTT
GCCGAGGAGC TGACCTTCGG CAAGGAGAAT ATCACTTCAG GCGCCTCCTC CGACATCGAG
CAGGCGACGA AGCTTGCCCG GGCGATGGTC ACGCAGTGGG GCTTCTCCGA CCAGCTCGGA
CAGGTCGCTT ACGGCGAGAA CCAGCAGGAG GTTTTCCTGG GCCATTCGGT TGCCCAGCAG
AAGAACGTTT CCGAGTCGAC TGCCCAGAAG ATCGACAACG AGATTCGGCG TCTGATAGAC
GAGGCCTATG AAACGGCGCG GCGCATCCTT ACCGAGCACC ATCATGAATT CGTGGCGCTC
GCGGAAGGCT TGCTCGAGTA CGAGACACTC ACGGGCGACG AGATCAAGGC TCTGATCCGC
GGCGAGAAGC CGGCGCGCGA TCTTGGCGAC GACACGCCGC CGCATCGCGG CTCGGCCGTT
CCCTCTGCCG GTACCAAGAA AGAAACAGGC AACAAGGGCG AGGAGCCCGA AGGCGGGTTC
GAACCGCAGC CGCAATAG
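
The GC content field of the record could in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as a string with the 10-base blocks and line breaks left in; the helper name `gc_percent` is hypothetical, and the demonstration uses a toy input rather than the full gene.

```python
def gc_percent(seq):
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (block spacing, line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in bases)
    return 100.0 * gc / len(bases)


# Toy input: 4 of 8 bases are G or C.
print(gc_percent("GGCC AATT"))  # 50.0
```

Passing the full gene sequence to `gc_percent` and rounding the result would give a value to compare with the GC content listed in the Gene Information block.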
 
Protein sequence
MNPNFRNFAL WAIIALLLIA LFSMFQQPTE RAGSREIPFS QFLKDVDASR VKDVVITGSK 
VIGSYTESGA TFQTYAPAVD TALTERLEAK DVTVTVRPET DGSSGFLSYI GTLLPMLLIL
GVWLFFMRQM QGGSRGAMGF GKSKAKLLTE AHGRVTFDDV AGVDEAKQDL EEIVEFLRDP
QKFQRLGGRI PRGVLLVGPP GTGKTLLARS VAGEANVPFF TISGSDFVEM FVGVGASRVR
DMFEQAKKNA PCIIFIDEID AVGRHRGAGL GGGNDEREQT LNQLLVEMDG FEANEGIILI
AATNRPDVLD PALLRPGRFD RQVVVPNPDI NGRERILKVH VRNVPLAPNV DLKVLARGTP
GFSGADLMNL VNESALMAAR RNKRVVTMQE FEDAKDKIMM GAERRSSAMT EAEKKLTAYH
EAGHAILALN VPSADPLHKA TIIPRGRALG MVMQLPEGDR YSMSYKWMIS RLAIMMGGRV
AEELTFGKEN ITSGASSDIE QATKLARAMV TQWGFSDQLG QVAYGENQQE VFLGHSVAQQ
KNVSESTAQK IDNEIRRLID EAYETARRIL TEHHHEFVAL AEGLLEYETL TGDEIKALIR
GEKPARDLGD DTPPHRGSAV PSAGTKKETG NKGEEPEGGF EPQPQ
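
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates this on the first 30 and last 18 bases of the gene, using the standard codon assignments. It assumes that translation table 11 assigns internal codons the same amino acids as the standard code and differs only in which codons may act as start codons. The names `CODON_TABLE`, `clean` and `translate` are hypothetical helpers, not part of any database tool.

```python
# Codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove block spacing and line breaks, and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_codons = "ATGAACCCTA ATTTTCGAAA TTTTGCCCTT"  # start of the gene
last_codons = "GAACCGCAGC CGCAATAG"                # final line of the gene

print(translate(first_codons))  # MNPNFRNFAL
print(translate(last_codons))   # EPQPQ
```

Both fragments reproduce the matching ends of the listed protein: the first ten residues MNPNFRNFAL and the final five residues EPQPQ, with TAG read as the stop codon.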