Gene Smed_1202 details

Gene Information

Locus tag: Smed_1202
Symbol:
ID: 5322049
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 1281895
End bp: 1283778
Gene Length: 1884 bp
Protein Length: 627 aa
Translation table: 11
GC content: 60%
IMG OID: 640790143
Product: ATPase central domain-containing protein
Protein accession: YP_001326887
Protein GI: 150396420
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID:
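The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and simply mirror the field labels.

```python
# Values copied from the Gene Information fields above.
start_bp = 1281895
end_bp = 1283778
gene_length_bp = 1884
protein_length_aa = 627

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bp holds N / 3 codons; the last one is assumed to be the stop codon.
codons, remainder = divmod(gene_length_bp, 3)

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop matches protein length:", codons - 1 == protein_length_aa)
```

Under these assumptions all three checks agree with the record: the span is 1884 bp, which is 628 codons, or 627 amino acids plus one stop codon.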


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0870142
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.869838
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATCGG CAAAACCTGA CGAGCGGAAG ATCTCGCTCG AACTGACCAC CTATGCCAAT 
TCTCTGCGCA TCGCGTTGAG GCGGTGCGGG ATTTTTCTGA AGGGCGACAA GGCTGTAGGC
TTGCTACTGC CGCCTCAAGC GGACCTCGAG ACTTACCAGC AGGCAGTCAA ATCGGTCCTG
GTTGGCGGCG GAATCCTCCA CTACGCTGTC CCACTGGTGC ACGTCAGCGC TTACAAGGGG
GAGACGGACG CGGACGACGC AGTCAAGACC GTCTCGCGGA GCAACGCAGT GATCGTCGTG
ATCGCGCACG GTGCCGATCT GCCCCCAGAA CTCTCGGTCT CACTGGATCG TCTCGAGCAG
GTGACCGCGG TCAAGCCGTA TCATCTGATC GCTGCGGCAA GGACGGTACT CGATGCGGTA
ATCACGCGGG ATCAGGCCAC TGCGCTGTTC TCGTATCCTC TGCCGTTGAT GTTCGCCGCT
CTCAGGCGGT CACGCCCAAT CGAAGTCACC CTGGAGAAGC TGGCAGCTTC GATGGATTTG
CAGGCGGTGC CGGAGGTCAA GCGCGCGGTC GCTTGGGAGC CGAGGATCGA ACAGCTTGCA
GGCTATGGGA AGGCCACGGA TTGGGCGCTG GACCTCGTCG AGGATATCAC CGCGTGGAGA
CAGGGAGGCG TCGAGTGGTG CGACGTGGAC GCGGGATTGC TGCTGAGCGG TCCGCCAGGC
TGTGGCAAGA CCCTGTTCGC GAGCGCGGTA GCCAGATCGT GTGAAGCTTC TTTCTTCGCT
GTATCGAGTG CGGTTTGGCA ATCCCACGGC CACCTCGGTG ACATGTTGCG CGCAATGCGG
AAATCATTCG AGCAAGCGAT CGCTGCCGCA CCGTCGATTT TATTGATTGA CGAATTCGAT
TCCTTTGGCT CCCGAAAGAA CCTTCGGGGA GATGGCGCAT CTTATGGTCT ACAAGTGATC
AATGCCCTTC TCGAACACCT TGATGGCGCT GTGGGGCGCG AAGGTGTCGT CGTGATCGCG
GCTACCAACA GGCCGGACGA TATCGATGAG GCGCTCCGTA GGCCGGGTCG TCTGGATCGC
CACATTGCGG TGGAGATGCC CGATCAGGAA GCACGTGAGC AGATCCTGTC GGCGCATGCT
GGCGTGGCGT TGCCGCGGGA CGAACTGAAA ACGATCGCGG TGGCGACCAG TGGCTATTCC
GGGGCAGCTT TACGGCAGCT TACCCGTGAT GCGCGCCGGA TCGCTAGAAA AGCACGGAGG
TCGGTCTGCG GTTCCGACTT CATGTCGATC GTGCCGCCCG TCGCTGTGTT AACTCATAAG
GAGCGATGGC AAGTGTGTGT TCATGAGGCT GGACACGCGA TCGTGGGGCT CGCGCTCGGT
ACCGGTGATA TCGAGGCTAT CGTCGTCGCC CGTCAGGCAG CGCACCGGGA TGATAGCGTG
GGGCACGTCG AATGGCGCCG ACCAGTTGTG CTCAATCGGA CGCTTTGGGC TTACAGGAAC
GAGATTGCGA TGCTGCTCGG TGGCAGGGCT GCCGAAAAGG AAGTCCTGGC CGAAATGTAC
GTCGGCTCGG GTGGCGTTGA GGGATCCGAT CTTAATCGGG CAGCGGACAT TGCGACAATT
CTAATTGCCG GCCACGGTGT TCAAGGGCTC GGGTATACCG ACGTCTCCCG ATCGCGAGAT
CTCGATCAGC TTCGTCGGAC TGATGTCGTC TTGCGGAGGA GGGTGGAGCG GCTGCTGGCG
GAGGAACTGG CGCGAGCGGA GGATATCGTC AGGGAGCGGC GGGGCGATGT GATGCGCGTT
GCGGAAGCGC TCTTGGAACA CGAGGTGCTA TCGGGCGAAG GTGTCGCGAA GTTAATCCTA
GGACGACGAA ATTCTCAGTT CTAG
 
Protein sequence
MKSAKPDERK ISLELTTYAN SLRIALRRCG IFLKGDKAVG LLLPPQADLE TYQQAVKSVL 
VGGGILHYAV PLVHVSAYKG ETDADDAVKT VSRSNAVIVV IAHGADLPPE LSVSLDRLEQ
VTAVKPYHLI AAARTVLDAV ITRDQATALF SYPLPLMFAA LRRSRPIEVT LEKLAASMDL
QAVPEVKRAV AWEPRIEQLA GYGKATDWAL DLVEDITAWR QGGVEWCDVD AGLLLSGPPG
CGKTLFASAV ARSCEASFFA VSSAVWQSHG HLGDMLRAMR KSFEQAIAAA PSILLIDEFD
SFGSRKNLRG DGASYGLQVI NALLEHLDGA VGREGVVVIA ATNRPDDIDE ALRRPGRLDR
HIAVEMPDQE AREQILSAHA GVALPRDELK TIAVATSGYS GAALRQLTRD ARRIARKARR
SVCGSDFMSI VPPVAVLTHK ERWQVCVHEA GHAIVGLALG TGDIEAIVVA RQAAHRDDSV
GHVEWRRPVV LNRTLWAYRN EIAMLLGGRA AEKEVLAEMY VGSGGVEGSD LNRAADIATI
LIAGHGVQGL GYTDVSRSRD LDQLRRTDVV LRRRVERLLA EELARAEDIV RERRGDVMRV
AEALLEHEVL SGEGVAKLIL GRRNSQF
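
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done and how a GC percentage like the one reported in the Gene Information section might be recomputed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used here for brevity; the full sequence would have to be pasted in to reproduce the reported value.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAAATCGG CAAAACCTGA CGAGCGGAAG ATCTCGCTCG AACTGACCAC CTATGCCAAT"

seq = clean_sequence(first_line)
print("bases on first line:", len(seq))
print("starts with ATG:", seq.startswith("ATG"))
print("GC % of first line:", round(gc_percent(seq), 1))
```

The same `clean_sequence` helper also works for the protein sequence, since it only strips whitespace.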