Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5691 |
Symbol | |
ID | 5319993 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 660858 |
End bp | 664025 |
Gene Length | 3168 bp |
Protein Length | 1055 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640777417 |
Product | sterile alpha motif-containing protein |
Protein accession | YP_001314349 |
Protein GI | 150377754 |
COG category | [R] General function prediction only |
COG ID | [COG3899] Predicted ATPase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGACG TCACGACATG GCTGGCCGAG ATCGGATTGC AGCACTTGGC GGGCAAATTC GCAGACGCGC AGATCGATTT CGACACGCTT GCGCTGCTAT CAGAGCAGGA TCTGCGCGAG CTGGGCATCC CGTTGGGCCC ACGCCGCAAG CTGCTCGCCG CCATCGCTGC GCTCGGCCGG TCGGTGCGGC CCCAAGGGAG TGATCCGACT CCAGTCGAAC GCCGCCAGCT CACGATCCTT ATTGGCGACA TGGTTGGTTC GACCGAGTAT GCCTCCAGGC TCGACCCCGA AGACGTCAGC CAGCTCACGC AGACATTCCT CTCGCGATGC AGCGCGCTTG CAAAGACCCA TGGGGGCTTC GTCGCGAACT ATGTCGGCGA CGCGCTCCAG GTCCTGTTCG GTTTTCCGGC AGCGGAGGAG AACGACGCGG AGCGCGCGGT ACATTTGGCG TTCGACATCG TGGCAGCGGT GCCACAGATA GAGACGCCGG ACAGTTCGCG CTTGGGGGTG AGGATTGGGG TTGCCAGCGG CCTAGTAGTC GTGGGCGACA TCGAGGGTGC GCCTGCCGGA ATTTCGACGG TGGCATTCGG CCATGTCCCG AATCTCGCCC AACGACTTCA GGCCATCGCC GAGCCAAACG GAATCTTCGT CGATGAGAAC ACGTTCCGGG CGACGGCAAG CGCCTTCGCT TACGCCGACA TTGGCGAAAA GACGCTGAAG GGCTTCAGCG ATCCCGTCCA CGTCCGGCGC GCCCTCAAGC CAATCGCAAG AGAATACCGC TTCGCGGGCG AACTCCGATC GACGCCGCTG GTGGGAAGGA GGAGCGAACT GCAGGCAGTC GAGGCCCTCT GGGACGCGGT GCGAGCCGAC CGCCATGGTC GCGTCGCGTT GATAAGCGGC GAGCCGGGGA TAGGGAAGTC GCGGCTGCTC TTCGAGGTTG GGAGGAGTTT TCCCGGAGTG AAGGCGCTTG TTGCGCAGTG CGCCCCCGCC TTCGCCAACA GCGCACTCTA TCCGTTTCTA AGGCTGTTCA AGCAGGAGGT TGGAATTACC GAGGACGCGC TGCTGTCGTC TGATAGGCTT CGGGCCGCTC TGTCGTCGAG CACCATCCCT TTGTCGGTCT CGTATCCGAT CTTCACACGC CTTCTGACAG TCGAACCGGA CTATGAACCG TCCAGGCTTG CCTCCTCTCA GCAGGAAGCC GTGATTAGCC AAGTGTTTTC GGGCTGGCTC CGGCAATTGT CGAGCGCCGG ACCCCTGATG CTCTTCGTGG AGGACGAGCA GTGGATCGAC CCGTCCTCCG GCAAGCTCCT CCAGACGCTT GCGCACGACG TCGCGCAATT TTCAGTGCTC CTCCTGGTGA CCTCGCGCGA GAAGCAGACG AAGGCTGGCT TCGACAACGC CGTCGCGGCG CATCTTGCAC TCGAGCGGTT CTCCCGCGAG GAGGCTGGCG AGTTCGTCCA GAACGTCGTC GAAGGGGGCC ATCTGTCTTC CAGCGTCGTC GCGACGCTTC TCAGCAAGGC CGAAGGTGTT CCCCTCTATC TTGAGGAACT CGCCCGATCC GCCCTCATGC TTGCTCCCTA TACGTCGGGA CGGTCAGGGG CGAAGGCCGA GTGCGACGTC GACGTCCCCG TCTCCCTCCA GTCCGCGCTC ATGTCCCGTC TCGACAAGCT CGGCGCCGTG AAGACGGTCG CGCAGACTGC CGCTGTAATC GGCCGAGAAT TCGACCTGAA GACGCTCGCC CATGTTGTCG GTCTCTCGAG CGAGATGCTG CGGCCGCAGA TCGAACGACT TGTCGCCGTC GGCCTCGTCG CGCCGCAGCC CTTCAGCAAC TGGCCCCGGT ACTCCTTCTC TCACAGTCTG CTGCAGGAGG CCGCCCGCGG GGCACTGCTT CGGGACAGGC GGCGAGAGCT TCATGCCCTC GTCGCCAAGG CAATCGAGGT CGTCCAGCCG ACAATGGTAG TGGAGCATCC CGAGATCCTC GCGCAGCATT ATGACGAGGC GACGCAGTTC GAACTCGCCG CCGACTACTG GCTCAGGGCG GGCCGGAAGC TTGGGGCGAC GTGGGCAAAG GTCGAGGCTG CGAACATGTT CGCGAAAGGT ATCGAATGTG TCCGCAGGAT GCCGCCCTCG AAGATGCGCG ACAGCAGAGA GCTGACCCTC GAACTCGAAA GGGGCGACGT TCTCTACGCG GCATATGGAT ATATCACCGA GGACGGCAGC GACGCATACC GGAATGTCAT GCGGCTCAGC GAGGCGACCG GAGACTCAGA GGCGGCGATT CGGGCCCTGG ACGGCTTGTT CGGAACTTCG TTCAACTCAG CCCGGTTCTC GGACGCGGAA TGGGCGAGCA ACCGGCTTAA GGAGATTGGC CGCAGGGATG GCAACATCAA GGCCCTCGTG CTCGGACTGC AGTTCGGAGG CATGTGCGCC TTCGCGCGCG GCAGATTTTC GCAGGCGCGA GAGGTGTTCC TGGAGGCTCT CGAAAACCGC GAGCATGCCG ACGAGGTCGG AAGCGACTTT CCAAGTATGT CGATGCTTTA TCTCTCGTGG ACGCTGCAAA TCTTGGGGGA TGCCGACGCG GCCATGGAGT TGTTTCACGC GGCCGAAACG GAGACCCGCC GACAAACGGA CTACCGCCTC GCTGCCTGTC TGGGTAACGG CTGCATACTG ATGTCTCTTC GTCAGGATGT AGCAACGCTC CAGAGGCTGG TGGACGACCT CGTGCCGCTC GCGAAGCGCA ACGGATTCCA GCTTTGGCTG AACTTTGCTT CGTTCTTTTC CGGGTGGGCC AAGGTGGCAA GTCGCCAAGA CGCCTCCGGC CTCGCGCAGA TGAAGCACAT ATGCGACAAT ATGGGACAGC AGGAGGTCGA TAAGACATGC TATCTAGGCG TGCTCGCCGA TAGCTACCTT CGAATGGGAC GGGCTGACGA GGCAACGGCG GTCCTCGATC AGGCACTCAA GCTCGCGGCG CGTACTGGTG AGCACTACTA CACGTCCGAG TTGCTTCGTC TGCGGGGAGA ACTTCTAAGG CAGACGCAGC GGCTCTCTGA TGCGAAAGAA TCGTTTCAGG AGGCCATCTC TTTTGCGCGC GAGCAAGGGG CCAAGGCATG GGAGGCGAGG GCCAAGCAGT CGCTTGATTT GCTTTCGTCG GAGCTGCCGC GTGGTTAA
|
Protein sequence | MSDVTTWLAE IGLQHLAGKF ADAQIDFDTL ALLSEQDLRE LGIPLGPRRK LLAAIAALGR SVRPQGSDPT PVERRQLTIL IGDMVGSTEY ASRLDPEDVS QLTQTFLSRC SALAKTHGGF VANYVGDALQ VLFGFPAAEE NDAERAVHLA FDIVAAVPQI ETPDSSRLGV RIGVASGLVV VGDIEGAPAG ISTVAFGHVP NLAQRLQAIA EPNGIFVDEN TFRATASAFA YADIGEKTLK GFSDPVHVRR ALKPIAREYR FAGELRSTPL VGRRSELQAV EALWDAVRAD RHGRVALISG EPGIGKSRLL FEVGRSFPGV KALVAQCAPA FANSALYPFL RLFKQEVGIT EDALLSSDRL RAALSSSTIP LSVSYPIFTR LLTVEPDYEP SRLASSQQEA VISQVFSGWL RQLSSAGPLM LFVEDEQWID PSSGKLLQTL AHDVAQFSVL LLVTSREKQT KAGFDNAVAA HLALERFSRE EAGEFVQNVV EGGHLSSSVV ATLLSKAEGV PLYLEELARS ALMLAPYTSG RSGAKAECDV DVPVSLQSAL MSRLDKLGAV KTVAQTAAVI GREFDLKTLA HVVGLSSEML RPQIERLVAV GLVAPQPFSN WPRYSFSHSL LQEAARGALL RDRRRELHAL VAKAIEVVQP TMVVEHPEIL AQHYDEATQF ELAADYWLRA GRKLGATWAK VEAANMFAKG IECVRRMPPS KMRDSRELTL ELERGDVLYA AYGYITEDGS DAYRNVMRLS EATGDSEAAI RALDGLFGTS FNSARFSDAE WASNRLKEIG RRDGNIKALV LGLQFGGMCA FARGRFSQAR EVFLEALENR EHADEVGSDF PSMSMLYLSW TLQILGDADA AMELFHAAET ETRRQTDYRL AACLGNGCIL MSLRQDVATL QRLVDDLVPL AKRNGFQLWL NFASFFSGWA KVASRQDASG LAQMKHICDN MGQQEVDKTC YLGVLADSYL RMGRADEATA VLDQALKLAA RTGEHYYTSE LLRLRGELLR QTQRLSDAKE SFQEAISFAR EQGAKAWEAR AKQSLDLLSS ELPRG
|
| |