Gene Smed_5691 details


Gene Information

Locus tag: Smed_5691
Symbol:
ID: 5319993
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 660858
End bp: 664025
Gene Length: 3168 bp
Protein Length: 1055 aa
Translation table: 11
GC content: 62%
IMG OID: 640777417
Product: sterile alpha motif-containing protein
Protein accession: YP_001314349
Protein GI: 150377754
COG category: [R] General function prediction only
COG ID: [COG3899] Predicted ATPase
TIGRFAM ID:
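
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the record; helper names are hypothetical.
START_BP = 660858
END_BP = 664025


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded, assuming one terminal stop codon is excluded."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3168
print(protein_length(length_bp))  # 1055
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.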


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGACG TCACGACATG GCTGGCCGAG ATCGGATTGC AGCACTTGGC GGGCAAATTC 
GCAGACGCGC AGATCGATTT CGACACGCTT GCGCTGCTAT CAGAGCAGGA TCTGCGCGAG
CTGGGCATCC CGTTGGGCCC ACGCCGCAAG CTGCTCGCCG CCATCGCTGC GCTCGGCCGG
TCGGTGCGGC CCCAAGGGAG TGATCCGACT CCAGTCGAAC GCCGCCAGCT CACGATCCTT
ATTGGCGACA TGGTTGGTTC GACCGAGTAT GCCTCCAGGC TCGACCCCGA AGACGTCAGC
CAGCTCACGC AGACATTCCT CTCGCGATGC AGCGCGCTTG CAAAGACCCA TGGGGGCTTC
GTCGCGAACT ATGTCGGCGA CGCGCTCCAG GTCCTGTTCG GTTTTCCGGC AGCGGAGGAG
AACGACGCGG AGCGCGCGGT ACATTTGGCG TTCGACATCG TGGCAGCGGT GCCACAGATA
GAGACGCCGG ACAGTTCGCG CTTGGGGGTG AGGATTGGGG TTGCCAGCGG CCTAGTAGTC
GTGGGCGACA TCGAGGGTGC GCCTGCCGGA ATTTCGACGG TGGCATTCGG CCATGTCCCG
AATCTCGCCC AACGACTTCA GGCCATCGCC GAGCCAAACG GAATCTTCGT CGATGAGAAC
ACGTTCCGGG CGACGGCAAG CGCCTTCGCT TACGCCGACA TTGGCGAAAA GACGCTGAAG
GGCTTCAGCG ATCCCGTCCA CGTCCGGCGC GCCCTCAAGC CAATCGCAAG AGAATACCGC
TTCGCGGGCG AACTCCGATC GACGCCGCTG GTGGGAAGGA GGAGCGAACT GCAGGCAGTC
GAGGCCCTCT GGGACGCGGT GCGAGCCGAC CGCCATGGTC GCGTCGCGTT GATAAGCGGC
GAGCCGGGGA TAGGGAAGTC GCGGCTGCTC TTCGAGGTTG GGAGGAGTTT TCCCGGAGTG
AAGGCGCTTG TTGCGCAGTG CGCCCCCGCC TTCGCCAACA GCGCACTCTA TCCGTTTCTA
AGGCTGTTCA AGCAGGAGGT TGGAATTACC GAGGACGCGC TGCTGTCGTC TGATAGGCTT
CGGGCCGCTC TGTCGTCGAG CACCATCCCT TTGTCGGTCT CGTATCCGAT CTTCACACGC
CTTCTGACAG TCGAACCGGA CTATGAACCG TCCAGGCTTG CCTCCTCTCA GCAGGAAGCC
GTGATTAGCC AAGTGTTTTC GGGCTGGCTC CGGCAATTGT CGAGCGCCGG ACCCCTGATG
CTCTTCGTGG AGGACGAGCA GTGGATCGAC CCGTCCTCCG GCAAGCTCCT CCAGACGCTT
GCGCACGACG TCGCGCAATT TTCAGTGCTC CTCCTGGTGA CCTCGCGCGA GAAGCAGACG
AAGGCTGGCT TCGACAACGC CGTCGCGGCG CATCTTGCAC TCGAGCGGTT CTCCCGCGAG
GAGGCTGGCG AGTTCGTCCA GAACGTCGTC GAAGGGGGCC ATCTGTCTTC CAGCGTCGTC
GCGACGCTTC TCAGCAAGGC CGAAGGTGTT CCCCTCTATC TTGAGGAACT CGCCCGATCC
GCCCTCATGC TTGCTCCCTA TACGTCGGGA CGGTCAGGGG CGAAGGCCGA GTGCGACGTC
GACGTCCCCG TCTCCCTCCA GTCCGCGCTC ATGTCCCGTC TCGACAAGCT CGGCGCCGTG
AAGACGGTCG CGCAGACTGC CGCTGTAATC GGCCGAGAAT TCGACCTGAA GACGCTCGCC
CATGTTGTCG GTCTCTCGAG CGAGATGCTG CGGCCGCAGA TCGAACGACT TGTCGCCGTC
GGCCTCGTCG CGCCGCAGCC CTTCAGCAAC TGGCCCCGGT ACTCCTTCTC TCACAGTCTG
CTGCAGGAGG CCGCCCGCGG GGCACTGCTT CGGGACAGGC GGCGAGAGCT TCATGCCCTC
GTCGCCAAGG CAATCGAGGT CGTCCAGCCG ACAATGGTAG TGGAGCATCC CGAGATCCTC
GCGCAGCATT ATGACGAGGC GACGCAGTTC GAACTCGCCG CCGACTACTG GCTCAGGGCG
GGCCGGAAGC TTGGGGCGAC GTGGGCAAAG GTCGAGGCTG CGAACATGTT CGCGAAAGGT
ATCGAATGTG TCCGCAGGAT GCCGCCCTCG AAGATGCGCG ACAGCAGAGA GCTGACCCTC
GAACTCGAAA GGGGCGACGT TCTCTACGCG GCATATGGAT ATATCACCGA GGACGGCAGC
GACGCATACC GGAATGTCAT GCGGCTCAGC GAGGCGACCG GAGACTCAGA GGCGGCGATT
CGGGCCCTGG ACGGCTTGTT CGGAACTTCG TTCAACTCAG CCCGGTTCTC GGACGCGGAA
TGGGCGAGCA ACCGGCTTAA GGAGATTGGC CGCAGGGATG GCAACATCAA GGCCCTCGTG
CTCGGACTGC AGTTCGGAGG CATGTGCGCC TTCGCGCGCG GCAGATTTTC GCAGGCGCGA
GAGGTGTTCC TGGAGGCTCT CGAAAACCGC GAGCATGCCG ACGAGGTCGG AAGCGACTTT
CCAAGTATGT CGATGCTTTA TCTCTCGTGG ACGCTGCAAA TCTTGGGGGA TGCCGACGCG
GCCATGGAGT TGTTTCACGC GGCCGAAACG GAGACCCGCC GACAAACGGA CTACCGCCTC
GCTGCCTGTC TGGGTAACGG CTGCATACTG ATGTCTCTTC GTCAGGATGT AGCAACGCTC
CAGAGGCTGG TGGACGACCT CGTGCCGCTC GCGAAGCGCA ACGGATTCCA GCTTTGGCTG
AACTTTGCTT CGTTCTTTTC CGGGTGGGCC AAGGTGGCAA GTCGCCAAGA CGCCTCCGGC
CTCGCGCAGA TGAAGCACAT ATGCGACAAT ATGGGACAGC AGGAGGTCGA TAAGACATGC
TATCTAGGCG TGCTCGCCGA TAGCTACCTT CGAATGGGAC GGGCTGACGA GGCAACGGCG
GTCCTCGATC AGGCACTCAA GCTCGCGGCG CGTACTGGTG AGCACTACTA CACGTCCGAG
TTGCTTCGTC TGCGGGGAGA ACTTCTAAGG CAGACGCAGC GGCTCTCTGA TGCGAAAGAA
TCGTTTCAGG AGGCCATCTC TTTTGCGCGC GAGCAAGGGG CCAAGGCATG GGAGGCGAGG
GCCAAGCAGT CGCTTGATTT GCTTTCGTCG GAGCTGCCGC GTGGTTAA
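
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. A minimal sketch, assuming the text is pasted in as a plain string; `join_blocks` and `gc_fraction` are hypothetical helper names, and only the final line of the sequence is used here as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Last line of the gene sequence, copied from the record.
last_line = "GCCAAGCAGT CGCTTGATTT GCTTTCGTCG GAGCTGCCGC GTGGTTAA"
seq = join_blocks(last_line)
print(len(seq))             # 48
print(seq.endswith("TAA"))  # True: the CDS ends in a stop codon
```

Applying `join_blocks` to the full sequence and passing the result to `gc_fraction` is one way to compare against the reported GC content field.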
 
Protein sequence
MSDVTTWLAE IGLQHLAGKF ADAQIDFDTL ALLSEQDLRE LGIPLGPRRK LLAAIAALGR 
SVRPQGSDPT PVERRQLTIL IGDMVGSTEY ASRLDPEDVS QLTQTFLSRC SALAKTHGGF
VANYVGDALQ VLFGFPAAEE NDAERAVHLA FDIVAAVPQI ETPDSSRLGV RIGVASGLVV
VGDIEGAPAG ISTVAFGHVP NLAQRLQAIA EPNGIFVDEN TFRATASAFA YADIGEKTLK
GFSDPVHVRR ALKPIAREYR FAGELRSTPL VGRRSELQAV EALWDAVRAD RHGRVALISG
EPGIGKSRLL FEVGRSFPGV KALVAQCAPA FANSALYPFL RLFKQEVGIT EDALLSSDRL
RAALSSSTIP LSVSYPIFTR LLTVEPDYEP SRLASSQQEA VISQVFSGWL RQLSSAGPLM
LFVEDEQWID PSSGKLLQTL AHDVAQFSVL LLVTSREKQT KAGFDNAVAA HLALERFSRE
EAGEFVQNVV EGGHLSSSVV ATLLSKAEGV PLYLEELARS ALMLAPYTSG RSGAKAECDV
DVPVSLQSAL MSRLDKLGAV KTVAQTAAVI GREFDLKTLA HVVGLSSEML RPQIERLVAV
GLVAPQPFSN WPRYSFSHSL LQEAARGALL RDRRRELHAL VAKAIEVVQP TMVVEHPEIL
AQHYDEATQF ELAADYWLRA GRKLGATWAK VEAANMFAKG IECVRRMPPS KMRDSRELTL
ELERGDVLYA AYGYITEDGS DAYRNVMRLS EATGDSEAAI RALDGLFGTS FNSARFSDAE
WASNRLKEIG RRDGNIKALV LGLQFGGMCA FARGRFSQAR EVFLEALENR EHADEVGSDF
PSMSMLYLSW TLQILGDADA AMELFHAAET ETRRQTDYRL AACLGNGCIL MSLRQDVATL
QRLVDDLVPL AKRNGFQLWL NFASFFSGWA KVASRQDASG LAQMKHICDN MGQQEVDKTC
YLGVLADSYL RMGRADEATA VLDQALKLAA RTGEHYYTSE LLRLRGELLR QTQRLSDAKE
SFQEAISFAR EQGAKAWEAR AKQSLDLLSS ELPRG