Gene Smed_1159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1159 
Symbol 
ID5322005 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1234734 
End bp1237634 
Gene Length2901 bp 
Protein Length966 aa 
Translation table11 
GC content65% 
IMG OID640790100 
Productsporulation domain-containing protein 
Protein accessionYP_001326845 
Protein GI150396378 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.30865 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.808052 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGACA AACAATTCGC ACGAAGCGGA CCGGCGGAAT TCGATAGGTT AGCCGATGAT 
GATCCGTTGA GCGAACTTGC CCGCATTGTC GGTTACGATG CTCGGCCAGC CGTTCAGCAA
TTGCAGGAGT TGCAGCGCCA TCAGGAGGCG ATGCGGCGCG AGCCGGTTTT CGATCTCGAG
GAGGAGCTCC TTCGCGCGTT CGACAGTTAC GATGCTCCCC AGGCACTGCC CGTCCGGCCC
GATAGCGGCC TTGTCGGGCA TCCTTCGGAC GGCATGTCCG ATGATCGGGA GCATGCCGTA
TTCGAAGAAC AGTTCGCTCC GGATTCAGCC GCCACCGATG CCGTTCCGGC CGTCGAAAAG
GTCGCCGTCG GTGCCGAGTA TGCATCCATC GAGACGCCGA TCGAAGAGGC GCCGAGTGAC
GATGCAGCAT TTGCGTCAGA GCGGTATTCC CCTGCATCCA ATGCCGATGT GGCCGAACGC
GCAATGGCAT CGCATCAGGA GCCTGCCGTC GACCTAGAGC GCGAGCTCGA ATTGTCGCTG
GGCTACGATG CGTCACCTGA TGACACGGAC GCCGTGCAGT CGGTTGTTGT CCCGGCTGAC
GCAAACCCTG CTGCCGGCCT AAGTGATGAG CCGGTTTTCG CCGATTGGCA GGAGCCGCTC
TCCTCAAAGC TGGCCGTCGC CGGCAAACTC GCCGAGCGGG CCGGGCTGGC GGAACCGGTC
TATGTCGACA TGGCCGGTCA TTTCGAGGGT GTTGAGCGTG CGGATATTGC CGCGGCGCCT
GCAGAAATGC TTCCGCAGCA GGCGGCTTCA GCAGACGCCG ACCGGCTCCT GGCCGATGTC
GAGCGCTTCC CGGTCCCGGC GGCAGCAGAG GCTGCGGCCC CCGTCGTGGC CGAAATTCAG
GATCGGCCTG CTCCGGTCGT GAAGAAGAAT AACTTCCCCT TTACGCCGAC ATTCAGCCGC
GCGACGCCGG TGGCGTCACC TGGCGGTGTT TCGCAGCAGC GCGCTTTTGC AACGCCGGCT
GTCGATGCGG TCATCGCGAG TGCGGCCGCT GTTGGCCCGG TGTCTGCCGT GCCGAGCCAA
CCCCCGGAAG ATGCACGCGC ACTGCGTGCG GAAGAGGCGC CTCCGGTGCT TCAGGACAGC
TCGCCTGAAG CGGAAGTCGA GCCGAGCTTC GACATCGAAG ACTTCGAACT GGAGTTGGCC
GACATAGCGC TCGATCTTGC GGACGGCCCT GTTGATGAGG CTCCTGCGAG CGTCGTCGCT
GCGCACCCGG CCGTCTCCCA CCCGGCGCTC GTCGAAACGC CTGCACAGCC CTATCTGCAG
GCTCTGAACA TGCCGGGCGA TCCGGAGACA ATCGTCCGCA GGCCGCAAAA CCCGGCGCCT
GAGCCTTTTG TGCCCGTGGA GCCCGAAATG AAGCCTGCCG AGGTCGTATT GCCTTTCGAC
CCGGCGATGA TCGGCGAGAC GGAGACGGGG GTCGCGCCTA TCGCCGAACT GGACGTCCCG
CAATTGCCCG CGGTCGAGGC AGAGGGGAAG GCGGCCGGAT ATCCCGCCGA CTACGATCTC
GACATCGATG CCGAGATGGC TCAACTGTTC AGTGCGCCTG CTTCCCCCGC CCGGAGCGCC
GCAGACCACG CATCCGCGCC TGTTTCGGCG GATAAAGCGG GATCTACATT CGCTTCGGCT
GATGATTTCG ACGAGTTCGA GAAGGCTATG GAGGAGGATT TCCAGCGATC CATGGCAGAG
CGACGGACCT CGGTTCCGGA AGCCGAAGGC ATGACGGTGA TGCCCCATGC GCAGGCCGAA
GAATATGTTG AAGAAGGTTA TGGCCGCCGC TCCCAGCGCA TGATGATGCT CGCCGCGAGT
GTAGCCGGCG TCATCATCAT CGGCGGCGCT GCAGTCTACG CCTGGATGGG CGGCAGTCAC
GCTGTCCTTT CCGGCGATGG TCCGAAGATA ATCCTTGCGG ATAAGGCGCC GGTGAAGGTC
GTACCCGAAG AGAAGGGCGG CAAGACTGTG CCGAACCAGG ACAAGGCAGT TTACGACCGC
GTTGCCGGTG TCTCGGGCAA GGCGCCGCTC CAGCAGAGCC TGGTCTCCTC GACCGAGGAG
CCGATGGATG TGGTGCAACG TACGCTGACC CCGGAGACCT TACCGCTGGA AGGTCGCGCA
GAAGAGGGCG CATTGCAGGG TGCGTTGCCC GGCGAGGAGG ACGAAGTCGC TCGCCTGCTG
CCGGATGGAG ATGCCGGCAA TGCGACGGCG GATGCAGAGG AGGCGGTTCC AGCCGTTGCG
CCGCGCAAGG TCCGCACCAT GATCGTCAAG CCGGACGGCA CGCTCGTCCC GCGTGAAGAG
CCGGCGCAGC AGCCGGAAAG CGTGGCGGTT GCGCAGCCGA TTCCGGCTCG TGCCTCCGGC
GAAGCCGTTG CGGCCGGCTC ACAAGGCGAA GCAACCGTGG CCGGAGCCGG AGCCGATCTT
CGGGCAGCCG CGCTGCCGGA TCCGTCCAAC GCAGCGGCTG ACGGGGCGAC CCCTGTTCGG
AGTGTGAAAA CGTCCGCGCT GCCTCAGGCA CGCCCGGCCG CTCAGCCGCA AGCGGCTGCC
TCCAACGCGA CTCCCGCGGA GAACCAGCCC GCCACCGAAA CGGCAGCCTC GCCGCTGGCG
ACGGCTTCGG TTCCGGCCGG CAGCTACGTG ATTCAGATTG CTTCTCTGCC TTCGGAGGCC
GAGGCGCAGA AGAGCTATAA CAATCTTTCC TCCAGGTATG CGAGCGTGAT CGGCGGTCGC
GGCGTGGACA TCCGCAAGGC GGAGATCGCT GGCAAGGGCA CGTATTACCG GGTCAGGATA
CCGGTCGGTA CGCGGGAGGA GGCCAATTCG CTTTGCTCTC GCTACAAAAG CGCGGGCGGA
AGCTGCCTCG TAACGAAGTA A
 
Protein sequence
MADKQFARSG PAEFDRLADD DPLSELARIV GYDARPAVQQ LQELQRHQEA MRREPVFDLE 
EELLRAFDSY DAPQALPVRP DSGLVGHPSD GMSDDREHAV FEEQFAPDSA ATDAVPAVEK
VAVGAEYASI ETPIEEAPSD DAAFASERYS PASNADVAER AMASHQEPAV DLERELELSL
GYDASPDDTD AVQSVVVPAD ANPAAGLSDE PVFADWQEPL SSKLAVAGKL AERAGLAEPV
YVDMAGHFEG VERADIAAAP AEMLPQQAAS ADADRLLADV ERFPVPAAAE AAAPVVAEIQ
DRPAPVVKKN NFPFTPTFSR ATPVASPGGV SQQRAFATPA VDAVIASAAA VGPVSAVPSQ
PPEDARALRA EEAPPVLQDS SPEAEVEPSF DIEDFELELA DIALDLADGP VDEAPASVVA
AHPAVSHPAL VETPAQPYLQ ALNMPGDPET IVRRPQNPAP EPFVPVEPEM KPAEVVLPFD
PAMIGETETG VAPIAELDVP QLPAVEAEGK AAGYPADYDL DIDAEMAQLF SAPASPARSA
ADHASAPVSA DKAGSTFASA DDFDEFEKAM EEDFQRSMAE RRTSVPEAEG MTVMPHAQAE
EYVEEGYGRR SQRMMMLAAS VAGVIIIGGA AVYAWMGGSH AVLSGDGPKI ILADKAPVKV
VPEEKGGKTV PNQDKAVYDR VAGVSGKAPL QQSLVSSTEE PMDVVQRTLT PETLPLEGRA
EEGALQGALP GEEDEVARLL PDGDAGNATA DAEEAVPAVA PRKVRTMIVK PDGTLVPREE
PAQQPESVAV AQPIPARASG EAVAAGSQGE ATVAGAGADL RAAALPDPSN AAADGATPVR
SVKTSALPQA RPAAQPQAAA SNATPAENQP ATETAASPLA TASVPAGSYV IQIASLPSEA
EAQKSYNNLS SRYASVIGGR GVDIRKAEIA GKGTYYRVRI PVGTREEANS LCSRYKSAGG
SCLVTK