Gene Information |
Locus tag | Smed_1159 |
Symbol | |
ID | 5322005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 1234734 |
End bp | 1237634 |
Gene Length | 2901 bp |
Protein Length | 966 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640790100 |
Product | sporulation domain-containing protein |
Protein accession | YP_001326845 |
Protein GI | 150396378 |
COG category | |
COG ID | |
TIGRFAM ID | |
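The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the last codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(1234734, 1237634)
print(length)                  # 2901
print(protein_length(length))  # 966
```

Both values agree with the Gene Length (2901 bp) and Protein Length (966 aa) fields listed above.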
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.30865 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.808052 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCAGACA AACAATTCGC ACGAAGCGGA CCGGCGGAAT TCGATAGGTT AGCCGATGAT GATCCGTTGA GCGAACTTGC CCGCATTGTC GGTTACGATG CTCGGCCAGC CGTTCAGCAA TTGCAGGAGT TGCAGCGCCA TCAGGAGGCG ATGCGGCGCG AGCCGGTTTT CGATCTCGAG GAGGAGCTCC TTCGCGCGTT CGACAGTTAC GATGCTCCCC AGGCACTGCC CGTCCGGCCC GATAGCGGCC TTGTCGGGCA TCCTTCGGAC GGCATGTCCG ATGATCGGGA GCATGCCGTA TTCGAAGAAC AGTTCGCTCC GGATTCAGCC GCCACCGATG CCGTTCCGGC CGTCGAAAAG GTCGCCGTCG GTGCCGAGTA TGCATCCATC GAGACGCCGA TCGAAGAGGC GCCGAGTGAC GATGCAGCAT TTGCGTCAGA GCGGTATTCC CCTGCATCCA ATGCCGATGT GGCCGAACGC GCAATGGCAT CGCATCAGGA GCCTGCCGTC GACCTAGAGC GCGAGCTCGA ATTGTCGCTG GGCTACGATG CGTCACCTGA TGACACGGAC GCCGTGCAGT CGGTTGTTGT CCCGGCTGAC GCAAACCCTG CTGCCGGCCT AAGTGATGAG CCGGTTTTCG CCGATTGGCA GGAGCCGCTC TCCTCAAAGC TGGCCGTCGC CGGCAAACTC GCCGAGCGGG CCGGGCTGGC GGAACCGGTC TATGTCGACA TGGCCGGTCA TTTCGAGGGT GTTGAGCGTG CGGATATTGC CGCGGCGCCT GCAGAAATGC TTCCGCAGCA GGCGGCTTCA GCAGACGCCG ACCGGCTCCT GGCCGATGTC GAGCGCTTCC CGGTCCCGGC GGCAGCAGAG GCTGCGGCCC CCGTCGTGGC CGAAATTCAG GATCGGCCTG CTCCGGTCGT GAAGAAGAAT AACTTCCCCT TTACGCCGAC ATTCAGCCGC GCGACGCCGG TGGCGTCACC TGGCGGTGTT TCGCAGCAGC GCGCTTTTGC AACGCCGGCT GTCGATGCGG TCATCGCGAG TGCGGCCGCT GTTGGCCCGG TGTCTGCCGT GCCGAGCCAA CCCCCGGAAG ATGCACGCGC ACTGCGTGCG GAAGAGGCGC CTCCGGTGCT TCAGGACAGC TCGCCTGAAG CGGAAGTCGA GCCGAGCTTC GACATCGAAG ACTTCGAACT GGAGTTGGCC GACATAGCGC TCGATCTTGC GGACGGCCCT GTTGATGAGG CTCCTGCGAG CGTCGTCGCT GCGCACCCGG CCGTCTCCCA CCCGGCGCTC GTCGAAACGC CTGCACAGCC CTATCTGCAG GCTCTGAACA TGCCGGGCGA TCCGGAGACA ATCGTCCGCA GGCCGCAAAA CCCGGCGCCT GAGCCTTTTG TGCCCGTGGA GCCCGAAATG AAGCCTGCCG AGGTCGTATT GCCTTTCGAC CCGGCGATGA TCGGCGAGAC GGAGACGGGG GTCGCGCCTA TCGCCGAACT GGACGTCCCG CAATTGCCCG CGGTCGAGGC AGAGGGGAAG GCGGCCGGAT ATCCCGCCGA CTACGATCTC GACATCGATG CCGAGATGGC TCAACTGTTC AGTGCGCCTG CTTCCCCCGC CCGGAGCGCC GCAGACCACG CATCCGCGCC TGTTTCGGCG GATAAAGCGG GATCTACATT CGCTTCGGCT GATGATTTCG ACGAGTTCGA GAAGGCTATG GAGGAGGATT TCCAGCGATC CATGGCAGAG CGACGGACCT CGGTTCCGGA AGCCGAAGGC ATGACGGTGA TGCCCCATGC GCAGGCCGAA 
GAATATGTTG AAGAAGGTTA TGGCCGCCGC TCCCAGCGCA TGATGATGCT CGCCGCGAGT GTAGCCGGCG TCATCATCAT CGGCGGCGCT GCAGTCTACG CCTGGATGGG CGGCAGTCAC GCTGTCCTTT CCGGCGATGG TCCGAAGATA ATCCTTGCGG ATAAGGCGCC GGTGAAGGTC GTACCCGAAG AGAAGGGCGG CAAGACTGTG CCGAACCAGG ACAAGGCAGT TTACGACCGC GTTGCCGGTG TCTCGGGCAA GGCGCCGCTC CAGCAGAGCC TGGTCTCCTC GACCGAGGAG CCGATGGATG TGGTGCAACG TACGCTGACC CCGGAGACCT TACCGCTGGA AGGTCGCGCA GAAGAGGGCG CATTGCAGGG TGCGTTGCCC GGCGAGGAGG ACGAAGTCGC TCGCCTGCTG CCGGATGGAG ATGCCGGCAA TGCGACGGCG GATGCAGAGG AGGCGGTTCC AGCCGTTGCG CCGCGCAAGG TCCGCACCAT GATCGTCAAG CCGGACGGCA CGCTCGTCCC GCGTGAAGAG CCGGCGCAGC AGCCGGAAAG CGTGGCGGTT GCGCAGCCGA TTCCGGCTCG TGCCTCCGGC GAAGCCGTTG CGGCCGGCTC ACAAGGCGAA GCAACCGTGG CCGGAGCCGG AGCCGATCTT CGGGCAGCCG CGCTGCCGGA TCCGTCCAAC GCAGCGGCTG ACGGGGCGAC CCCTGTTCGG AGTGTGAAAA CGTCCGCGCT GCCTCAGGCA CGCCCGGCCG CTCAGCCGCA AGCGGCTGCC TCCAACGCGA CTCCCGCGGA GAACCAGCCC GCCACCGAAA CGGCAGCCTC GCCGCTGGCG ACGGCTTCGG TTCCGGCCGG CAGCTACGTG ATTCAGATTG CTTCTCTGCC TTCGGAGGCC GAGGCGCAGA AGAGCTATAA CAATCTTTCC TCCAGGTATG CGAGCGTGAT CGGCGGTCGC GGCGTGGACA TCCGCAAGGC GGAGATCGCT GGCAAGGGCA CGTATTACCG GGTCAGGATA CCGGTCGGTA CGCGGGAGGA GGCCAATTCG CTTTGCTCTC GCTACAAAAG CGCGGGCGGA AGCTGCCTCG TAACGAAGTA A
|
Protein sequence | MADKQFARSG PAEFDRLADD DPLSELARIV GYDARPAVQQ LQELQRHQEA MRREPVFDLE EELLRAFDSY DAPQALPVRP DSGLVGHPSD GMSDDREHAV FEEQFAPDSA ATDAVPAVEK VAVGAEYASI ETPIEEAPSD DAAFASERYS PASNADVAER AMASHQEPAV DLERELELSL GYDASPDDTD AVQSVVVPAD ANPAAGLSDE PVFADWQEPL SSKLAVAGKL AERAGLAEPV YVDMAGHFEG VERADIAAAP AEMLPQQAAS ADADRLLADV ERFPVPAAAE AAAPVVAEIQ DRPAPVVKKN NFPFTPTFSR ATPVASPGGV SQQRAFATPA VDAVIASAAA VGPVSAVPSQ PPEDARALRA EEAPPVLQDS SPEAEVEPSF DIEDFELELA DIALDLADGP VDEAPASVVA AHPAVSHPAL VETPAQPYLQ ALNMPGDPET IVRRPQNPAP EPFVPVEPEM KPAEVVLPFD PAMIGETETG VAPIAELDVP QLPAVEAEGK AAGYPADYDL DIDAEMAQLF SAPASPARSA ADHASAPVSA DKAGSTFASA DDFDEFEKAM EEDFQRSMAE RRTSVPEAEG MTVMPHAQAE EYVEEGYGRR SQRMMMLAAS VAGVIIIGGA AVYAWMGGSH AVLSGDGPKI ILADKAPVKV VPEEKGGKTV PNQDKAVYDR VAGVSGKAPL QQSLVSSTEE PMDVVQRTLT PETLPLEGRA EEGALQGALP GEEDEVARLL PDGDAGNATA DAEEAVPAVA PRKVRTMIVK PDGTLVPREE PAQQPESVAV AQPIPARASG EAVAAGSQGE ATVAGAGADL RAAALPDPSN AAADGATPVR SVKTSALPQA RPAAQPQAAA SNATPAENQP ATETAASPLA TASVPAGSYV IQIASLPSEA EAQKSYNNLS SRYASVIGGR GVDIRKAEIA GKGTYYRVRI PVGTREEANS LCSRYKSAGG SCLVTK
|
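The gene and protein sequences above are related by translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code (it differs in the permitted start codons). The sketch below shows one way to translate a DNA string and compute its GC fraction in plain Python. The helper names `clean`, `translate`, and `gc_content` are hypothetical, and the code assumes the input uses only the letters A, C, G, and T, optionally grouped with whitespace as in this record.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (e.g. the 10-base grouping) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate every complete codon; trailing bases that do not fill a codon are ignored."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First two 10-base groups of the gene sequence in this record.
print(translate("ATGGCAGACA AACAATTCGC"))  # MADKQF
print(gc_content("ATGGCAGACA"))            # 0.5
```

The translated prefix matches the start of the protein sequence listed above (MADKQF). Applying the same functions to the full gene sequence would be one way to check the reported GC content and protein length.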