Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_1646 |
Symbol | |
ID | 5322504 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 1739655 |
End bp | 1741526 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640790586 |
Product | hypothetical protein |
Protein accession | YP_001327318 |
Protein GI | 150396851 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.327516 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.459319 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGTC CCGACATTCC CGTCACGATC TCCGGTGATC CGAAGGGCTT CGAGTCCGCG CTTGTCCGGG TGCGGGCACT CTCGAAGTCG ACGGCAACTG ACGTCGTTGC ATCCTTCGGC CGGATCAAGA ACCTCGTGGC CGGCGGCGCC GGTCTCGTGA CCGGGCTTGT CTCCGCCGCC AGCGTCACCG CATTGCGCGA TGCAGCGGGC GCGATTGCCT CGATCGGCGA CGAGGCGCGT CGGGCCGGCC TCGACGTCAA GAGCTTCCAG GAGCTGAAGT TCGTCGCCGA GCAGAACCGT GTCGGCGTCG ACGCGCTGAC CGACGGCATC AAGGAATTGA ACCTTCGGGC CGACGAATTC ATCGTCACCG GAGGCGGATC GGCGGCCGAG GCTTTCCAGC GCCTCGGCTA CTCGGCCGAG GACCTGAAGC AGAAGCTCGA AGATCCGGCT GATCTCTTCA CCGAGATCAT CGGTCGCCTG GGCGAGCTCG ACAAGGCGGC ACAGATCCGC ATCATGGACG AGATCTTCGG CGGCGCGGGC GGCGAACAGT TCGTGCAGCT GATCGAGGCG GGTGAAGCGG GCATCCGCGA CACCATCAGG GCCGCGAACG ACCTGGGCAT CGTTGTTGAC GAGCAGATGA TCCAGAAGGC TGCAGACGTC GACCGCAAGT TCAACATGCT TGCGACGACG GTCGGCACGA AGTTGAAATC CGCCATCGTT TCGGCGGCTG ACAGCCTAGC GGAATTTATC GACGGCTTCC GTGATTTCCA AAACCAGATG AACAGCACGC TTCAGGGCAG GCAAGCCAAA ATCGGCGAGC GTCAGCTCGA GATCGAGAAT GAAATCCTCA AGAAGAAGGA GGCGCAGGCT CGACAGGACG AGAAGCTCTC CGATGTCGCC AGGAAGCTTG GTTTTGAAAA CAGTAAGAAC GCCAACCTTG CCGGCTACAC CGGGCAGATA GAAGCCCTGA AGGAAGAGAG CCGGAAACTC GCCGAAGAAG AGGCGAAGAT CGTCAATATC CTGAGCGATC GCCTCCAGCC GATGAACCGC CCGGCCGAGA GGACCTGGAC GCCGATCGAT ACTGAAGAAA AAGGCGGCGG CCGGTCCAAG AAAGTCTCGG AAGCCGGGAA AGAAAAGAAG GCGATCGACG ACGTGATCGC GTCGTTGCGT GAGGAGTTGG CGATCATCGG CCTCACCGAC ATCGAGCGGG AGCGGACAAT TGCGCTGCGC GAGGCGGGTG TCGAGGCGAC CTCGAAGGAA GGCCAGCAGA TCTCGGCGCT CATCGACGAA AAGTACCACC AGCTCGCAGC TGAGGAGGCC TTGGCCGAGC AGTATGAGCG CAGCGAAGAA GCGGCCGAGC GAATGGGGCA GGTCCTCGAT GATCAGCTCA TGCGCATCGT CGACGGCAGT TTCGATGCGA AGGAGGCGAT TGCGGCGCTG CTCACCGAGA TCATCAATGT CCAGACGAAC GGGAAGGGGC TCTTCGGTTC GCTGTTCAGC TCCATTTTCG GCGGTGGTAG CGGTTTTGGC TCCAACTTCG TGCCGACCAC AACGCTCGGT GACTTCCTCG GCTATGGCGG TGCGCGCGCT GGCGGCGGTG ATGTTTCTCC CGGGCGCATC TACCGGGTGA ACGAATATGA GGACGAGTTC TTTGCTCCGA CCAGCCACGG CCGGATCATC GCGCCGAGCA AGCTGTCCGG CGCGGCGGCA GACAGAGAAG GCGGCGGCGG GCGCACCGTC GTTGAGATCG TACTGAGCAA GGATTTGTTG GCCAGCATCC TCGAGCAGAC CGGCAATCAG ACCGTGCGCA TCGTGCGCAG CAACGAGGAA GCCCGGACGA ACTATCGCCT GAATGGCGGG GAAGATTTCT GA
|
Protein sequence | MSRPDIPVTI SGDPKGFESA LVRVRALSKS TATDVVASFG RIKNLVAGGA GLVTGLVSAA SVTALRDAAG AIASIGDEAR RAGLDVKSFQ ELKFVAEQNR VGVDALTDGI KELNLRADEF IVTGGGSAAE AFQRLGYSAE DLKQKLEDPA DLFTEIIGRL GELDKAAQIR IMDEIFGGAG GEQFVQLIEA GEAGIRDTIR AANDLGIVVD EQMIQKAADV DRKFNMLATT VGTKLKSAIV SAADSLAEFI DGFRDFQNQM NSTLQGRQAK IGERQLEIEN EILKKKEAQA RQDEKLSDVA RKLGFENSKN ANLAGYTGQI EALKEESRKL AEEEAKIVNI LSDRLQPMNR PAERTWTPID TEEKGGGRSK KVSEAGKEKK AIDDVIASLR EELAIIGLTD IERERTIALR EAGVEATSKE GQQISALIDE KYHQLAAEEA LAEQYERSEE AAERMGQVLD DQLMRIVDGS FDAKEAIAAL LTEIINVQTN GKGLFGSLFS SIFGGGSGFG SNFVPTTTLG DFLGYGGARA GGGDVSPGRI YRVNEYEDEF FAPTSHGRII APSKLSGAAA DREGGGGRTV VEIVLSKDLL ASILEQTGNQ TVRIVRSNEE ARTNYRLNGG EDF
|
| |