Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_2682 |
Symbol | |
ID | 5323551 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 2783154 |
End bp | 2784980 |
Gene Length | 1827 bp |
Protein Length | 608 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640791626 |
Product | hypothetical protein |
Protein accession | YP_001328347 |
Protein GI | 150397880 |
COG category | [S] Function unknown |
COG ID | [COG3108] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.043341 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.427449 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAATT TGTTCGGTTT GGAGGAACCA GTCGGTTCCT TTGCGCGCAG AGTCTGCTCG GCGGTCGCTC GAAAGGCGCC GCAGGTCCTG GCGTCTATCG CGCTGGTATG CTCCCTTGTC ACGCCTGGCA TGGCGCCTCC GGTCGAGGCC GCCGGACAGA CACGCACCCT CAAGCTGTAT TTCATCCACA CCAAGGAAAA GGCGCAGATC ACCTTCAAGC GCAATGGTCG CTACGATCAG AAGGGGCTGC AGCAGATAAA CCGTTTTCTT CGCGACTGGC GGCGGAACGA GCCGACAAAA ATGGATCCGC GCCTGCTTGA TCTCGTCTGG GAAGTTTATC AGAAGAGCGG TTCGCGCGAT TATATCCACG TCGTTTCGGC TTATCGTTCA CCGGCGACGA ACGGCATGTT GAGGTCACGC TCCAAGGGCG TCGCCAAGAA GAGCCAGCAT ATGCTCGGCA AGGCGATGGA CTTCTATATT CCGGATGTGA GGCTGAAGTC GCTGCGCGAG ATCGGCATGA AGTTCCAGGT CGGCGGCGTC GGCTACTATC CGACCTCCGG TTCGCCCTTC GTGCACATGG ATGTTGGCGG GGTACGCGCC TGGCCGCGCA TGACGCGCAA CGAGCTTGCG CGTCTCTTCC CCGATGGCGA GACCATGCAC ATCCCCTCCG ACGGCAAGCC GCTGCCCGGC TATAATCAGG CCGTGGCCGA TTATAAGCGC CGCGTCGGCG CGTCTGCGAT CCAGGTCGCG GGCGGCGGTG CCACGGGGTC AGGCGATACC GGAAAGCGCC GCAACCTCTT TGCCGCGCTC TTTGGCGGAG GCGGCGACGA GGATGAAGAA CCCGCGGCGA TTGTCGCCGG TGGCGATGAA GAGGCTCCGG CCAAGGTCCG TACCGCGTCC GCGCCGGCGG CCCAGGATGC TCTCCCGGGC GTTGCAGGGT CTGCGGCGGG TTCGCAGCAG CAGGACATCG ACGCACCAGT GCCTGCCGTG CGACCGGCCT TCAAGGAAAC ACCGGCTGAT GGCGGCGTCG CTGTAGCGCT CGTCGCTCCG GAGAAGAACA GCGCTCAGGA GGCGTTGGCT GCGGCCATGC CGCAGACGCC GGCCGTGCCA TCGGAATTTG CGGATCTGAG CACCCTCAAG GTTCCTGTCC CGCAGATGCT CGACCGGCGC GACATGAACG CTCTGATCGC CAACGAGACG CTCGTCGCCG CGGCCGGCGG TGAGGCCGAG GAGTTCGGGT TCGTGCCGGT GCCGGGAATG CGCCCGGCAG GTGAGGAGGC GCTTGCGGCC GTTGCTCAGG CGGACGTCAC CGTTCCTCTC TTTGAAGACC GGCCCGCGCT CGCTTCCGCG GCACAGGCGC CGAACGCAAC CGGTGCGACC GGGTCACGCG TGGCTCTCGC CGCGCCGACA ACCGAGAACC GGGCGCCTGC AATTCCTCCT GCGATCGAGC TTGCCGCCTA TGCGCCGCAA CCGGGTTCGG CTGCGCGTGA GGCGATCTTC GACAGTGTCT TCGACCCAGA CGCGGATGCC CCCACGAAAG GCGCCCGCCC CAAGAGGCAG GATGCCGAGG CGAAAAGCCG GTCTTCCGTC CGCACCGAGC CGAAGCTGAC GCAGAAGATC ATTTCCGAGT GGGCGCTCTC GGCGGGTCGT GTGGCGACCC TGTCCAAGCC GGTCAAGGCG CCGCGCTTCG TCAGCAAGTC GCTCCGCGTC GCTCCGACGA CCGTTTACGC GGCCGGCTTC ACCAGCAATA ACGGCGCGGT CGATACGGCG CGCTTCAGCG GCAGTGCCGT CAACTTCATG GAAGTCAAGA AGTTCGACAC CAACTGA
|
Protein sequence | MPNLFGLEEP VGSFARRVCS AVARKAPQVL ASIALVCSLV TPGMAPPVEA AGQTRTLKLY FIHTKEKAQI TFKRNGRYDQ KGLQQINRFL RDWRRNEPTK MDPRLLDLVW EVYQKSGSRD YIHVVSAYRS PATNGMLRSR SKGVAKKSQH MLGKAMDFYI PDVRLKSLRE IGMKFQVGGV GYYPTSGSPF VHMDVGGVRA WPRMTRNELA RLFPDGETMH IPSDGKPLPG YNQAVADYKR RVGASAIQVA GGGATGSGDT GKRRNLFAAL FGGGGDEDEE PAAIVAGGDE EAPAKVRTAS APAAQDALPG VAGSAAGSQQ QDIDAPVPAV RPAFKETPAD GGVAVALVAP EKNSAQEALA AAMPQTPAVP SEFADLSTLK VPVPQMLDRR DMNALIANET LVAAAGGEAE EFGFVPVPGM RPAGEEALAA VAQADVTVPL FEDRPALASA AQAPNATGAT GSRVALAAPT TENRAPAIPP AIELAAYAPQ PGSAAREAIF DSVFDPDADA PTKGARPKRQ DAEAKSRSSV RTEPKLTQKI ISEWALSAGR VATLSKPVKA PRFVSKSLRV APTTVYAAGF TSNNGAVDTA RFSGSAVNFM EVKKFDTN
|
| |