Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_2969 |
Symbol | |
ID | 5323846 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 3115874 |
End bp | 3117529 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640791920 |
Product | HemY domain-containing protein |
Protein accession | YP_001328633 |
Protein GI | 150398166 |
COG category | [S] Function unknown |
COG ID | [COG3898] Uncharacterized membrane-bound protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCGAA TACTGTTCTT CGTCCTGCTC GTCCTTTGCC TTGCTCTCGG CTTTGCCTGG CTCGCGGATC GCCCGGGGGA ACTGTCGCTA ATCTGGCAGG GTCAGCTGAT CGAGATGAGC CTGCTGCGCG CCGCCTCGCT CCTGATTTCG GTTTTCGCCG CCGTTCTGAT CGCCGTCTGG CTGCTCCGCA CGATCTGGTC GTCTCCCCAT ACGGTCACGC GCTATTTTCG CGCCCGCAAG CGCGACCGTG GCTATCAGGC GCTGTCGACC GGCCTGATCG CAGCCGGCGC CGGTGATGCC AATCTCGCAC GCAAGATGAC CGCGCGCACG CGTAGCCTGA TCAGCTCCGA CCAGGAGCCG CTGATCCATC TCCTGGAAGC GCAGACCTCG CTGATCGAAG GCAAATACGA CGACGCGCGC AAGAAATTCG AGTTGATGGC GGATGATCCT GAGACGCGCG AGCTCGGCCT GCGCGGTCTT TACCTCGAAG CCAAGCGGCT CGGCGCAAAT GAAGCCGCGC GCCACTATGC CGAGCGTGCC GCCGAAAAGG CGCCGCACTT GCCGTGGGCG ACGCTCGCAA CGCTCGACCA TCGCAGCCAG GCGCGTCAGT GGGACGAGGC TATCCGGCTG CTCGATCAGA GCCGCGCCGC CAATGTGCTG GAAAGGAAGG AGGCGGATCG CAAGAAGGCC GTTCTCCTGA CAGCGCGGGC GATGGAACAA CTGGAGGCCG ACCCGAAATC CGCACGTGAC GACGCAAGGG CGGCGCTGAA GCTCGACGAC AGTCTCGTGC CGGCGGCGCT GGTTGCCGCC AAGGCGCTGT TTCGCGAGGA CAATCTGCGC AAAGGCGCCT CGATCCTCGA AAAAATGTGG AAGTTGGATC CCCATCCGGA GATCGCGCGT CTTTACGTGC GGGCGCGCGG CGGCGATTCC GCCCTCGATC GCCTGAAGCG CGCAAAAAGA CTTGAAACGC TTCGCGGTAA CAATGCTGTT TCATTGGCCA CTGTCGCCGA GGCGGCGCTC GAAGCGCGCG AACTCGCGCT TGCGCGGACG AAGGCAGAGG CCGCCGCCCG GATCGACCCA AGGGAAAGCA TCTTTCTGCT GCTCGCGGAT ATAGAAGAAG CCGACACCGG GGACGAGGGC CGCATCCGTT ACTGGATGTC GCAGGCGCTC AGGAGCCCGC GCGATCCAGC CTGGACCGCT GACGGTGTCA CATCGCCGAG CTGGCTGCCG GTCTCGCCGG TCACCGGTCG CCTCGACGCG TTCGAGTGGA AAGCTCCACC GGCGCGGCTA CCGGCCGCAA CCGAAGAAGG GTATCTGAAC CCGGACGCGG CAATTCGCAG CCTCCCGCCC GTCGCAACCG TGCCTCGCCC GGCCACTGAA CCCGAGACTG AAAGCGCCGT CGATGCCGCC CCGCCGGCAG AGCGCGTTGA GGCCGAGCCC GAGAAAGCCA TCACCGTTCC CGACCCGAAC GAAACGGTCG CCCCTCCAGC CGGGGATCCG AACCGGAAAG AACCGGTCCC GGCGGTCGCC TCGCCCGCCA AAGGCAAGGA TTCCGAGGAG GCGCCGGATC CCTTTTTCGG CCGACCTCCG GACGATCCGG GCGTACGCGA GCCGGCGTCG ACGGAACAGA ACACCGGCAA CTTCCGTCTT TTCTGA
|
Protein sequence | MIRILFFVLL VLCLALGFAW LADRPGELSL IWQGQLIEMS LLRAASLLIS VFAAVLIAVW LLRTIWSSPH TVTRYFRARK RDRGYQALST GLIAAGAGDA NLARKMTART RSLISSDQEP LIHLLEAQTS LIEGKYDDAR KKFELMADDP ETRELGLRGL YLEAKRLGAN EAARHYAERA AEKAPHLPWA TLATLDHRSQ ARQWDEAIRL LDQSRAANVL ERKEADRKKA VLLTARAMEQ LEADPKSARD DARAALKLDD SLVPAALVAA KALFREDNLR KGASILEKMW KLDPHPEIAR LYVRARGGDS ALDRLKRAKR LETLRGNNAV SLATVAEAAL EARELALART KAEAAARIDP RESIFLLLAD IEEADTGDEG RIRYWMSQAL RSPRDPAWTA DGVTSPSWLP VSPVTGRLDA FEWKAPPARL PAATEEGYLN PDAAIRSLPP VATVPRPATE PETESAVDAA PPAERVEAEP EKAITVPDPN ETVAPPAGDP NRKEPVPAVA SPAKGKDSEE APDPFFGRPP DDPGVREPAS TEQNTGNFRL F
|
| |