Gene Information |
Locus tag | Smed_4298 |
Symbol | |
ID | 5318461 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 791659 |
End bp | 792837 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640776103 |
Product | hypothetical protein |
Protein accession | YP_001313036 |
Protein GI | 150376440 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
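The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates (the convention under which End bp minus Start bp plus 1 matches the listed gene length) and an unspliced CDS ending in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(791659, 792837)
print(length)                  # 1179
print(protein_length(length))  # 392
```

Both values agree with the Gene Length and Protein Length rows of the record.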
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.304532 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACG ACACTACCAA GGCAGTAAAG GCCGTGGCGC AGAACCAGCG TAAGGATCCG GCGGAACAGG TTTTGCAGGT CGAACGCCGG GACTATACCG AGCTTGCTCC CGTAGACCGG CCCCGCAGCC AGTCGCTCGA TGGCTTCGAC GAGATCTACA CCGACATTGT CGATTATATC GTCCGCTGCA CACATCGCAT CTGGGACGAG CGTGACATCG GGCTGATCTA TACCCACTAT ACCCACAACT GCGTCCTCTA CGGCACGCTC GGCACCATCT ATAATCGCGA GGATGTGGTG CGCGACACGA TCCAGCGGCT GGTCTCGCTG CCGGAGCGGC GCGGCATGGC GACCCAGGTG CTCTGGAGCG GCAACGATGT CGAAGGCTTC TACACCTCCC ATCTCGTCAC CGGGTCCGGC CGGCACACGC AATATGGCCA TTTCGGCCCG CCGACCGGCC GCACTTTCGT GGCGCGTACG ATCGCGGACT GCATGATCCA CCGCAACAAG ATTTACCGTG AATGGGTGGT GGCCGATACG ATGGCCATCA TCAAGCAGCT CGGCCTCGAT CCGCACGCCT TCGCCGGAAA GCTTGCCAGG TCTTCATTTG ACAAAGGACT TCTCTCGCTT GATATCGGCG AGAACCGCCG CCTGCTCGGC CAATATCCGC CGGAGGCGGA AGCGGACGTC TCGATCGCGC ATAACGATAT CGAAGCCCAT ACGTTGCGTT GGCTGCACGA AGTGTTCAAC GGGCGCATGT TCGGCAGGAT CAAGGATGTC TACGCTCCCA CCTGCCAGTA TCACGGCCCG CTGATGAAGG AGCTTTACGG CGTGGCGGCC GTCACCCATC AACACCTTGG CCTCATCGGC TCGATCCCCG ATGCCGCCTA TTCTGCGCAG CACATCTGCT CGAACCCTTG CGAGGAAGGC GGCGTCAAAG TCGCTGTACG CTGGCTGATG GAAGGCCATC ACCTCGGCTA CGGTATATTG CAGGACCTCG GCGAGCCGAC CGGCGCCCGA TTGCAGGTCA TGGGCATGAC CCATTACCAC TACAAGGACG GAAAGATCGT CGACGAATGG AATGTCTACG ACGAGCTGTC GCTGCTGGTG CAGGTAAAGC TTGCCGAAAT GGCTCGCCAG TCGGGGTCAG CGAATGCAGA GGCTCACGGG GCCAGTTAG
|
Protein sequence | MSDDTTKAVK AVAQNQRKDP AEQVLQVERR DYTELAPVDR PRSQSLDGFD EIYTDIVDYI VRCTHRIWDE RDIGLIYTHY THNCVLYGTL GTIYNREDVV RDTIQRLVSL PERRGMATQV LWSGNDVEGF YTSHLVTGSG RHTQYGHFGP PTGRTFVART IADCMIHRNK IYREWVVADT MAIIKQLGLD PHAFAGKLAR SSFDKGLLSL DIGENRRLLG QYPPEAEADV SIAHNDIEAH TLRWLHEVFN GRMFGRIKDV YAPTCQYHGP LMKELYGVAA VTHQHLGLIG SIPDAAYSAQ HICSNPCEEG GVKVAVRWLM EGHHLGYGIL QDLGEPTGAR LQVMGMTHYH YKDGKIVDEW NVYDELSLLV QVKLAEMARQ SGSANAEAHG AS
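The sequences above are printed in space-separated groups of ten, so they need cleaning before any computation. The following is a minimal sketch, not the database's own method: it strips the grouping whitespace, computes the GC fraction, and translates a CDS with the standard codon assignments. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in permitted start codons; since this gene starts with ATG, no start-codon special case is handled here. All names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first, second, and third base each cycle T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the grouping whitespace and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 21 bases of the gene sequence listed above.
prefix = clean_sequence("ATGAGCGACG ACACTACCAA G")
print(translate(prefix))         # MSDDTTK
print(gc_fraction(prefix[:20]))  # 0.5
```

The translated prefix matches the first seven residues of the listed protein sequence. Running `gc_fraction` on the full cleaned gene sequence gives a value to compare against the GC content row, which is presumably rounded to a whole percent.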