Gene Information |
Locus tag | Smed_1413 |
Symbol | |
ID | 5322264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 1494057 |
End bp | 1495349 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640790355 |
Product | hypothetical protein |
Protein accession | YP_001327094 |
Protein GI | 150396627 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
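The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive (which agrees with the listed gene length) and that the final codon of this non-spliced CDS is an untranslated stop codon. The name `check_record` is hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information table above.
START_BP = 1494057
END_BP = 1495349
GENE_LENGTH_BP = 1293
PROTEIN_LENGTH_AA = 430


def check_record() -> dict:
    """Cross-check coordinates, gene length, and protein length."""
    # Assumption: coordinates are 1-based and inclusive.
    span = END_BP - START_BP + 1
    # Assumption: the last codon is a stop codon and is not translated.
    expected_protein_length = span // 3 - 1
    return {
        "span_bp": span,
        "length_matches": span == GENE_LENGTH_BP,
        "multiple_of_three": span % 3 == 0,
        "protein_length_matches": expected_protein_length == PROTEIN_LENGTH_AA,
    }


if __name__ == "__main__":
    print(check_record())
```

Under these assumptions the span is 1495349 - 1494057 + 1 = 1293 bp, which is 431 codons: 430 amino acids plus one stop codon.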
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0999393 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.104956 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGAAC TTTTCGTCAT TTTGCTTCTG CTTTTGATCA ACGCCTTTTT TGCGTTGTCG GAAATGGCAC TCGTGTCGGC AAGCAAACCG CTCCTTCGTC AGATGGTCAA GCAGGGAATC CCGCGCGCAG AAGCCGCACT CAGGCTTGCG GAAGATCCAG GCAAATTCCT GTCGACGGTG CAGGTCGGGA TCACCCTGGT CGGTATCCTG GCTGGCGCCT ATGGAGGCGC GACAATCGCC GCCAACATCG CGCCGTTCCT GAACGACATC GCCTGGATCA GCCCTTACGG CGATACGGTC GCGGTCGCCC TCGTCGTCAC CCTGATCACG TTTCTGTCGG TGGTCATCGG CGAGCTCATA CCGAAGCAGT TGGCGCTTCG AAACTCGGAA GCGCTGGCGA TGTTCGTCGC CCGTCCGATG GCGCTGCTTT CGCGTATCGT CGCCCCGGTA GTCTATCTGT TCGAAGGCGC GGCCAACCTT TCGATGCGTA TCATGGGAAT GAGGCCCGAG GACGCGGATC ACGTGACCGA AGAGGAAGTT CAGGCGATCA TGGCGGAAGG CGTCGAAAGC GGCGCCATCG AAAAGAGCGA ACACGAGATG CTGCGGCGGA TCATTCGCCT TGGCGACCGC AATGTAAAAA CGATTATGAC GCATCGCACC GAGGTGAGCT TCATCGACAT CCAGGACGAT CTGGAGACGA TCGGACACAA GATCCGGCAG TCCGGCCACT CGCGCTATCC GGTGGTCGAC GGGCCTGCGG GCGATGTGAT CGGGGCAGTC CTTGCAAAGG AGATATTGAA TGTTTCGCAA ACCGGAAAAT TCAATATCCG CGATTATGTC CGTGACATTC TCACACTGCC GGAGACGGCC TCCTGCTTAA AGGCGCTCGA AGCCTTCAAG ACGTCCAGCA TCAATATGGC CATGATCGTC GACGAATATG GGAGCACAGA GGGGATCATC ACCACCGCCG ATATCCTCGA GGCGATCGTG GGCATCATTC CATCAAACTA TGACGATTCC GAACATGCCC TCATTCACCT GCGCGACGAC GGCAGCTATC TCGTAGACGG ACGTACGCCA ATCGATGAGA TCCACCTTCA GATCGGCATC GAGGGCATTG ACGCCGACAG CGATTTCGAA ACCATCGCGG GCTTTCTGGT GCAGCAATTG CGCAAGTCGC CGGAAGAGGG CGACACGGCC GAGGCTCACG GCTATCGATT CGAGGTGATC GATATGGACG GCCGCCGTAT CGACAAAATC CTGGTCAGCC GAGCCGGTGA GGCACTTTCC TGA
|
Protein sequence | MAELFVILLL LLINAFFALS EMALVSASKP LLRQMVKQGI PRAEAALRLA EDPGKFLSTV QVGITLVGIL AGAYGGATIA ANIAPFLNDI AWISPYGDTV AVALVVTLIT FLSVVIGELI PKQLALRNSE ALAMFVARPM ALLSRIVAPV VYLFEGAANL SMRIMGMRPE DADHVTEEEV QAIMAEGVES GAIEKSEHEM LRRIIRLGDR NVKTIMTHRT EVSFIDIQDD LETIGHKIRQ SGHSRYPVVD GPAGDVIGAV LAKEILNVSQ TGKFNIRDYV RDILTLPETA SCLKALEAFK TSSINMAMIV DEYGSTEGII TTADILEAIV GIIPSNYDDS EHALIHLRDD GSYLVDGRTP IDEIHLQIGI EGIDADSDFE TIAGFLVQQL RKSPEEGDTA EAHGYRFEVI DMDGRRIDKI LVSRAGEALS
|
| |
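The sequences above are printed in space-separated blocks of 10 characters, so they need light cleanup before any analysis. The following is a minimal sketch, not the database's own method: the helper names `clean`, `gc_percent`, and `has_start_and_stop` are hypothetical, and the start-codon check is simplified to `ATG` even though translation table 11 allows alternative start codons. The stop codons used (TAA, TAG, TGA) are those of table 11.

```python
def clean(seq: str) -> str:
    """Remove the whitespace that splits a sequence into 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def has_start_and_stop(dna: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """Check that a CDS is a whole number of codons, starts with ATG,
    and ends with one of the given stop codons (simplified check)."""
    dna = clean(dna)
    return len(dna) % 3 == 0 and dna.startswith("ATG") and dna[-3:] in stops


if __name__ == "__main__":
    # First two blocks of the gene sequence listed above.
    first_blocks = "ATGGCCGAAC TTTTCGTCAT"
    print(clean(first_blocks))
    print(gc_percent(first_blocks))
```

Pasting the full Gene sequence value into `gc_percent` should give a figure comparable to the GC content field, and `len(clean(...))` can be compared with the Gene Length and Protein Length fields in the same way.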