Gene Information |
Locus tag | Smed_5128 |
Symbol | |
ID | 5319430 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 81810 |
End bp | 83120 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640776906 |
Product | hypothetical protein |
Protein accession | YP_001313838 |
Protein GI | 150377243 |
COG category | [S] Function unknown |
COG ID | [COG4655] Predicted membrane protein |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
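The coordinate and length fields above can be cross-checked with a short calculation. The sketch below assumes that the coordinates are 1-based and inclusive and that the reported protein length excludes the terminal stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Start bp and End bp fields of the record.
start_bp = 81810
end_bp = 83120

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the results agree with the Gene Length (1311 bp) and Protein Length (436 aa) fields.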
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.193766 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCAAGA GAACTGACGC TATCGATGAA GCGAGCCGTT CGTCTCGGTG GCAGAAGCCT CTGCGCCGGT TTCTGAAAGC CGAAGGCGGC GCCGTCGCCG TCATCGCCGC GGTTGCATTT CCCGTACTCG TCGGTGCAAT GGGTTTGGGC GCCGAGACGG GGTACTGGTA TCTGGAAAAG CGCAAGCTGC AGCATGCTGC CGACGTTTCG GCCTATGCCG CAGCGGTCCG CCATCGCGCG GGTGATCAGC AGTCCGCGCT TGAAGCCGCC GCACGCAGGG TCGCAGGCGG TTCGGGCTTT TCGCCGGGGG ACCTCACCTT GTCCACGGCT TCGGCCGCTG CCGGCGGCTC GAACAATGTG ACGGTCGAGC TGACCGAGAC CCATCCGCGC CTGTTTTCTT CCGTCTTCGG CACCGGAACC ATCACGATAA AGGCACGCGC AGTCGCCCAG GTCACAGGTG GTTCGAAGGC TTGCGTCCTC GCCCTGTCGA ACTCCGCGTC GGGCGCCGTG ACCGTCACCG GCTCGACGGA AGTCCAATTG TCCGGCTGCA GCGTGGTTTC CAATTCCAGC GCCTCCGACG CCTTTCTGAT GAGGAACGGC AGCGCGCTCA TGTCGACCGA TTGCGTCTAT ACCGTCGGCG AAGCCGTGAC GACGACCGGT CTGACGCTCA CCGGTTGCAG CAAGCCCGTC CAGCAGGTCC CGCCGACGCC GGATCCGTTT GCCTCCGTCC TCGAACCGGA TCCGCTGCAG ATTCAGCAAC TTCCTTGCCG TTCACTGAGT TATGTGTCGA ATCTGCTCTA TGTTTTCGAC AGGCTTGCAA GCCCGCTGTA TCCAGGCGGC GTCGAGGCGA TCAGGTTCTG CGGCGGACTC GACATCAAGG GAACGGTCAA ACTGAAGCCC GGCCTTTATA TCATAGACGG CGGCGAATTG ACGGTGACGG CCGGGGCAAA ACTCTCCGGC GATGGCGTTA CCTTTCTTTT CACCAATTCT GCAGCCGCCA ACCTGCTGGG CAACGGCGAC ATCGATCTGT CCGCTCCGAC GAGCGGTCCC TTTGCGGGCC TGTTATTCTT TTCCAGTCGG CTCAATACCG GGGTCGTCCA TCAGATAACG GGCAATTCCG AATCTACCCT GGTCGGAAAC CTCTACGCTC CGACGGGCCG CATCGACTTC ACCGGAAACT CGACCGTTTC GGGCGGATGT ACGCAGATCG TAGCCGATCA GGTCACTTTT ACAGGCAATT CCAGAATGGA GACCTGCGCT TCGCCGACAG AGGAGATCCT GGTCGGTCTG TCGGTATCGC TCATAGAGTG A
Protein sequence | MSKRTDAIDE ASRSSRWQKP LRRFLKAEGG AVAVIAAVAF PVLVGAMGLG AETGYWYLEK RKLQHAADVS AYAAAVRHRA GDQQSALEAA ARRVAGGSGF SPGDLTLSTA SAAAGGSNNV TVELTETHPR LFSSVFGTGT ITIKARAVAQ VTGGSKACVL ALSNSASGAV TVTGSTEVQL SGCSVVSNSS ASDAFLMRNG SALMSTDCVY TVGEAVTTTG LTLTGCSKPV QQVPPTPDPF ASVLEPDPLQ IQQLPCRSLS YVSNLLYVFD RLASPLYPGG VEAIRFCGGL DIKGTVKLKP GLYIIDGGEL TVTAGAKLSG DGVTFLFTNS AAANLLGNGD IDLSAPTSGP FAGLLFFSSR LNTGVVHQIT GNSESTLVGN LYAPTGRIDF TGNSTVSGGC TQIVADQVTF TGNSRMETCA SPTEEILVGL SVSLIE
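The relationship between the gene sequence and the protein sequence can be checked by translating the DNA codon by codon. The following is a minimal sketch, not the database's own method. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares, and it ignores table-specific start codon rules. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (standard amino acid
# assignments; assumed here to be sufficient for translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate the 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence from the record.
first_block = "ATGAGCAAGA GAACTGACGC TATCGATGAA"
print(translate(first_block))  # prints "MSKRTDAIDE"
```

The printed residues match the first block of the protein sequence field. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.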