Gene Information |
Locus tag | Smed_0043 |
Symbol | |
ID | 5320870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 44129 |
End bp | 45304 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640788974 |
Product | CBS domain-containing protein |
Protein accession | YP_001325738 |
Protein GI | 150395271 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
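The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Smed_0043).
length_bp = gene_length(44129, 45304)
print(length_bp)                  # 1176
print(protein_length(length_bp))  # 391
```

Both results agree with the Gene Length (1176 bp) and Protein Length (391 aa) rows.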
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0313302 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000000114145 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCGACT TCAAGACAGA GCCGGCCGCT GTCGCGACCG AGGAGGCTGA AGCCTCCGGC GACGCCGAGG CCGGTAGTAG TACCGCCGCC CGATCCGAGG GCAGCAAATC CACATCATCC TTCTGGAGCC GTGCCGCGCG CCTGTTGCGC GGTGTGAGTC CATCAAGCCT GCGCGAGGAT CTTGCCGACG CGCTGATGAC CGACACCGGA GGCAATGCGG CCTTCTCGCC CGAAGAGCGG GCGATGCTCA ACAACATCCT CCGCTTTCGC GAGGTTCGTG TTGAAGACGT GATGGTTCCG CGAGCCGACA TAGAGGCGGT GGACCAGAAC ATCACCATCG GCGAACTCAT GGCCCTCTTC GAGGAGTCTG GTCGCTCGCG TATGCCGGTC TACAGCGAAG GGCTCGACGA TCCCCGCGGC ATGGTCCACA TCCGCGATCT CCTTGCCTAT GTGGCCAAGC AGGCGCGCAA CCGGCGCCGC AACGGCAAAG CTCCAACGGC GCCGACGACC GCAACGACCA CGAATGGCGA CAAGCCCGAA AAGGCCCCCC GACAGCAGAA GCCGGGTTTC GATCTCTCTC GCGTTGACCT CGACAAGACG GTCGAGGAGG CGGGAATCAT CCGTCAGCTG CTGTTCGTGC CGCCGTCGAT GCTTGCCTCG GATCTCATGC AGCGCATGCG CGCTGCGCGC ATTCAGATGG CTCTCGTCAT CGACGAATAC GGCGGGACGG ACGGTCTTGT ATCGCTCGAG GACATCGTCG AGATGGTGGT CGGCGATATC GAGGATGAGC ACGACGACGA GGAGGTGATG TTCGCGCGCA GCTCCGACGA CGTCTTCATC GCCGACGCTC GTGTGGAGCT GGAGGAAATC GCCGAGGCGG TCGGGCCGGA CTTCGATGTA CGCGAGCAAC TCGAGGACGT CGATACGCTT GGCGGTCTCG TTTTCGCATC GCTCGGCCGG ATTCCCGTTC GAGGCGAGGT GGTGCAGGCG ATTCCCGGTT TCGAGTTCCA GATACTCGAT GCGGATCCAC GTCGCGTCAA ACGCGTCAGG ATCATGCGCA AGCGCCCGTC TTCGCGCCGC CGCCCGCCGA AGGTCGAGAA GGAGCCGCTG CCAGAGGCGT TTGCCACGAC CGGCGCCACG GGCGCCGGTG TCCGGCCTCC GGCTTCGTTG GAATAG
|
Protein sequence | MSDFKTEPAA VATEEAEASG DAEAGSSTAA RSEGSKSTSS FWSRAARLLR GVSPSSLRED LADALMTDTG GNAAFSPEER AMLNNILRFR EVRVEDVMVP RADIEAVDQN ITIGELMALF EESGRSRMPV YSEGLDDPRG MVHIRDLLAY VAKQARNRRR NGKAPTAPTT ATTTNGDKPE KAPRQQKPGF DLSRVDLDKT VEEAGIIRQL LFVPPSMLAS DLMQRMRAAR IQMALVIDEY GGTDGLVSLE DIVEMVVGDI EDEHDDEEVM FARSSDDVFI ADARVELEEI AEAVGPDFDV REQLEDVDTL GGLVFASLGR IPVRGEVVQA IPGFEFQILD ADPRRVKRVR IMRKRPSSRR RPPKVEKEPL PEAFATTGAT GAGVRPPASL E
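The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, with a GC-fraction helper and a rough sanity check for a complete coding sequence. All function names are hypothetical, and the check is deliberately simplified: it accepts only ATG as a start codon, although translation table 11 also permits alternative starts such as GTG and TTG.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove all whitespace between blocks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: length divisible by 3, ATG start, stop codon at the end."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# The first two blocks of the gene sequence above, used as a short demo.
demo = clean_sequence("ATGAGCGACT TCAAGACAGA")
print(len(demo))          # 20
print(gc_fraction(demo))  # 0.45
```

Applied to the full gene sequence, the same helpers could be used to compare the computed length and GC fraction against the Gene Length and GC content rows.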