Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5210 |
Symbol | |
ID | 5319512 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 172393 |
End bp | 173427 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640776988 |
Product | cellulase |
Protein accession | YP_001313920 |
Protein GI | 150377325 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.558539 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCCAC TCGTCGTGCT AATGCTCCTG CTGCTAGCTT ATCCGACGCA TGCCCAGGAG CCAGCGACGG TGGAAGCAAC GGCGTGGCAA AAGTATAAGA CGCGATTTCT CGATCCGGGC GGCCGGATTA TCGACGACGC CAACGGTGAT ATAAGCCATA GCGAGGGACA GGGTTACGGC CTTCTGTTGG CCTTTCTCGC GGGCAGCCGG GCCGATTTTG AACTCATCTG GTCGTTCACG CGCCGCGAAC TCCTCCTGCG CGACGACGGC CTTGCGGCCT GGAAATGGAG TCCGGGCGAA GCGCCTCATG TTTCCGATAC GAACAATGCA ACAGACGGCG ACATATTGAT CGCCTATGCC TTGGCCCGTG CAGGGGTTTC CTGGGATCGC AAGGACTACA CGCGTGCGGC TACTGCGCTG GCAGAGGCGA TTCTCGAGAA AACCGTCGTC GAACATGGGG GGCTGACCTT GCTCTTGCCG GGCGCGCAGG GATTTTCTGC GGCCGATCGG GCCGACGGGC CCGTCATCAA CCCGTCCTAC TGGGTTTTCG AAGCCTTTCC GGTTCTCGAA CAATTGGTTC CTTCTCCTGC CTGGAAAGCG CTCGCAGCGG ACGGTGAAGC TATTCTCAAG AGACTGGAGT TCGGGCCAAA GAAGTTGCCC GCCGACTGGA TTAGTGCACG AACCGTGTTC AAGCCGGCGG AGGGCTTCCC GTCCGAATAC GGCTATAACG CGCTGCGCAT CCCGCTTTAT CTTGTTCGCT CGGGAAGGAC AGACAGTGAG CTCCTTTCAC GGATCTACAG GGGCATGTCC GACGCGAAAG GTGCGGTTTT GCTCAGCGAT GTCGAAAGCG GTGCCGTGGA GGAGACCCTT ACCGATCCAG GTTATCGAAT TATTAACCAT ATCCTGGCCT GTGTCCTCCA AGGGACGAAG CTCCCCGACG ACATGAAAAC GTTTGAACCG ACCCAATACT ATCCCTCGAC AATGCACCTG CTGGGTTTGT CTTTCGTGGA GGAAATGCGT CCGGAGTGCC TATGA
|
Protein sequence | MKPLVVLMLL LLAYPTHAQE PATVEATAWQ KYKTRFLDPG GRIIDDANGD ISHSEGQGYG LLLAFLAGSR ADFELIWSFT RRELLLRDDG LAAWKWSPGE APHVSDTNNA TDGDILIAYA LARAGVSWDR KDYTRAATAL AEAILEKTVV EHGGLTLLLP GAQGFSAADR ADGPVINPSY WVFEAFPVLE QLVPSPAWKA LAADGEAILK RLEFGPKKLP ADWISARTVF KPAEGFPSEY GYNALRIPLY LVRSGRTDSE LLSRIYRGMS DAKGAVLLSD VESGAVEETL TDPGYRIINH ILACVLQGTK LPDDMKTFEP TQYYPSTMHL LGLSFVEEMR PECL
|
| |