Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3669 |
Symbol | |
ID | 5318066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 107941 |
End bp | 108918 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640775482 |
Product | putative cellulase H precursor protein |
Protein accession | YP_001312415 |
Protein GI | 150375819 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4124] Beta-mannanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.55561 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00741413 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACAGGA AATCAAAGGC CGTTGCGTTC GGATGTCTCG GACTATTGCT GTCGGGGACA GTGCTCGCAG CCAATATGCC TCGAGGTATC CCCGACGAGA ACTCGCCGAC GAAGGCAACG CCATCGGACA AGCGGCCGGT GCTGACAGAG ACTTCCCTCG ATTTCGGCGC TTACGATCCT CATGGTGACT TCGGCACGCC TGGGAGCTCG AAGATCGAGC ATCTCTTCCT GCCCTGGGAA GACGTCGATC TGTCGACCCT TGCGCTTGCC GACGACTATG CCCAGGCGCG AGGCCGCTCG CTCCTGATTA CCATCGAACC CTGGTCGTGG TCGCCGGAGT GGCGCGTGAC CGAGGAAGAG CTCCTGCGTT CGATTCTGAG CGGTGAGCGG GATCAGAACA TGGCCCAGGT CTGTTCGGCC GCGGCGGCAC TGAAAAGCCC GGTGATCATC CGTTGGGGCC AGGAAATGGA CGAAACCGAC AACCAGTTCT CCTGGTCGCA TTGGCGGGGC GAGGACTTCA AGGCAGCCTA TCGTCACGTG GTGGGCGTCT GCCGGGGCCA TCTCAAGAAT GCGAAATTCA TGTGGTCGCC CAAGGGGAAC GAGGGGTTGG ACGCCTTCTA TCCAGGTGAT GATGTCGTCG ATATCGTCGG GCTTTCGGTA TTCGGCTACC AACCCTACGA CGAAGGCACG ACCGGGCGCG CTCAGACGTT CGTGGAGCGG CTGGCGCCCG GTTACGGACG GGTGATGAAC TACGGCAAGC CGGTCATGGT CGCCGAGCTC GGCTATGAGG GCGATGGCTC ATACGTCGCG AGCTGGGCGG CATCCGTTGC TGAGCGTCAC GCCGAATTTC CGCAACTGAC AGCCGTGGTC TATTTCAACG ATCGCGAGAT CTATGACTGG CCGCAAGGCT ATGGCCGCCC CGATTGGCGG GTCGTCCGCG AAACCATCAG CGGGCAAGCG CCCGACGGAG GCATGTGA
|
Protein sequence | MNRKSKAVAF GCLGLLLSGT VLAANMPRGI PDENSPTKAT PSDKRPVLTE TSLDFGAYDP HGDFGTPGSS KIEHLFLPWE DVDLSTLALA DDYAQARGRS LLITIEPWSW SPEWRVTEEE LLRSILSGER DQNMAQVCSA AAALKSPVII RWGQEMDETD NQFSWSHWRG EDFKAAYRHV VGVCRGHLKN AKFMWSPKGN EGLDAFYPGD DVVDIVGLSV FGYQPYDEGT TGRAQTFVER LAPGYGRVMN YGKPVMVAEL GYEGDGSYVA SWAASVAERH AEFPQLTAVV YFNDREIYDW PQGYGRPDWR VVRETISGQA PDGGM
|
| |