Gene Information |
Locus tag | Smed_5209 |
Symbol | |
ID | 5319511 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 170075 |
End bp | 172396 |
Gene Length | 2322 bp |
Protein Length | 773 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640776987 |
Product | cellulose synthase subunit B |
Protein accession | YP_001313919 |
Protein GI | 150377324 |
COG category | |
COG ID | |
TIGRFAM ID | |
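The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon (the record does not state either convention). The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(170075, 172396)
    print(length)                           # 2322
    print(expected_protein_length(length))  # 773
```

Under these assumptions the values agree with the Gene Length (2322 bp) and Protein Length (773 aa) fields of the record.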
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.656767 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTGGCGC TGCTTACCTG TGGGCCCTTG GGAGCCCAGC CGGCGCCGTT CGACATGACG GGGGAACGCC CCATTGAAGA TCAGGCGCCC GCCCCCAAAC AACAGGAGCA ACCTTCGAGG AACGGTCCGG CTGACGGCAT TGAAGGTGAA AAAATCCAAC TTGCACCGGT GAAAGACGCA GCATCTTATC GCAGGCACAT CGTTCCCTTC GCCAGTTTGT CACTGACCGG TGAAGTCGAC GAGCGAGCGT GGTCGGTCTA CTTGACGCCG GCGCAAGCCG CCGCAGGCGG CGATCTCATT TTCGAGTACC AGAACGCGGT CGTGATCGCG CCGGAAAGTT CCGTTCTGTC GGTCTTTGTG AACGGCAAGT TGGTCGGTGA GGGACCGGTA CAGGCAGGGG AAAAGCCGGA ATCGCGACGC TATAAGCTCC CGTCCAACCT TTTGCGGACG GGCTCGAACG AAATTCGCTT TCGGGTGCAA CAGCGCCACC GGACCGACTG CACGGTCGAA TCGACATACG AGCTCTGGAC CGAAATTGCC TCTGGAAAGG CCTACATCCA GTTTGAAGGC CGGGATGCGG CAAGGCTCAC GACCCTTGAA GACATCCAAG CGGTCGGCGT CGATGCCAGT GGGCGCACGC GGTTCAACAT CGTCGCACCG GCTTTTGACC AGCCGAGCCG CACCGCCGCG TTGATGCGGC TAGCGCAAGG CCTCGCTCTT CTTGGCCACA TGCCGGCACA GAGCTTCGAC GTTCGCGGAG ATCTCCCGGA ACTCGGCAGA GCCGGGGAAC TGACGGTTCT TGCCGGAACG GTTGCGGAAC TACAGCCTCT TTTCCCATCA CTGCCGGGTG AGGCATCGAC GACGGCAGTG ACGGCCCTTG TTGAAAGTCG GCCGGAACGG GCGCCAATCC TCGTTTTGAC TGGTCCGGAT TGGTCGGCGG TCGAAGGGGC TATCGAATCA ATCACGAGGG TGACGGATAA ACCTGCCAAC GTTGCGCGGG ACGTTATTGA AACCGGGAGG TGGAGATTGC CCCAGGCGCC ACAGGTGATC TCGGGCAAAA GGCTGCCGTT TTCTGCGCTC GGCGTGTCGA CGACAGAATT CTCTGGACGC CGCTTTCGCA CTGCCTTCGC AGTCGCGGTT CCGCCGGATT TTTACGCCAA TGCCTATGGC GAAGCGACGG TGCTTCTCGA CGTCGGCTAC ACTAAGGCGG TTCGGCCGGG CAGCCGGATC GACATATACG TCAATGGAAA TATCGCCTCA ACGGCTCCGC TGGACTCCTC GGGTGGCTTG GTGCGCCACT TGCCGATCAA CGTGACGCTG CGCCATTTTC ACCCAGGTGC GAATCTGATC GAATTGGAGG CCGTTCTTCT CACCGACGCC GACAGGACAT GTGCGCCAGG AACTGCCGCC TCGACAGAGC CACGATTCGC GCTATTTGAC ACATCAGAAT TTGCAATGGC CGACTTCGCA AGGATCGGAC GCCTTCCCGA TCTCGCGGCA GCCGCGGGAT CTGCCTACCC GTTTCGCAGC AGCGCCAAAC CGATCGCTCT ACACGTGGAC AGGGCTGAGA CCCTTAGCCT ATCAGCGGCG GCGACCTTTC TTGCCCGCTT GGCCGTGGCA GGGGGGCGTC CGATGGCAAT CGAAACGATT ACATCGCCGG TTGCTTCCGC GGACAGGGAA GCGCTCTTCG TCGGCGCGAT GCCCCAGATA CCGAAATCCG TTCTTACAGA AGCCGGGATT GACTTGAACA GTCAGATCTC CTGGGGCAGT TCCGCATCGT CTGAGGTTCT TGCCGACTCG 
CGGGCGGCTT TTGATGCCTG GCAGACCCGC TTGAACGGCG GAACCTGGCG CTCTCACCTT CGTGGTCTTG AGGATTGGGT GAAGGACACC TTCGACATAT CATTGAATTC ACTCAGGCTC CTCCCGCCTG ACGAGGCTCC CTTCGCGCCG CCGGATACCG CGACACTGCT CATCGCGCAA TCGTCAAATC CGGACGCGGG AACGACTTGG ACGCTCGTCA CGTCGCCTAC ACCAGAACAT CTACGCGACG CGGTATCTAC GATTTCCGAT GTCCGCAGAT GGAGCCAGAT GTCGGGGCAC ATCTCCATCT ACGAGCCCGC CAACGATCGG ATCAGTTCGA TCCCGGCGCA ACACTTCGAG TTCGTCGAAA CTCGGCCACC CTCGCTCGCG AACTATCGTC TTGTCCTTGC CAATTGGCTG TCGAGCAACA TATTGGTGTA CGCCGTGCTG CTCGTCTGCC TTGTCGTTCT GCTCGGATTG GCCACTGCTA GCCTACTTGC CAGATTGGGG CGCCGCCAAT GA
|
Protein sequence | MVALLTCGPL GAQPAPFDMT GERPIEDQAP APKQQEQPSR NGPADGIEGE KIQLAPVKDA ASYRRHIVPF ASLSLTGEVD ERAWSVYLTP AQAAAGGDLI FEYQNAVVIA PESSVLSVFV NGKLVGEGPV QAGEKPESRR YKLPSNLLRT GSNEIRFRVQ QRHRTDCTVE STYELWTEIA SGKAYIQFEG RDAARLTTLE DIQAVGVDAS GRTRFNIVAP AFDQPSRTAA LMRLAQGLAL LGHMPAQSFD VRGDLPELGR AGELTVLAGT VAELQPLFPS LPGEASTTAV TALVESRPER APILVLTGPD WSAVEGAIES ITRVTDKPAN VARDVIETGR WRLPQAPQVI SGKRLPFSAL GVSTTEFSGR RFRTAFAVAV PPDFYANAYG EATVLLDVGY TKAVRPGSRI DIYVNGNIAS TAPLDSSGGL VRHLPINVTL RHFHPGANLI ELEAVLLTDA DRTCAPGTAA STEPRFALFD TSEFAMADFA RIGRLPDLAA AAGSAYPFRS SAKPIALHVD RAETLSLSAA ATFLARLAVA GGRPMAIETI TSPVASADRE ALFVGAMPQI PKSVLTEAGI DLNSQISWGS SASSEVLADS RAAFDAWQTR LNGGTWRSHL RGLEDWVKDT FDISLNSLRL LPPDEAPFAP PDTATLLIAQ SSNPDAGTTW TLVTSPTPEH LRDAVSTISD VRRWSQMSGH ISIYEPANDR ISSIPAQHFE FVETRPPSLA NYRLVLANWL SSNILVYAVL LVCLVVLLGL ATASLLARLG RRQ
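The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the method used to produce the record: it relies on translation table 11 sharing its amino acid assignments with the standard genetic code, and it treats only ATG, GTG, and TTG as initiation codons that are read as methionine at the first position (an assumed subset of the table 11 initiators). The names `CODON_TABLE`, `START_CODONS`, and `translate_cds` are hypothetical.

```python
from itertools import product

# Amino acids for the 64 codons in T, C, A, G order (standard assignments,
# which translation table 11 also uses for elongation).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of table 11 initiation codons, read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS; whitespace (e.g. 10-base blocks) is ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still initiate with Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation and is not emitted
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate_cds("GTGGTGGCGC TGCTTACCTG TGGGCCCTTG"))  # MVALLTCGPL
```

The GTG start codon of this gene is why the protein begins with M rather than V, and the terminal TGA codon is why 2322 bases yield 773 residues.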