Gene Smed_5209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5209 
Symbol 
ID5319511 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp170075 
End bp172396 
Gene Length2322 bp 
Protein Length773 aa 
Translation table11 
GC content60% 
IMG OID640776987 
Productcellulose synthase subunit B 
Protein accessionYP_001313919 
Protein GI150377324 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.656767 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTGGCGC TGCTTACCTG TGGGCCCTTG GGAGCCCAGC CGGCGCCGTT CGACATGACG 
GGGGAACGCC CCATTGAAGA TCAGGCGCCC GCCCCCAAAC AACAGGAGCA ACCTTCGAGG
AACGGTCCGG CTGACGGCAT TGAAGGTGAA AAAATCCAAC TTGCACCGGT GAAAGACGCA
GCATCTTATC GCAGGCACAT CGTTCCCTTC GCCAGTTTGT CACTGACCGG TGAAGTCGAC
GAGCGAGCGT GGTCGGTCTA CTTGACGCCG GCGCAAGCCG CCGCAGGCGG CGATCTCATT
TTCGAGTACC AGAACGCGGT CGTGATCGCG CCGGAAAGTT CCGTTCTGTC GGTCTTTGTG
AACGGCAAGT TGGTCGGTGA GGGACCGGTA CAGGCAGGGG AAAAGCCGGA ATCGCGACGC
TATAAGCTCC CGTCCAACCT TTTGCGGACG GGCTCGAACG AAATTCGCTT TCGGGTGCAA
CAGCGCCACC GGACCGACTG CACGGTCGAA TCGACATACG AGCTCTGGAC CGAAATTGCC
TCTGGAAAGG CCTACATCCA GTTTGAAGGC CGGGATGCGG CAAGGCTCAC GACCCTTGAA
GACATCCAAG CGGTCGGCGT CGATGCCAGT GGGCGCACGC GGTTCAACAT CGTCGCACCG
GCTTTTGACC AGCCGAGCCG CACCGCCGCG TTGATGCGGC TAGCGCAAGG CCTCGCTCTT
CTTGGCCACA TGCCGGCACA GAGCTTCGAC GTTCGCGGAG ATCTCCCGGA ACTCGGCAGA
GCCGGGGAAC TGACGGTTCT TGCCGGAACG GTTGCGGAAC TACAGCCTCT TTTCCCATCA
CTGCCGGGTG AGGCATCGAC GACGGCAGTG ACGGCCCTTG TTGAAAGTCG GCCGGAACGG
GCGCCAATCC TCGTTTTGAC TGGTCCGGAT TGGTCGGCGG TCGAAGGGGC TATCGAATCA
ATCACGAGGG TGACGGATAA ACCTGCCAAC GTTGCGCGGG ACGTTATTGA AACCGGGAGG
TGGAGATTGC CCCAGGCGCC ACAGGTGATC TCGGGCAAAA GGCTGCCGTT TTCTGCGCTC
GGCGTGTCGA CGACAGAATT CTCTGGACGC CGCTTTCGCA CTGCCTTCGC AGTCGCGGTT
CCGCCGGATT TTTACGCCAA TGCCTATGGC GAAGCGACGG TGCTTCTCGA CGTCGGCTAC
ACTAAGGCGG TTCGGCCGGG CAGCCGGATC GACATATACG TCAATGGAAA TATCGCCTCA
ACGGCTCCGC TGGACTCCTC GGGTGGCTTG GTGCGCCACT TGCCGATCAA CGTGACGCTG
CGCCATTTTC ACCCAGGTGC GAATCTGATC GAATTGGAGG CCGTTCTTCT CACCGACGCC
GACAGGACAT GTGCGCCAGG AACTGCCGCC TCGACAGAGC CACGATTCGC GCTATTTGAC
ACATCAGAAT TTGCAATGGC CGACTTCGCA AGGATCGGAC GCCTTCCCGA TCTCGCGGCA
GCCGCGGGAT CTGCCTACCC GTTTCGCAGC AGCGCCAAAC CGATCGCTCT ACACGTGGAC
AGGGCTGAGA CCCTTAGCCT ATCAGCGGCG GCGACCTTTC TTGCCCGCTT GGCCGTGGCA
GGGGGGCGTC CGATGGCAAT CGAAACGATT ACATCGCCGG TTGCTTCCGC GGACAGGGAA
GCGCTCTTCG TCGGCGCGAT GCCCCAGATA CCGAAATCCG TTCTTACAGA AGCCGGGATT
GACTTGAACA GTCAGATCTC CTGGGGCAGT TCCGCATCGT CTGAGGTTCT TGCCGACTCG
CGGGCGGCTT TTGATGCCTG GCAGACCCGC TTGAACGGCG GAACCTGGCG CTCTCACCTT
CGTGGTCTTG AGGATTGGGT GAAGGACACC TTCGACATAT CATTGAATTC ACTCAGGCTC
CTCCCGCCTG ACGAGGCTCC CTTCGCGCCG CCGGATACCG CGACACTGCT CATCGCGCAA
TCGTCAAATC CGGACGCGGG AACGACTTGG ACGCTCGTCA CGTCGCCTAC ACCAGAACAT
CTACGCGACG CGGTATCTAC GATTTCCGAT GTCCGCAGAT GGAGCCAGAT GTCGGGGCAC
ATCTCCATCT ACGAGCCCGC CAACGATCGG ATCAGTTCGA TCCCGGCGCA ACACTTCGAG
TTCGTCGAAA CTCGGCCACC CTCGCTCGCG AACTATCGTC TTGTCCTTGC CAATTGGCTG
TCGAGCAACA TATTGGTGTA CGCCGTGCTG CTCGTCTGCC TTGTCGTTCT GCTCGGATTG
GCCACTGCTA GCCTACTTGC CAGATTGGGG CGCCGCCAAT GA
 
Protein sequence
MVALLTCGPL GAQPAPFDMT GERPIEDQAP APKQQEQPSR NGPADGIEGE KIQLAPVKDA 
ASYRRHIVPF ASLSLTGEVD ERAWSVYLTP AQAAAGGDLI FEYQNAVVIA PESSVLSVFV
NGKLVGEGPV QAGEKPESRR YKLPSNLLRT GSNEIRFRVQ QRHRTDCTVE STYELWTEIA
SGKAYIQFEG RDAARLTTLE DIQAVGVDAS GRTRFNIVAP AFDQPSRTAA LMRLAQGLAL
LGHMPAQSFD VRGDLPELGR AGELTVLAGT VAELQPLFPS LPGEASTTAV TALVESRPER
APILVLTGPD WSAVEGAIES ITRVTDKPAN VARDVIETGR WRLPQAPQVI SGKRLPFSAL
GVSTTEFSGR RFRTAFAVAV PPDFYANAYG EATVLLDVGY TKAVRPGSRI DIYVNGNIAS
TAPLDSSGGL VRHLPINVTL RHFHPGANLI ELEAVLLTDA DRTCAPGTAA STEPRFALFD
TSEFAMADFA RIGRLPDLAA AAGSAYPFRS SAKPIALHVD RAETLSLSAA ATFLARLAVA
GGRPMAIETI TSPVASADRE ALFVGAMPQI PKSVLTEAGI DLNSQISWGS SASSEVLADS
RAAFDAWQTR LNGGTWRSHL RGLEDWVKDT FDISLNSLRL LPPDEAPFAP PDTATLLIAQ
SSNPDAGTTW TLVTSPTPEH LRDAVSTISD VRRWSQMSGH ISIYEPANDR ISSIPAQHFE
FVETRPPSLA NYRLVLANWL SSNILVYAVL LVCLVVLLGL ATASLLARLG RRQ