Gene Smed_5210 details

Gene Information

Locus tag: Smed_5210
Symbol:
ID: 5319512
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 172393
End bp: 173427
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 58%
IMG OID: 640776988
Product: cellulase
Protein accession: YP_001313920
Protein GI: 150377325
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3405] Endoglucanase Y
TIGRFAM ID:
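
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 172393
end_bp = 173427
gene_length_bp = 1035
protein_length_aa = 344

# Assumption: coordinates are 1-based and inclusive, so the span
# includes both the start and the end position.
span_bp = end_bp - start_bp + 1

# An unspliced CDS is read in codons of three bases.
codons = span_bp // 3

# Assumption: the final codon is a stop codon and encodes no residue.
coding_codons = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions the three fields agree: 1035 bp is 345 codons, and 344 of them encode residues.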


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.558539
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCCAC TCGTCGTGCT AATGCTCCTG CTGCTAGCTT ATCCGACGCA TGCCCAGGAG 
CCAGCGACGG TGGAAGCAAC GGCGTGGCAA AAGTATAAGA CGCGATTTCT CGATCCGGGC
GGCCGGATTA TCGACGACGC CAACGGTGAT ATAAGCCATA GCGAGGGACA GGGTTACGGC
CTTCTGTTGG CCTTTCTCGC GGGCAGCCGG GCCGATTTTG AACTCATCTG GTCGTTCACG
CGCCGCGAAC TCCTCCTGCG CGACGACGGC CTTGCGGCCT GGAAATGGAG TCCGGGCGAA
GCGCCTCATG TTTCCGATAC GAACAATGCA ACAGACGGCG ACATATTGAT CGCCTATGCC
TTGGCCCGTG CAGGGGTTTC CTGGGATCGC AAGGACTACA CGCGTGCGGC TACTGCGCTG
GCAGAGGCGA TTCTCGAGAA AACCGTCGTC GAACATGGGG GGCTGACCTT GCTCTTGCCG
GGCGCGCAGG GATTTTCTGC GGCCGATCGG GCCGACGGGC CCGTCATCAA CCCGTCCTAC
TGGGTTTTCG AAGCCTTTCC GGTTCTCGAA CAATTGGTTC CTTCTCCTGC CTGGAAAGCG
CTCGCAGCGG ACGGTGAAGC TATTCTCAAG AGACTGGAGT TCGGGCCAAA GAAGTTGCCC
GCCGACTGGA TTAGTGCACG AACCGTGTTC AAGCCGGCGG AGGGCTTCCC GTCCGAATAC
GGCTATAACG CGCTGCGCAT CCCGCTTTAT CTTGTTCGCT CGGGAAGGAC AGACAGTGAG
CTCCTTTCAC GGATCTACAG GGGCATGTCC GACGCGAAAG GTGCGGTTTT GCTCAGCGAT
GTCGAAAGCG GTGCCGTGGA GGAGACCCTT ACCGATCCAG GTTATCGAAT TATTAACCAT
ATCCTGGCCT GTGTCCTCCA AGGGACGAAG CTCCCCGACG ACATGAAAAC GTTTGAACCG
ACCCAATACT ATCCCTCGAC AATGCACCTG CTGGGTTTGT CTTTCGTGGA GGAAATGCGT
CCGGAGTGCC TATGA
 
Protein sequence
MKPLVVLMLL LLAYPTHAQE PATVEATAWQ KYKTRFLDPG GRIIDDANGD ISHSEGQGYG 
LLLAFLAGSR ADFELIWSFT RRELLLRDDG LAAWKWSPGE APHVSDTNNA TDGDILIAYA
LARAGVSWDR KDYTRAATAL AEAILEKTVV EHGGLTLLLP GAQGFSAADR ADGPVINPSY
WVFEAFPVLE QLVPSPAWKA LAADGEAILK RLEFGPKKLP ADWISARTVF KPAEGFPSEY
GYNALRIPLY LVRSGRTDSE LLSRIYRGMS DAKGAVLLSD VESGAVEETL TDPGYRIINH
ILACVLQGTK LPDDMKTFEP TQYYPSTMHL LGLSFVEEMR PECL