Gene Smed_3669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3669 
Symbol 
ID5318066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp107941 
End bp108918 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content62% 
IMG OID640775482 
Productputative cellulase H precursor protein 
Protein accessionYP_001312415 
Protein GI150375819 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4124] Beta-mannanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.55561 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00741413 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACAGGA AATCAAAGGC CGTTGCGTTC GGATGTCTCG GACTATTGCT GTCGGGGACA 
GTGCTCGCAG CCAATATGCC TCGAGGTATC CCCGACGAGA ACTCGCCGAC GAAGGCAACG
CCATCGGACA AGCGGCCGGT GCTGACAGAG ACTTCCCTCG ATTTCGGCGC TTACGATCCT
CATGGTGACT TCGGCACGCC TGGGAGCTCG AAGATCGAGC ATCTCTTCCT GCCCTGGGAA
GACGTCGATC TGTCGACCCT TGCGCTTGCC GACGACTATG CCCAGGCGCG AGGCCGCTCG
CTCCTGATTA CCATCGAACC CTGGTCGTGG TCGCCGGAGT GGCGCGTGAC CGAGGAAGAG
CTCCTGCGTT CGATTCTGAG CGGTGAGCGG GATCAGAACA TGGCCCAGGT CTGTTCGGCC
GCGGCGGCAC TGAAAAGCCC GGTGATCATC CGTTGGGGCC AGGAAATGGA CGAAACCGAC
AACCAGTTCT CCTGGTCGCA TTGGCGGGGC GAGGACTTCA AGGCAGCCTA TCGTCACGTG
GTGGGCGTCT GCCGGGGCCA TCTCAAGAAT GCGAAATTCA TGTGGTCGCC CAAGGGGAAC
GAGGGGTTGG ACGCCTTCTA TCCAGGTGAT GATGTCGTCG ATATCGTCGG GCTTTCGGTA
TTCGGCTACC AACCCTACGA CGAAGGCACG ACCGGGCGCG CTCAGACGTT CGTGGAGCGG
CTGGCGCCCG GTTACGGACG GGTGATGAAC TACGGCAAGC CGGTCATGGT CGCCGAGCTC
GGCTATGAGG GCGATGGCTC ATACGTCGCG AGCTGGGCGG CATCCGTTGC TGAGCGTCAC
GCCGAATTTC CGCAACTGAC AGCCGTGGTC TATTTCAACG ATCGCGAGAT CTATGACTGG
CCGCAAGGCT ATGGCCGCCC CGATTGGCGG GTCGTCCGCG AAACCATCAG CGGGCAAGCG
CCCGACGGAG GCATGTGA
 
Protein sequence
MNRKSKAVAF GCLGLLLSGT VLAANMPRGI PDENSPTKAT PSDKRPVLTE TSLDFGAYDP 
HGDFGTPGSS KIEHLFLPWE DVDLSTLALA DDYAQARGRS LLITIEPWSW SPEWRVTEEE
LLRSILSGER DQNMAQVCSA AAALKSPVII RWGQEMDETD NQFSWSHWRG EDFKAAYRHV
VGVCRGHLKN AKFMWSPKGN EGLDAFYPGD DVVDIVGLSV FGYQPYDEGT TGRAQTFVER
LAPGYGRVMN YGKPVMVAEL GYEGDGSYVA SWAASVAERH AEFPQLTAVV YFNDREIYDW
PQGYGRPDWR VVRETISGQA PDGGM