Gene Smed_4408 details

Gene Information

Locus tag: Smed_4408
Symbol:
ID: 5318121
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 900951
End bp: 902756
Gene Length: 1806 bp
Protein Length: 601 aa
Translation table: 11
GC content: 63%
IMG OID: 640776212
Product: malto-oligosyltrehalose trehalohydrolase
Protein accession: YP_001313145
Protein GI: 150376549
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0296] 1,4-alpha-glucan branching enzyme
TIGRFAM ID: [TIGR02402] malto-oligosyltrehalose trehalohydrolase
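
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(900951, 902756)
print(length)                  # 1806
print(protein_length(length))  # 601
```

Both values match the Gene Length and Protein Length fields of the record.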


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.354382
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAGAAA GCTCAACGCC TCCGTGTTTC AGAAAAACTT GGGGGCCCAA TTTCGTCGGC 
GAAGACACTT GCCGGTTTCG GCTGTGGGCT CCCGACGAGC GGGGTGTCGA TCTCGTCCTC
GACGGGACCC CGCACGAGAT GCGGCAGAGC GATGGCGGCT GGTTCGAAAT CACGGTCGAG
ACGAAGCCGG GAGACCGTTA TTGCTTCCGA TTGGCCGACG GAACCGAGGT TGCGGATCCG
GCTTCGTCGG CACAGGAGCG GGGTGCGGAA GGTGCGTCGC TCGTCGTCGA CCAGGCGGCC
TATGAATGGC GGGCCGCCTC CTGGCGTGGC AGGCCCTGGG AGGAGGCGGT CATCTCGGAG
CTGCATATCG GCTGCTTCAC GCCGGAGGGC ACATTCCGCG CCGCGATCAA CCGCCTGCCG
CATCTGGCAT CGGCCGGAAT TACCGCGATT GAAATCATGC CCGTCGCACA GTTCCCCGGC
GCGCGGGGCT GGGGCTACGA TGGTGTGCTG CATTATGCCC CGCATAATGC CTATGGGACG
CCGGATGACC TGAAGGGGCT CGTGGATGCA GCGCATTCGC TCGGCCTCAT GGTCCTGCTC
GACGTGGTTT ACAACCATTT TGGGCCCGAG CAGAATTACC TTTCCCTGTA CGCAAGCCGC
TTCTTCAACA AGGATCGCCC CACACCCTGG GGTGCGTCCA TTGCCTTCGA GGAAGAAGCA
GTCCGGCGAT ATTTTATCGA GAATGCGCTT TACTGGCTCG GCGATTTCCG TTTCGACGGT
TTGCGCCTCG ATGCGACTGA ACAGATTCGT GACACGAGCA ATCCGCATTT CCTCGTCGCG
CTGGAGCACG AAGTGCGCAA ATGCTTCGCC GACCGCCAAA TCCACCTGGT GGTGGAGGAC
GCCAATCGCC GCAGAAGCCT GCTCGAGCGT GACGCCAACG GCACGCCCAT GCTCTTCGAC
GCGGCATGGA ACGACGACCT TCACAACGCG CTCCATGTCG TAGCGACCGG CGAAACCAGG
GGCCACTATC GCCCCTTCGC GGAGGCCCCG TGGAGCAAGA TCCGCAGCGC GCTGGCCGAA
GGTTTCGCCG TGCCCGCAAA GGAGGACAAC TTCTCCGCTG AAGGAAGCCG CGCCCGGGTG
CCGCCGCAGG GCCGTGTGAA TTTCCTGCAG AACCACGACC AAATCGGCAA CCGCGCTTTC
GGCGAGCGCC TTGCCTCGCT GGTTCGAGAG GATAGCCTGC GGGTGTTGAC GGCCATGCAC
ATGCTCGCGC CGCAAATTCC CCTTCTGTTC ATGGGTGAGG AATACGGCGA GACACAGCCC
TTCTACTTCT TCTCCGACTA TCAGGGAGAA ATCGCCGACG CGATCCGGCT GGGACGCCGG
GACGAAGCCG AGAATTTCGG CGGGCTTCCA AATGGCAAGA CCGTGGAGGA TCTGCCCGAC
CCGCTGGATC CGGAGGTTTT TTCCGGCTCG AAGCTGCGCT GGAGCCGCGC GGCAAGTCCG
GCCGGCGAGC GGCGTCTTGC CTATATGCGC GACCTTGCTT CGATCCGGCA GAGGCATATC
GTGCCGATGA TGGCCGGTAC CGCAATGCCG GAACACCGGG CATTCGAGAC AGAGGACGGT
GTCATCGCCG TCGACTGGCA GTTCGGGGAG TCCTGCCTGG AAATGCGCGT CAATCTTTCA
CAGGAGACGC ACGCCATGCC CCCGTTCCGG GGTCAGCCGA TCTTCGCCAG CGAAGCGGCC
GGCGAGAGGA CGCCCGACGT CACCGAACTG GCCGGCCTCG GCATCGTGGT CGCCATTGCA
CGATGA
 
Protein sequence
MRESSTPPCF RKTWGPNFVG EDTCRFRLWA PDERGVDLVL DGTPHEMRQS DGGWFEITVE 
TKPGDRYCFR LADGTEVADP ASSAQERGAE GASLVVDQAA YEWRAASWRG RPWEEAVISE
LHIGCFTPEG TFRAAINRLP HLASAGITAI EIMPVAQFPG ARGWGYDGVL HYAPHNAYGT
PDDLKGLVDA AHSLGLMVLL DVVYNHFGPE QNYLSLYASR FFNKDRPTPW GASIAFEEEA
VRRYFIENAL YWLGDFRFDG LRLDATEQIR DTSNPHFLVA LEHEVRKCFA DRQIHLVVED
ANRRRSLLER DANGTPMLFD AAWNDDLHNA LHVVATGETR GHYRPFAEAP WSKIRSALAE
GFAVPAKEDN FSAEGSRARV PPQGRVNFLQ NHDQIGNRAF GERLASLVRE DSLRVLTAMH
MLAPQIPLLF MGEEYGETQP FYFFSDYQGE IADAIRLGRR DEAENFGGLP NGKTVEDLPD
PLDPEVFSGS KLRWSRAASP AGERRLAYMR DLASIRQRHI VPMMAGTAMP EHRAFETEDG
VIAVDWQFGE SCLEMRVNLS QETHAMPPFR GQPIFASEAA GERTPDVTEL AGLGIVVAIA
R
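
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction helper; `clean_sequence` and `gc_fraction` are hypothetical names, and only the first and last rows of the gene sequence are used here for illustration.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last rows of the gene sequence as shown in the record.
first_row = "ATGCGAGAAA GCTCAACGCC TCCGTGTTTC AGAAAAACTT GGGGGCCCAA TTTCGTCGGC"
last_row = "CGATGA"

seq = clean_sequence(first_row)
print(len(seq))                       # 60
print(seq[:3])                        # ATG
print(clean_sequence(last_row)[-3:])  # TGA
```

The gene begins with the start codon ATG and ends with TGA, which is a stop codon in translation table 11. Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field.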