Gene Smed_4144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4144 
Symbol 
ID5319140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp616757 
End bp618130 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content62% 
IMG OID640775949 
Productglycoside hydrolase family protein 
Protein accessionYP_001312882 
Protein GI150376286 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.663959 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAGAC AACCCAGGAT CACTTTCATC GGCGCCGGTT CCACCGTGTT CATGAAGAAC 
ATTATCGGCG ATATCTTGCA GCGCCCGGCG CTTTCGGCCG CGACCATCGC CTTGATGGAC
GTCAACCCGG AGCGCCTGGC GGAAAGCGAG ATCGTCGCGG GCAAGCTGGC GCGCACGCTG
GGCGCCGGCG CCAGGATCGA GACGCACTCC GACCAGCGCA AGGCGCTCAC GGGAGCGGAC
TTCGTCGTGG TTGCCTTCCA GATCGGCGGC TACGAGCCAT GCACCGTGAC AGATTTCGAG
GTGCCGAAAA AATACGGACT GCGCCAGACG ATCGCCGACA CGCTCGGCGT CGGCGGCATC
ATGCGGGGGT TGCGCACCGT CCCGCATCTC TGGAAGATCT GCGAGGACAT GCTCGAGGTC
TGCCCCGAGG CGATCCTCCT GCAATATGTA AACCCGATGG CGATCAACAC CTGGGCGATC
GCCGAGAGGT ATCCGGCCAT CAAGCAGGTG GGCCTCTGCC ACTCCGTGCA GGGCACGGCC
TATGAACTCG CCCGCGATCT CGAGATACCG CTCGAGGAGA TCCGCTATCG CGCCGCCGGC
ATCAACCACA TGGCCTTCTA TCTGAAATTC GAGCACCGTC AGAAGGACGG CAGCTATCGT
GATCTCTATC CGGACCTTAT CCGCGGCTAC CGCGAGGGGC GCTTTCCGAA GCCGAGCCAT
TGGAACCCGC GCTGCCCAAA CAAGGTACGT TACGAAATGC TGACGCGGCT CGGCTATTTC
GTCACCGAAA GCTCGGAGCA TTTCGCCGAG TACACGCCCT ATTTCATCAA GGAGGGGCGT
CCCGATCTGA TCGAAAAATT CGGGATTCCG CTCGACGAGT ATCCGAAGCG TTGCATCGAG
CAGATCGAGC GCTGGAAGGG CCAGGCGGCC GCCTTCAAGG AGGCGGAGAC GATGGAAGTC
GCAGAGAGCC GCGAATATGC CTCCTCGATC ATGAACTCGG TCTGGACCGG CGAGCCCTCG
GTGATTTACG GCAACCTCAG AAACAATGGC TGCATCACCT CGCTGCCGGA AAACTGCGCG
GCGGAGATGC CGTGTCTCGT CGATCAGTCG GGTATTCAGC CGACCCATAT CGGCGCGCTG
CCGCCGCAAC TCACGGCGTT GATCCGCACC AACATCAACG TACAGGAGTT GACGGTTCAG
GCGCTCGTCA CTGAAAACCG GGAGCATCTC TACCATGCGG CAATGATGGA TCCGCATACC
GCCGCCGAGC TCGACCTCGA CCAGATCTGG TCCCTTGTCG ACGATCTGCT CACAGCGCAC
CGCGACTGGA TCCCGGAATG GGCCCGCGTC GCGCAGAAGG TAGCGGCCGC CTGA
 
Protein sequence
MTRQPRITFI GAGSTVFMKN IIGDILQRPA LSAATIALMD VNPERLAESE IVAGKLARTL 
GAGARIETHS DQRKALTGAD FVVVAFQIGG YEPCTVTDFE VPKKYGLRQT IADTLGVGGI
MRGLRTVPHL WKICEDMLEV CPEAILLQYV NPMAINTWAI AERYPAIKQV GLCHSVQGTA
YELARDLEIP LEEIRYRAAG INHMAFYLKF EHRQKDGSYR DLYPDLIRGY REGRFPKPSH
WNPRCPNKVR YEMLTRLGYF VTESSEHFAE YTPYFIKEGR PDLIEKFGIP LDEYPKRCIE
QIERWKGQAA AFKEAETMEV AESREYASSI MNSVWTGEPS VIYGNLRNNG CITSLPENCA
AEMPCLVDQS GIQPTHIGAL PPQLTALIRT NINVQELTVQ ALVTENREHL YHAAMMDPHT
AAELDLDQIW SLVDDLLTAH RDWIPEWARV AQKVAAA