Gene Smed_4139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4139 
Symbol 
ID5319281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp609032 
End bp610504 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content61% 
IMG OID640775944 
Productglycoside hydrolase family protein 
Protein accessionYP_001312877 
Protein GI150376281 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAGCT TCAAGATCGC CATTATCGGC GCCGGCAGCA TCGGCTTCAC CAAGAAGCTC 
TTCACGGACA TTCTTTCCGT GCCGGAGCTT CGCGACGTCG AGTTTGCCCT GACGGATCTG
AGCGAGCACA ACCTCGCGAT GATCAAGTCT ATCCTCGACC GGATTGTGGA GGCCAACAAA
CTCCCCACCC GGGTGACGGC AACCACCGAC CGCCGCAGGG CACTTGAGGG CGCGCGCTAT
ATCATCAGCT GCGTGCGTGT CGGCGGCCTC GAAGCCTATG CCGACGATAT CCGGATACCG
TTGAAATATG GCGTCGATCA ATGCGTCGGC GACACGATCT GTGCTGGCGG CATTCTTTAT
GGCCAGCGCA ACATTCCGGT GATCCTCGAT TTCTGCAAGG ACATCCGCGA GGTGGCAGAG
CCCGGCGCGA AGTTCCTGAA CTATGCCAAT CCGATGGCGA TGAACAGCTG GGCGGCGATC
GAATACGGCA AGGTCGACAC GGTCGGGCTC TGCCATGGCG TCCAGCACGG AGCCGAGCAG
ATCGCGGAGA TTCTCGGCGC CGGGGAGGGT GAGCTCGACT ACATCTGCTC CGGCATCAAC
CACCAGACCT GGTTCGTGGA TATTCGCCTT GGCGGCCGCA AAATCGGCAA GGACGAACTC
GTCGCCGCCT TCGAAGCGCA TCCGATTTTC TCGCAGCAGG AGAAGCTCCG CATCGACGTG
TTGAAGCGTT TCGGCGTCTA TTCAACCGAA AGCAACGGCC ATCTTTCGGA ATACCTCCCC
TGGTACCGCA AGCGTCCCGA CGAGATTTCG AGATGGATCG ACATGTCGGA TTGGATCCAC
GGCGAGACCG GCGGATATCT CCGCTATTCG ACCGAGACCC GCAACTGGTT CGAAACGGAA
TACCCGCGCT TCCTCGAAGA GGCGAGCCGG CCGCTGGAGA CGATCAAGCG CTCGAACGAA
CATGCAAGCC GCATTCTGGA AGCACTCGAG ACGGGACGCG TCTATCGCGG CCACTTCAAT
GTCAAGAACA ACGGCGTAAT CACCAACCTC CCGGCGGATG CGATAATCGA GTCTCCGGGC
TTCGTCGACC GCTTCGGCAT CAATATGGTG GCGGGCATCA CCTTGCCGGA GGCCTGCGCG
GCCACCTGCA TTGCCTCGAT CAACGTCCAG CGCATGTCGG TTCATGCGGC AATCACGGGC
GACATCGATC TCCTGAAGCT CGCCGTTCTG CACGACCCGC TGGTCGGCGC TATCTGCACG
CCGGAGGAGG TCTGGCAAAT GGTGGATGAA ATGGTCGTCG CCCAGGCCGA ATGGCTGCCG
CAATATGCCC ATGCGATCGA CGCGGCCAAG GAAAGGCTCG CCCGCGCCAC CGTCGCTACG
CGGGAGTGGA AGGGTGCAGC GCGCCGCGAG GTGCGCTCGA TCGAGGAAAT CCGCGCGGAA
AAGGAAGCGG CGAAGCTGCG TGCCGCCGGG TAG
 
Protein sequence
MASFKIAIIG AGSIGFTKKL FTDILSVPEL RDVEFALTDL SEHNLAMIKS ILDRIVEANK 
LPTRVTATTD RRRALEGARY IISCVRVGGL EAYADDIRIP LKYGVDQCVG DTICAGGILY
GQRNIPVILD FCKDIREVAE PGAKFLNYAN PMAMNSWAAI EYGKVDTVGL CHGVQHGAEQ
IAEILGAGEG ELDYICSGIN HQTWFVDIRL GGRKIGKDEL VAAFEAHPIF SQQEKLRIDV
LKRFGVYSTE SNGHLSEYLP WYRKRPDEIS RWIDMSDWIH GETGGYLRYS TETRNWFETE
YPRFLEEASR PLETIKRSNE HASRILEALE TGRVYRGHFN VKNNGVITNL PADAIIESPG
FVDRFGINMV AGITLPEACA ATCIASINVQ RMSVHAAITG DIDLLKLAVL HDPLVGAICT
PEEVWQMVDE MVVAQAEWLP QYAHAIDAAK ERLARATVAT REWKGAARRE VRSIEEIRAE
KEAAKLRAAG