Gene Smed_4653 details


Gene Information

Locus tag: Smed_4653
Symbol:
ID: 5318816
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1161201
End bp: 1163120
Gene Length: 1920 bp
Protein Length: 639 aa
Translation table: 11
GC content: 62%
IMG OID: 640776451
Product: Beta-N-acetylhexosaminidase
Protein accession: YP_001313383
Protein GI: 150376787
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3525] N-acetyl-beta-hexosaminidase
TIGRFAM ID:
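
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not state this), and the variable names are made up for the example.

```python
# Values copied from the gene record; variable names are illustrative.
start_bp = 1161201
end_bp = 1163120
gene_length_bp = 1920
protein_length_aa = 639

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is the stop codon,
# which is not counted in the protein length.
codons, remainder = divmod(span, 3)

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop matches protein length:", codons - 1 == protein_length_aa)
```

Under that assumption the three checks agree with the record: 1920 bp is 640 codons, that is 639 residues plus one stop codon.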


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.292919
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0791763
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCAGG CCGCTTATCG CCTTGAATCT GCCTGGCAAC CGGATGGTGG CCCATTCGGG 
CGCTTCACCT TCAGCCTCTT TAACCTTTCG GGCGAGCCGC TTAGCGACTT TCGGCTCGTC
TACACGTCGC TCACCCGCGT CATCGATCCG GACGCATGCG AGAATGCGGT CTTCCTCCGC
AGGAATGCCA ATTTCCACGA GTTCGCGCCG CCCGAAGGAC TGACGCTTGC CCACGGAGAG
CACTGGACAT TCATGATCAG CGGTCTGCAC CGGCAGGCGA AGCATTGCAC CGACGGGGCG
AAGTCGGCCT ATCTGACGCT CGCCGATGGA AGTCATGTCG CTGTGGCGGT TTCCGACCTT
CGTCTGGAGG GCGCCGTAAG CGAGCCGCCG CCGGAGCGCC TCCCCGAAGG CTGGCTCGAC
CTTCCTTTCG CGCTGCAGCC CTGGCCCGCG GAGATCGATG CAGCGCCCGG AGACCTGCTT
CCGGATTTCC TCTATCCTGC GGCGGGAAGC GCTGCGGACG AGATCGAGGC GGTCTCCAAC
GTACTCGCGC TCTTCCACAG GCTGTTTTCG GCGGGTCACG CTCCATTCAG CCTTGCGCCT
TCCTCCCAGG GGCGGCCGAT CATGTTCAAA AAGGCAGCCG AGCTCGGCGG AGAAGCCTAC
AGGCTTCATT TTTCCGACGA GGAGATCCGG CTCGACTATG GAGCTGCCGC CGGGAGGCAA
TACGGCCTGA CGTCCCTCGC CCAACTGATC GACGGTGCCC GCAACCACGC CGGGAGTTTC
AGTTTTCCGG TATCCGGTGT AATCGCGGAC GGGCCGCGCT ACGGGTGGCG CGGCTGTCAT
CTCGATGTGT CGCGGCAATT CTATCCGACC GCCGACATTC TGCGCCTCAT TGATATCCTC
GCCTGGTTCA AGCTCAACAT CTTCCATTGG CACTTGACCG ATGACGAAGC CTGGCGGCTT
GAAATCAAGG CCTATCCGAC GCTCACGACC CTGGGCGTCA TGCGGGGGCC GGACGAGCCG
ATGCTGCCTC AGCTCGGCAA CGGCGCCGAA CCGGTCGGCG GCTTCTACAG CCAGGAGGAA
GCAAAGGCGA TCGTCGCGCA TGCTGCGGCG CTCAGCATCG AAGTCGTTCC GGAGATCGAC
ATTCCCGGAC ACAGCACTGC CGCGCTCGTG GCGATTGCCG AACTCTCCGA TGGACAGGAG
GCGCCGGAAA GCTACCATTC GGTCCAGGGC TATCCCAACA ACGCCCTTAA CCCTGCCATT
CCGCTCACCT ACGAATTCCT CGAGAAGGTG TTCGACGAAA TGGTCGAGCT CTTCCCGAGC
CGATATATCC ATGTCGGCGG TGACGAGGTG GCAGACGGTT CGTGGCTCGC TTCGCCGCTG
GCGCGAAAGC TTATGGAACA GGAAGGCATT TCCGGCACCT TCGCACTGCA GTCCTATTTC
CTAAAGAAAG TGAAACAGAT GCTGACGGCG CGAGGCCGCA AGCTTGTCGG CTGGAACGAG
GTTGCCCATG GCGGCGGGGT CGGAACAAAG GACACGCTGC TGATGGCTTG GGAGAACCCC
AAGGTTGGGA TCGAGCTGGC ACGCGAGGGT TACGACGTCG TGATGACGCC CGGCCAGGCC
TATTATCTCG ACATGGCCCA GGCGGATGCC TGGCAGGAAC CCGGTGCCAG CTGGGCCGGC
ACGGCAACGC CGTCGCACAC CTACGCCTAT GAGGCGGAAG GGGAGTTCCC GAAAGAGCTG
AAGAGCCGCA TGAAGGGCGT CCAGGCCTGC ATCTGGTCCG AGCATTTCCT CTCGCGGGGA
TATTTCAACC GCCTCGTCTT TCCCAGGCTG CCGGCAATTG CCGAGGCGGC CTGGACGCCG
AAGGACAAGA AGGATTGGTT GCGCTTTGCC GCGATCGCGC CATTGAGTCC CGTTCTGTGA
 
Protein sequence
MLQAAYRLES AWQPDGGPFG RFTFSLFNLS GEPLSDFRLV YTSLTRVIDP DACENAVFLR 
RNANFHEFAP PEGLTLAHGE HWTFMISGLH RQAKHCTDGA KSAYLTLADG SHVAVAVSDL
RLEGAVSEPP PERLPEGWLD LPFALQPWPA EIDAAPGDLL PDFLYPAAGS AADEIEAVSN
VLALFHRLFS AGHAPFSLAP SSQGRPIMFK KAAELGGEAY RLHFSDEEIR LDYGAAAGRQ
YGLTSLAQLI DGARNHAGSF SFPVSGVIAD GPRYGWRGCH LDVSRQFYPT ADILRLIDIL
AWFKLNIFHW HLTDDEAWRL EIKAYPTLTT LGVMRGPDEP MLPQLGNGAE PVGGFYSQEE
AKAIVAHAAA LSIEVVPEID IPGHSTAALV AIAELSDGQE APESYHSVQG YPNNALNPAI
PLTYEFLEKV FDEMVELFPS RYIHVGGDEV ADGSWLASPL ARKLMEQEGI SGTFALQSYF
LKKVKQMLTA RGRKLVGWNE VAHGGGVGTK DTLLMAWENP KVGIELAREG YDVVMTPGQA
YYLDMAQADA WQEPGASWAG TATPSHTYAY EAEGEFPKEL KSRMKGVQAC IWSEHFLSRG
YFNRLVFPRL PAIAEAAWTP KDKKDWLRFA AIAPLSPVL
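
The relationship between the gene sequence and the protein sequence can be sketched with a small translator. This is a minimal illustration, not the tool the database used: the `translate` helper and the codon table construction are written here for the example. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted), and it stops at the first stop codon.

```python
from itertools import product

# Codon table in the usual NCBI ordering (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace, up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last 60-nt lines of the gene sequence, copied from the record.
first_line = "ATGCTGCAGG CCGCTTATCG CCTTGAATCT GCCTGGCAAC CGGATGGTGG CCCATTCGGG"
last_line = "AAGGACAAGA AGGATTGGTT GCGCTTTGCC GCGATCGCGC CATTGAGTCC CGTTCTGTGA"

print(translate(first_line))  # MLQAAYRLESAWQPDGGPFG
print(translate(last_line))   # KDKKDWLRFAAIAPLSPVL
```

Both outputs match the start and end of the listed protein sequence, and the final codon TGA is the stop codon that is excluded from the 639 aa count.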