Gene Information |
Locus tag | Smed_4653 |
Symbol | |
ID | 5318816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 1161201 |
End bp | 1163120 |
Gene Length | 1920 bp |
Protein Length | 639 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640776451 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_001313383 |
Protein GI | 150376787 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.292919 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0791763 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCAGG CCGCTTATCG CCTTGAATCT GCCTGGCAAC CGGATGGTGG CCCATTCGGG CGCTTCACCT TCAGCCTCTT TAACCTTTCG GGCGAGCCGC TTAGCGACTT TCGGCTCGTC TACACGTCGC TCACCCGCGT CATCGATCCG GACGCATGCG AGAATGCGGT CTTCCTCCGC AGGAATGCCA ATTTCCACGA GTTCGCGCCG CCCGAAGGAC TGACGCTTGC CCACGGAGAG CACTGGACAT TCATGATCAG CGGTCTGCAC CGGCAGGCGA AGCATTGCAC CGACGGGGCG AAGTCGGCCT ATCTGACGCT CGCCGATGGA AGTCATGTCG CTGTGGCGGT TTCCGACCTT CGTCTGGAGG GCGCCGTAAG CGAGCCGCCG CCGGAGCGCC TCCCCGAAGG CTGGCTCGAC CTTCCTTTCG CGCTGCAGCC CTGGCCCGCG GAGATCGATG CAGCGCCCGG AGACCTGCTT CCGGATTTCC TCTATCCTGC GGCGGGAAGC GCTGCGGACG AGATCGAGGC GGTCTCCAAC GTACTCGCGC TCTTCCACAG GCTGTTTTCG GCGGGTCACG CTCCATTCAG CCTTGCGCCT TCCTCCCAGG GGCGGCCGAT CATGTTCAAA AAGGCAGCCG AGCTCGGCGG AGAAGCCTAC AGGCTTCATT TTTCCGACGA GGAGATCCGG CTCGACTATG GAGCTGCCGC CGGGAGGCAA TACGGCCTGA CGTCCCTCGC CCAACTGATC GACGGTGCCC GCAACCACGC CGGGAGTTTC AGTTTTCCGG TATCCGGTGT AATCGCGGAC GGGCCGCGCT ACGGGTGGCG CGGCTGTCAT CTCGATGTGT CGCGGCAATT CTATCCGACC GCCGACATTC TGCGCCTCAT TGATATCCTC GCCTGGTTCA AGCTCAACAT CTTCCATTGG CACTTGACCG ATGACGAAGC CTGGCGGCTT GAAATCAAGG CCTATCCGAC GCTCACGACC CTGGGCGTCA TGCGGGGGCC GGACGAGCCG ATGCTGCCTC AGCTCGGCAA CGGCGCCGAA CCGGTCGGCG GCTTCTACAG CCAGGAGGAA GCAAAGGCGA TCGTCGCGCA TGCTGCGGCG CTCAGCATCG AAGTCGTTCC GGAGATCGAC ATTCCCGGAC ACAGCACTGC CGCGCTCGTG GCGATTGCCG AACTCTCCGA TGGACAGGAG GCGCCGGAAA GCTACCATTC GGTCCAGGGC TATCCCAACA ACGCCCTTAA CCCTGCCATT CCGCTCACCT ACGAATTCCT CGAGAAGGTG TTCGACGAAA TGGTCGAGCT CTTCCCGAGC CGATATATCC ATGTCGGCGG TGACGAGGTG GCAGACGGTT CGTGGCTCGC TTCGCCGCTG GCGCGAAAGC TTATGGAACA GGAAGGCATT TCCGGCACCT TCGCACTGCA GTCCTATTTC CTAAAGAAAG TGAAACAGAT GCTGACGGCG CGAGGCCGCA AGCTTGTCGG CTGGAACGAG GTTGCCCATG GCGGCGGGGT CGGAACAAAG GACACGCTGC TGATGGCTTG GGAGAACCCC AAGGTTGGGA TCGAGCTGGC ACGCGAGGGT TACGACGTCG TGATGACGCC CGGCCAGGCC TATTATCTCG ACATGGCCCA GGCGGATGCC TGGCAGGAAC CCGGTGCCAG CTGGGCCGGC ACGGCAACGC CGTCGCACAC CTACGCCTAT GAGGCGGAAG GGGAGTTCCC GAAAGAGCTG AAGAGCCGCA TGAAGGGCGT CCAGGCCTGC ATCTGGTCCG AGCATTTCCT CTCGCGGGGA 
TATTTCAACC GCCTCGTCTT TCCCAGGCTG CCGGCAATTG CCGAGGCGGC CTGGACGCCG AAGGACAAGA AGGATTGGTT GCGCTTTGCC GCGATCGCGC CATTGAGTCC CGTTCTGTGA
|
Protein sequence | MLQAAYRLES AWQPDGGPFG RFTFSLFNLS GEPLSDFRLV YTSLTRVIDP DACENAVFLR RNANFHEFAP PEGLTLAHGE HWTFMISGLH RQAKHCTDGA KSAYLTLADG SHVAVAVSDL RLEGAVSEPP PERLPEGWLD LPFALQPWPA EIDAAPGDLL PDFLYPAAGS AADEIEAVSN VLALFHRLFS AGHAPFSLAP SSQGRPIMFK KAAELGGEAY RLHFSDEEIR LDYGAAAGRQ YGLTSLAQLI DGARNHAGSF SFPVSGVIAD GPRYGWRGCH LDVSRQFYPT ADILRLIDIL AWFKLNIFHW HLTDDEAWRL EIKAYPTLTT LGVMRGPDEP MLPQLGNGAE PVGGFYSQEE AKAIVAHAAA LSIEVVPEID IPGHSTAALV AIAELSDGQE APESYHSVQG YPNNALNPAI PLTYEFLEKV FDEMVELFPS RYIHVGGDEV ADGSWLASPL ARKLMEQEGI SGTFALQSYF LKKVKQMLTA RGRKLVGWNE VAHGGGVGTK DTLLMAWENP KVGIELAREG YDVVMTPGQA YYLDMAQADA WQEPGASWAG TATPSHTYAY EAEGEFPKEL KSRMKGVQAC IWSEHFLSRG YFNRLVFPRL PAIAEAAWTP KDKKDWLRFA AIAPLSPVL
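The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS (TGA in the gene sequence above) is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table of this record.
START_BP = 1161201
END_BP = 1163120
GENE_LENGTH_BP = 1920
PROTEIN_LENGTH_AA = 639


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

# Both comparisons hold for the values listed in this record.
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under those assumptions, the 1920 bp span corresponds to 640 codons, of which 639 encode amino acids, matching the Gene Length and Protein Length fields.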