Gene Amir_1091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_1091 
Symbol 
ID8325265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp1212320 
End bp1213867 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content75% 
IMG OID644941637 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_003098893 
Protein GI256375233 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGGGCGG CCAGCACCGC GCCCCCGCCG AGCCGAGCGC GGGGGCGCCG AACCGCCGCG 
GAGAGCCCCT TGCACACGAT CGTGCCAGCG CCCGTCCTGG TTGAGCCGCG CCCCGGCGCC
GCGTTCACCC TCGCCCCCGA CGCCGAGGTC CGCGTCCCGC CCGGCTCACC CGGCGCGCGC
GACGTCGGCG AGCTGCTGGC GGAGCTGCTG CGCCCGGCGA CCGGCTACCC GCTCCCGGTG
GTGGAGGGCG CGACCGGACC GGGCGTCGTG CTGCTGCTGG AGGGCGCCGC CGCCGAGGTC
GGCGACGAGG GCTACGAGCT CGACACCGCC GGGGACACCG CCGTGCTCAG GGCGAACACC
CCGGCGGGCC TGTGGTCCGG GGTGCAGACG CTGCGGCAGC TGCTGCCCGC CGCCGTCGAG
AGCCCGGAGC GGCAGGACGG CCCGTTCACC GCGCCCGCCG TGCACGTGCT CGACCACCCG
CGCTTCCCGC ACCGGGGCGT GATGCTGGAC GTGGCCAGGC ACTTCTTCGG CGTCGACGAC
GTCAAGCGCT ACCTGGACCT GGCCGTCGCG CACAAGGTCA ACACCCTGCA CCTGCACCTG
AGCGACGACC AGGGCTGGCG CCTGGAGGTC GAGAGCTGGC CGAACCTCAC CGCGCACGGG
TCGACCAGCT CGGTCGGCGG CGGCCCCGGC GGCTTCTACA CCCAGGACGA GTACCGCGAG
ATCGTCGCGT ACGCCGCCCG CAGGCACGTC GTGGTCGTGC CGGAGATCGA CCTGCCCGGC
CACACCGCCG CCGCGCTGTC CTCCTACCCG GAGCTGAACC CCGACGGCGT CGCGCCCAAG
CTCTACACCG GCATCGAGGT CGGCTTCTCC ACCCTCGACA TCGCCTCCGA GACCACCTAC
CGGTTCGTGG CCGACGTGCT GCGCGAGGTC GCCGCGCTGA CCCCCGGCCC GTACCTGCAC
ATCGGCGGCG ACGAGGCGTT CGCGACCGAG GCCGGGGACT ACCGGGCCTT CATGGCGCGG
GTGCTGCCCA TGGTCGAGGA GCACGGCAAG CGCGCCATGG GCTGGTCCGA GTTCACCCGC
GCCGACCTGC CCGCGACGGC GGTCGCGCAG TACTGGGACA CCGGGCGGCC CGCGGGTCCC
GAGCTGGCCG AGGCGGCGGC GCGCGGGGTG CGGTTCGTGC TGTCCCCGGC GAACCGGGTC
TACCTGGACA TGAAGTACGC CGAGCAGACC GAGCTGGGCC TGAAGTGGGC CGGGACCGTG
GAGGTCGACG CCACCTACGG CTGGGACCCG GCGACCCTGC TGGACGGGGT GCCGGAGTCG
GCGGTGCTGG GCGTGGAGGC CCCGCTGTGG ACCGAGACGC TCACGACGAT GAGCGAGCTG
GAGCTCATGG CCTTCCCGCG CCTGGCCGCG GTGGCCGAGG TGGGCTGGAC CGCCGCGACC
GGCCGGGACT GGGCGGACTT CCGGTCCCGG CTGGCCGCGC AGGGCCCGAG GTGGGAGGCG
AAGGGCGTGG CCTTCCACCC CTCACCCACC GTCCCGTGGC AGGGCTGA
 
Protein sequence
MGAASTAPPP SRARGRRTAA ESPLHTIVPA PVLVEPRPGA AFTLAPDAEV RVPPGSPGAR 
DVGELLAELL RPATGYPLPV VEGATGPGVV LLLEGAAAEV GDEGYELDTA GDTAVLRANT
PAGLWSGVQT LRQLLPAAVE SPERQDGPFT APAVHVLDHP RFPHRGVMLD VARHFFGVDD
VKRYLDLAVA HKVNTLHLHL SDDQGWRLEV ESWPNLTAHG STSSVGGGPG GFYTQDEYRE
IVAYAARRHV VVVPEIDLPG HTAAALSSYP ELNPDGVAPK LYTGIEVGFS TLDIASETTY
RFVADVLREV AALTPGPYLH IGGDEAFATE AGDYRAFMAR VLPMVEEHGK RAMGWSEFTR
ADLPATAVAQ YWDTGRPAGP ELAEAAARGV RFVLSPANRV YLDMKYAEQT ELGLKWAGTV
EVDATYGWDP ATLLDGVPES AVLGVEAPLW TETLTTMSEL ELMAFPRLAA VAEVGWTAAT
GRDWADFRSR LAAQGPRWEA KGVAFHPSPT VPWQG