Gene Amir_0449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_0449 
Symbol 
ID8324608 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp485657 
End bp487000 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content76% 
IMG OID644940993 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_003098262 
Protein GI256374602 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACGCTGC TGCCCGAGCC GGTGTCGGTG GTCTGGGGCG GGGGGACCGT GCCGTGGTCG 
ACGCCGGTCG TGCGGCGCGC GCCAGGACCG GCCGAGGGCT ACCGGATCTC CATCTCCGCG
GGCGGCGTCC ACGTCGACGC GTCCGACGAC GCGGGCGAGT TCTACGCCCA CCAGACCCTG
CGCCAGCTGC GCGGACCCGA CGCGTTCCGG GCCGCGCCGA TCCGGCCGGA CGGGCCCGTG
CCGGTGTGCG AGATCGTCGA CCACCCGGAG CACCGGTGGC GCGGCTGCAT GATCGACGTG
GCGCGGCACT TCCTGCCCAA GCACGACCTG CTGCGGTACG TGGACCTGCT GGCCGCGCAC
AAGCTCAACG TGCTGCACCT GCACCTGACC GACGACCAGG GCTGGCGGGT CGAGTCCGAG
CGCTTCCCGA GGCTGCACGA GGTCGGCGGC TGGCGGCCGG ACTCGCGGTG GGGCGACCGG
CGCGGCGGGC TGAGCACCGG GAGGCCGCAC GGCGGGTGCT ACACGCGGGA CGACCTGCGC
GAGGTCGTGG CGTACGCGGC GGCGCGGCAC GTGACCGTGG TGCCGGAGAT CGACGTGCCG
GGGCACTCGC AGGCGGCCAT CGCGGCGTAC CCGGAGCTGG GCGTCGACGG CGGCGGGGTG
TGGACCGACT GGGGGGTGAA CCCGCGCGTC CTCAACGGGT CGCAGTCCAC AGTGGACTTC
TACCGCGCGG TGTTCGACGA GCTGCTGGAG GTGTTCCCCG GCGAGGTGGT CGGCTTCGGC
GGCGACGAGG CGCCGGGTGG CGACGGGCGG TTCGTGCGCC TGATCGCGGA GCACCTCGTG
GCGCGCGGGC GCAGGCCCTA CGGCTGGGAC GAGGTGCTGG ACGTCGAGGG GCTGCCGGAG
GAGACGGTCA TCGCGGCGTG GCGCTCGGAG GAGGCCGTGG AGCGGGCGCT GGAGCGCGGG
CTGGACGTGG TCGCCTGCCC GGAGCGGCAC GCGTACCTGG ACTACCGGCA GTCCGAGGAC
GCGGACGAGC CGATCCCGGT GGGCACGGTC CTCACCACCG AGGACGTGCG CGCGTACCGG
CCGGTGGCCG GGGTGCTCGG CGCGCAGGCG AACATCTGGA CCGAGCACCT GGACAGCCCG
AGGCGCCTGG ACTACGCCGC GTTCCCGAGG CTCTCGGCGT TCGCGGAGGT GGTGTGGAAC
CCGGCCCCGG TGGACGGGGC CGGGTTCGCG GCGAGGCTGG GCGCGCACCT GCCCAGGCTC
GCCGCGCTCG GCGTCGAGTA CCGGCCGCCG GGCGGACCGC TGCCGTGGCA GCGCCTGCCA
GGGGTGCCCG GTCACCCGCG CTGA
 
Protein sequence
MTLLPEPVSV VWGGGTVPWS TPVVRRAPGP AEGYRISISA GGVHVDASDD AGEFYAHQTL 
RQLRGPDAFR AAPIRPDGPV PVCEIVDHPE HRWRGCMIDV ARHFLPKHDL LRYVDLLAAH
KLNVLHLHLT DDQGWRVESE RFPRLHEVGG WRPDSRWGDR RGGLSTGRPH GGCYTRDDLR
EVVAYAAARH VTVVPEIDVP GHSQAAIAAY PELGVDGGGV WTDWGVNPRV LNGSQSTVDF
YRAVFDELLE VFPGEVVGFG GDEAPGGDGR FVRLIAEHLV ARGRRPYGWD EVLDVEGLPE
ETVIAAWRSE EAVERALERG LDVVACPERH AYLDYRQSED ADEPIPVGTV LTTEDVRAYR
PVAGVLGAQA NIWTEHLDSP RRLDYAAFPR LSAFAEVVWN PAPVDGAGFA ARLGAHLPRL
AALGVEYRPP GGPLPWQRLP GVPGHPR