Gene Amir_1788 details

Gene Information

Locus tag: Amir_1788
Symbol:
ID: 8325973
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 1964330
End bp: 1965748
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 71%
IMG OID: 644942337
Product: beta-galactosidase
Protein accession: YP_003099582
Protein GI: 256375922
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID: [TIGR03356] beta-galactosidase
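
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1964330
end_bp = 1965748

# Assumption: both coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is excluded
# from the reported protein length.
codon_count, leftover_bases = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, leftover_bases, protein_length_aa)
```

Under these assumptions the script reproduces the listed gene length of 1419 bp and protein length of 472 aa, with no bases left over after dividing into codons.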


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.586379
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGAGTCCA TTTCCTTTCC CGAGGGCTTC GTGTGGGGGG CCGCGACAGC GGCGTTCCAG 
GTCGAGGGGG CGTCCAAGGA GGACGGTCGC TCCCCTTCCA TCTGGGACAC CTTCTGCGCG
CTGCCCGGCG CCGTCGCCGG TGGTGACAAC GGTGACGTGG CCGTGGACCA CTACCACCGG
GTCGAGCAGG ACGTCGCGAT GATGGCGGAC CTCGGCCTCG GCGCGTACCG CTTCTCCACC
GCCTGGCCGA GGATCCGGCC CGACGGCGGC GAGCCCAACC AGGCCGGGCT GGACTTCTAC
AGCAGGCTGG TCGACACCCT GCTGGAGCGC GGCATCGACC CGTGGGTCAC GCTCTACCAC
TGGGACCTGC CGCAGGCCCT GGAGGACGCG GGCGGCTGGG CCAACCGGGA CACCGCGCAC
CGGTTCGCCG ACTACGCGGC CACGGTCGTG GAGGCGCTCG GCGACCGGGT GTCCAACTGG
ACCACGCTGA ACGAGCCGTG GTGCTCGGCG TTCCTCGGCT ACGCGGGCGG CATCCACGCG
CCCGGCCGCC AGGAGCCCGC CGCCGCCGTC GCGGCCGTCC ACCACCTGCT GCTCGGCCAC
GGCCTCGCCA CCGCGGCGAT CCGCTCGGCC AAGCCGGAGG CCAAGGTCGG CATCACGCTC
AACATGTACC CGATCATCCC CGCCGACCCC TCGTCCGAGG CGGACCTGGA CGCGGTGCGG
CGGCTCGACG GGCTGCAGAA CCGGATCTTC CTGGACCCGC TGTTCAAGGG CGAGTACCCG
GCGGACATCG TCGCTGACCT CGCGCCGTAC GGGTTCGCCG ACCACATCAA GCCCGAGGAC
CTGGCGATCA TCTCGGCGCC GCTGGACCAG CTCGGCGTGA ACTACTACAC CGAGCACTTC
GTCAGCTCCG AGCCCGCCGC GCCCAGCGAG CCCAAGCCGG GCCGCCGCGC CACCGGGTCG
CCGTGGGTCG GGGCCGAGCA CGTCAGCTTC CCGGTCCGGG ACGACGCGAC GCGCACCGAC
ATGGAGTGGG AGGTGCGACC GCGCGGGATC TACCAGCTCC TCACCCGGCT GCACGAGGAG
TACCCGCGCC TGCCGATCTA CATCACCGAG AACGGCGCGG CGTACCGCGA CGCGGTGTCC
GACGACGGCT CGGTCAACGA CCCGGAGCGC CTGGCCTACA TCGACTCGCA CCTGCGCGCG
GCGCACGACG CGATCACCGA GGGCGTCGAC CTCCGCGGCT ACTTCGCGTG GTCGCTGATG
GACAACTTCG AGTGGGCCGA GGGTTACGCC AAGCGGTTCG GGATCGTGCA TGTCGACTAC
GGCACGCAGG TCAGGACGCC TAAGATGAGC GCCATGTGGT ACTCCGAGGT CGCCCGCGGC
AACGCGCTGC CCGCGCCGTC AGCGACGGCT GCGCCGTGA
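
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation. The sketch below shows one way to strip the whitespace and compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used to keep the example short, so the result is not expected to match the GC content reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block gaps, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as displayed (illustrative subset only).
first_line = "GTGGAGTCCA TTTCCTTTCC CGAGGGCTTC GTGTGGGGGG CCGCGACAGC GGCGTTCCAG"

seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

To check the whole gene, the same two helpers can be applied to all sequence lines joined into one string.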
 
Protein sequence
MESISFPEGF VWGAATAAFQ VEGASKEDGR SPSIWDTFCA LPGAVAGGDN GDVAVDHYHR 
VEQDVAMMAD LGLGAYRFST AWPRIRPDGG EPNQAGLDFY SRLVDTLLER GIDPWVTLYH
WDLPQALEDA GGWANRDTAH RFADYAATVV EALGDRVSNW TTLNEPWCSA FLGYAGGIHA
PGRQEPAAAV AAVHHLLLGH GLATAAIRSA KPEAKVGITL NMYPIIPADP SSEADLDAVR
RLDGLQNRIF LDPLFKGEYP ADIVADLAPY GFADHIKPED LAIISAPLDQ LGVNYYTEHF
VSSEPAAPSE PKPGRRATGS PWVGAEHVSF PVRDDATRTD MEWEVRPRGI YQLLTRLHEE
YPRLPIYITE NGAAYRDAVS DDGSVNDPER LAYIDSHLRA AHDAITEGVD LRGYFAWSLM
DNFEWAEGYA KRFGIVHVDY GTQVRTPKMS AMWYSEVARG NALPAPSATA AP
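
The protein sequence starts with M although the gene sequence starts with GTG, which normally encodes valine. Under translation table 11, GTG can act as an initiator codon and is then read as methionine. The sketch below illustrates this, assuming the first codon of the input is the initiator and that translation stops at the first stop codon. The function name `translate_cds` is hypothetical, and the codon table is built from the standard NCBI amino-acid string, which tables 1 and 11 share for non-initiator codons.

```python
# Codon order follows the NCBI convention: each position cycles through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(text: str) -> str:
    """Translate a coding sequence, ignoring whitespace in the input.

    Assumption: the first codon is an initiator codon and is read as M.
    Translation stops at the first stop codon; trailing partial codons are ignored.
    """
    seq = "".join(text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append("M" if pos == 0 else residue)
    return "".join(protein)


# First line of the gene sequence, copied as displayed.
first_line = "GTGGAGTCCA TTTCCTTTCC CGAGGGCTTC GTGTGGGGGG CCGCGACAGC GGCGTTCCAG"
print(translate_cds(first_line))
```

The printed residues correspond to the first twenty amino acids of the protein sequence shown above (MESISFPEGF VWGAATAAFQ).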