Gene Amir_4634 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_4634 
Symbol 
ID8328832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp5516079 
End bp5520353 
Gene Length4275 bp 
Protein Length1424 aa 
Translation table11 
GC content72% 
IMG OID644945080 
Productglycoside hydrolase family 2 sugar binding 
Protein accessionYP_003102312 
Protein GI256378652 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.893507 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCGTG TCGTCGTAGC CACGGTGGTA GCGCTCTCAC TCCTGACGCC CGCGACCGCG 
CGGGCCCGAC CGGCCGACCC CGTGGACGCG CTCGCGTACC TGGAGGACCC CCGGATGACC
GGGGAGAACC AGGAGCCGCC GCACCCGGAC CTGAAGCCCG AGCAGCGGCT CAGCCTGAAC
GGCGAGTGGC GGATCAGGAT GTTCGACAAG CCGGAGGACG TGCGCGAGAC GGCGGACGGC
TGGCGCACGG TCTCGGTGCC GCACACCTGG CAGACCGACT TCCTGGACCA CCCGGTGTTC
CGCAACATCC CGACGGAGAT GTACCCGGAC GCCCCGCCGT CCGTGCCGCG CGACGTGAAC
CCGACCGGCG TGTACGAGAA GGAGTTCGAC CTCCCGCCGG AGTGGGACCG CACGCTGCTC
CGGTTCGAGG GGGTGACCAG CGGCTACTTC GTGTGGGTCA ACGGGGAGTA CGTCGGGTAC
GACCAGGGCG GGTACACGCC CGCCGAGTTC GACGTGACCA GCAGGCTCAA GCCGGGGCGC
AACGCGGTCC GGGTGCAGGT GCACCGCTGG GGTTCCGGGT CGCACCTGGA GGACTTCGAC
CAGTGGCGGT TCTCCGGGAT CTTCCGCGAG GTGTGGCTGT ACTCCACGCC GAGGACCTAC
CTGGCGGACG TGACGATCAG GACCGACCTG GACGCGGACT ACCGGGACGC GACGCTCAGC
GCGGACGTGG TGCTCGGCGG TCCGGCCGAG GGGCACTCGG TGCGGACCAG GCTGTTCGAC
CCGAGCGGGA CCGAGGTGCC GGTCGTGGAC GGGCGGGTGT CGAACCCGTT GAAGTGGACC
GACGAGACGC CGAACGTCCA CGAGCTGCGC GTTGAGCTGC TGCGGGACGG GCAGGTGGTC
CAGACCGGCA GGCAGCCCGT GGGGTTCCGG GAGATCGAGA TCGTGGACCG GCAGCTCAAG
GTCAACGGGA AGCGGGTGCT GTTCCGGGGG GTCAACCGGG CCGAGACGAG CGTGCGCGGC
AGCCGTCACG TCACCCGCGA GGAGCAGGAG GAGGACGTGC GGTTGATGAA GCGGTTCAAC
GTCAACGCGG TGCGGACCTC CCACTACCCC AGCGATCCGC ACTTCTACGA GCTGGCCGAC
CGCGCCGGGC TGATGATCGC CGACGAGGTC GACACCGAGA CGCACCACCA CGACAACTGC
CCCACCGACT GCCTGGCCGA GCGCCCGGAG TGGCAGGACG CGTTCCTCGA CCGGTTCACC
GGCATGATGC AGCGGGACAA GAACCACCCG AGCGTCGTCA TGTGGGACAC CGGGAACGAG
GCCGGGCTCG GCAGGGCGCA CCACGCGATG GCCGAGCTGG CGCGGGCCCG CGACACCCGG
CCGCTCTACC ACCAGCCGAA CGTGCCGGAC GGCGACGCGC CGTTCGCGGA CGTGGCGGGG
CCGCGCTACC CGTCGCCGTC GAGCCTGGAG GCGAAGGCGC GCACGACGAC CAAGCCGATC
ATCATGGGCG AGTACGCGCA CGCGATGGGC AACAGCCTGG GCAACTTCAA GGAGTTCTGG
GACGTCGTGC GCGCCTACCC GCAGGTGCAG GGCGGCTTCA TCTGGGACTG GGCCGAGCAG
AACATCGCCC TGCCCCTCCA CACCACCCCG GACGACGCGA ACGGCATCCT GGCGTGGCTG
TCCGGCAAAC CGTCCCGAGT GGACGGACCG CACGGGAAGG CGTTGCACCT GAGCGGGTTG
GACGACTTCG TCGAGGTCTA CCGGGACCGG AGGTTCGACG AGGTGCGGGA CGGGCTCACC
CTGGACGCCT GGGTCAAGCC GGACGCGTGG ACCGGCGACT TCACGGTCAT CGCCAAGGGC
GATCACCAGT ACGCGCTGAA GATGTCCGAC GCCGCCACGC TGGAGTTCTT CGTGCACAGC
GGGACCTGGC GCACCGTCCG GGCGAGGGTG CCCGCCGACT GGACCGGGAA CTGGCACCGC
GTCACCGGCA CGTTCGACGG CGCGGCGCTG CGGTTGCTGA TCGACGGTGA GCAGGTCGCG
GAGACCGCCT TCACCGGGAC GGTCGACCCG TCGCACTGGC CGGTGAACAT CGGGCGCAAC
CCCGAGACCA TGCAGGAGAA CGTGCGCACC CGGATGGCGC ACGGCGCGAT CGACCAGGTG
CGGATCTACC ACCGGGCGCT GACCGGGTCC GAGCTGGCGG CCGACCCCAA GGGCTCGGCG
GTGCTGGCGC TGGACTTCGA GACCGTGGAG GACGAGGGGC GGCAGCAATC CTACGGCGCG
GGCACCGGCG GCGTGGACGG GCTGGTGTGG GCGGACCGCA GGCCGCAGCC GGAGACCACC
GAGCTGATGG CGGTGCACTC CCCCATCCGG TTCTCGTTCG CGGACAACCG GTTGACGGTC
GTGAGCGAGC GGCAGTTCAC CGGCACCGAC GACCTGGAGC TGCGCTGGGA GGTCAGGGAC
AACGGGCGCG TGGTGGACGA CCACCGGGGC CCGCTCGTGC CGGGCGTGGT CGAACTGCCG
GACCGGTCGG CGGTCACCGA GCGCCTGCTG ACCGTGCGCG CGACCGACCG GGCGGGCGAT
GACGTCGGCA TCGCCCAGTT CCCGCTCGGC GGCGAGCGGA TCGCCGGGCT GCACACGGGC
CTCCGGTCCG GCAGCACCAC CACGACGCAG GACGAGAACG AGGTCGTGGT GTCCGGCCAG
GGCTTCCGCT ACGCGATCAG CAAGAGGACC GGGACGCTCA CGTCGATGCG CGTGCGCGGG
GACGAGCTGC TCACCGCCGG TCCGAAGCTG GACGCGTGGC GCGCGCCGCT GTCCAACGAG
TTCATGAGCG AGGACGGCTC CTGGTACCGC AACGGCCTCG ACCGGTTGAC CACCACGCCG
TCGAGCGTGG AGGTGGGCCG GGACTCGGTG GACGCGGTCG TCACCGTGAA GTCGACCGCG
CAGGCGGTGG CGGGGTCGTC GTTCGGGCAG ACCTTCACCT ACCGGATCAC CGGCGACGGC
GAGATCCACG TCGGGCACCG GGTCGCCGCG CAGGGCGCGA TGCGGGACCT GAGCTACCTG
CCGGGCATCG GTTTCACGCT GAAGGTTCCG GAGCGGTACC GGCAGTTCAC CTGGTACGGG
CGCGGGCCGG GCGAGAACTA CGACGACCGC AAGTCCGGCG ACCCGATCGG CCTGTACAAG
TCCACTGTGG ACGGGCAGTT CCACGACTAC TACAAGCCGC AGGACTTCGG GAACCACGCG
GACACCCGGT GGGCGACGCT GTCCGACGGG CGCTCCGGCC TGCTGGTGGC GGGCGACCTG
GACGTGCGGG TGTCGCGGCA CGACGACCTG GACCGGGCCG CCTACCCGTT CGCGCTCAAG
CAGAACGACG GCTGGACCAC GCTGCACGCC GCGCACCGGG TCACCGGTGT GGGCGAGACG
TTCCACGAGC CGCTCCAGCC GTACCAGGTC GAGGCCGGGA CCGAGTACGC GTACTCGGTG
CTGCTGCGCC CGCTCACCCC GGCCGAGGCG GCCACGGGCG AGCTGGGCGG CCAGGTGGAC
TGCGTGCCCG CCGTCGAGCT GAACGCCCCC GACACCGCGC TGGAGCCGGG CGGGCGGGTG
ACGGCAGAGC TGGTGGTCAC CGAGCCGTGC GCCTCGCCCG CGCGGGCCAG GGTGGGCGTG
CCGGACGGCT GGACGGCGAC GCCCGCGTCG GTGGACCTGT CCGGCGGGTC GGCGCGGGTG
GTGATCACCC GCGAGGGCGG GCCGACCGGG ACCAGGCCGG TGTTCGTGGA CGTGGTGGCG
GGCAAGGGGA CCACGACGCT GAGCCGGGAC TTCACCGCCG TGCCGAGCGC GCCGCGCGGT
GAGGCTCGGG TGTCCGCGCT GGAGTTCCTG GACGAGCGCA ACGGGTGGGG GCCGATCGAG
CGCGACCGCA GCAACGGTGA GGACGTCGGC GGTGACGGCA ACCCGATCCG CTTGCGGGGC
ACCGGGTTCG ACGCGGGCGT CGGGGTGCAC GCGGACTCGG AGTTCCGGGT GCACACCGGC
GGGCGGTGCT CCCGGTTGAC GGCCGTGGTC GGGGTGGACG ACGAGACCGG CGGCACGGGC
AGCGTCCGGT TCGAGGTGCT CGCGGACGGG CGGCAGGTGC ACCTGAGCCC GGTGCTGACC
GGGCGGAGCG CGGCCGAGGC GATCAGCGTC GACACCTCCG GGGCGCGGGT GCTCTCGTTC
CGGGTGACCG ACGGCGGTGA CGGCAACGCG CACGACCACG CCGACTGGGC GAACCCGGTG
CTGAGCTGCG GGTGA
 
Protein sequence
MRRVVVATVV ALSLLTPATA RARPADPVDA LAYLEDPRMT GENQEPPHPD LKPEQRLSLN 
GEWRIRMFDK PEDVRETADG WRTVSVPHTW QTDFLDHPVF RNIPTEMYPD APPSVPRDVN
PTGVYEKEFD LPPEWDRTLL RFEGVTSGYF VWVNGEYVGY DQGGYTPAEF DVTSRLKPGR
NAVRVQVHRW GSGSHLEDFD QWRFSGIFRE VWLYSTPRTY LADVTIRTDL DADYRDATLS
ADVVLGGPAE GHSVRTRLFD PSGTEVPVVD GRVSNPLKWT DETPNVHELR VELLRDGQVV
QTGRQPVGFR EIEIVDRQLK VNGKRVLFRG VNRAETSVRG SRHVTREEQE EDVRLMKRFN
VNAVRTSHYP SDPHFYELAD RAGLMIADEV DTETHHHDNC PTDCLAERPE WQDAFLDRFT
GMMQRDKNHP SVVMWDTGNE AGLGRAHHAM AELARARDTR PLYHQPNVPD GDAPFADVAG
PRYPSPSSLE AKARTTTKPI IMGEYAHAMG NSLGNFKEFW DVVRAYPQVQ GGFIWDWAEQ
NIALPLHTTP DDANGILAWL SGKPSRVDGP HGKALHLSGL DDFVEVYRDR RFDEVRDGLT
LDAWVKPDAW TGDFTVIAKG DHQYALKMSD AATLEFFVHS GTWRTVRARV PADWTGNWHR
VTGTFDGAAL RLLIDGEQVA ETAFTGTVDP SHWPVNIGRN PETMQENVRT RMAHGAIDQV
RIYHRALTGS ELAADPKGSA VLALDFETVE DEGRQQSYGA GTGGVDGLVW ADRRPQPETT
ELMAVHSPIR FSFADNRLTV VSERQFTGTD DLELRWEVRD NGRVVDDHRG PLVPGVVELP
DRSAVTERLL TVRATDRAGD DVGIAQFPLG GERIAGLHTG LRSGSTTTTQ DENEVVVSGQ
GFRYAISKRT GTLTSMRVRG DELLTAGPKL DAWRAPLSNE FMSEDGSWYR NGLDRLTTTP
SSVEVGRDSV DAVVTVKSTA QAVAGSSFGQ TFTYRITGDG EIHVGHRVAA QGAMRDLSYL
PGIGFTLKVP ERYRQFTWYG RGPGENYDDR KSGDPIGLYK STVDGQFHDY YKPQDFGNHA
DTRWATLSDG RSGLLVAGDL DVRVSRHDDL DRAAYPFALK QNDGWTTLHA AHRVTGVGET
FHEPLQPYQV EAGTEYAYSV LLRPLTPAEA ATGELGGQVD CVPAVELNAP DTALEPGGRV
TAELVVTEPC ASPARARVGV PDGWTATPAS VDLSGGSARV VITREGGPTG TRPVFVDVVA
GKGTTTLSRD FTAVPSAPRG EARVSALEFL DERNGWGPIE RDRSNGEDVG GDGNPIRLRG
TGFDAGVGVH ADSEFRVHTG GRCSRLTAVV GVDDETGGTG SVRFEVLADG RQVHLSPVLT
GRSAAEAISV DTSGARVLSF RVTDGGDGNA HDHADWANPV LSCG