Gene Information |
Locus tag | Amir_4634 |
Symbol | |
ID | 8328832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | + |
Start bp | 5516079 |
End bp | 5520353 |
Gene Length | 4275 bp |
Protein Length | 1424 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644945080 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_003102312 |
Protein GI | 256378652 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
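
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 5516079
end_bp = 5520353

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the "Gene Length" field
print(protein_length, "aa")   # compare with the "Protein Length" field
```

Under these assumptions, the computed values agree with the listed Gene Length (4275 bp) and Protein Length (1424 aa).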
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.893507 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCGTG TCGTCGTAGC CACGGTGGTA GCGCTCTCAC TCCTGACGCC CGCGACCGCG CGGGCCCGAC CGGCCGACCC CGTGGACGCG CTCGCGTACC TGGAGGACCC CCGGATGACC GGGGAGAACC AGGAGCCGCC GCACCCGGAC CTGAAGCCCG AGCAGCGGCT CAGCCTGAAC GGCGAGTGGC GGATCAGGAT GTTCGACAAG CCGGAGGACG TGCGCGAGAC GGCGGACGGC TGGCGCACGG TCTCGGTGCC GCACACCTGG CAGACCGACT TCCTGGACCA CCCGGTGTTC CGCAACATCC CGACGGAGAT GTACCCGGAC GCCCCGCCGT CCGTGCCGCG CGACGTGAAC CCGACCGGCG TGTACGAGAA GGAGTTCGAC CTCCCGCCGG AGTGGGACCG CACGCTGCTC CGGTTCGAGG GGGTGACCAG CGGCTACTTC GTGTGGGTCA ACGGGGAGTA CGTCGGGTAC GACCAGGGCG GGTACACGCC CGCCGAGTTC GACGTGACCA GCAGGCTCAA GCCGGGGCGC AACGCGGTCC GGGTGCAGGT GCACCGCTGG GGTTCCGGGT CGCACCTGGA GGACTTCGAC CAGTGGCGGT TCTCCGGGAT CTTCCGCGAG GTGTGGCTGT ACTCCACGCC GAGGACCTAC CTGGCGGACG TGACGATCAG GACCGACCTG GACGCGGACT ACCGGGACGC GACGCTCAGC GCGGACGTGG TGCTCGGCGG TCCGGCCGAG GGGCACTCGG TGCGGACCAG GCTGTTCGAC CCGAGCGGGA CCGAGGTGCC GGTCGTGGAC GGGCGGGTGT CGAACCCGTT GAAGTGGACC GACGAGACGC CGAACGTCCA CGAGCTGCGC GTTGAGCTGC TGCGGGACGG GCAGGTGGTC CAGACCGGCA GGCAGCCCGT GGGGTTCCGG GAGATCGAGA TCGTGGACCG GCAGCTCAAG GTCAACGGGA AGCGGGTGCT GTTCCGGGGG GTCAACCGGG CCGAGACGAG CGTGCGCGGC AGCCGTCACG TCACCCGCGA GGAGCAGGAG GAGGACGTGC GGTTGATGAA GCGGTTCAAC GTCAACGCGG TGCGGACCTC CCACTACCCC AGCGATCCGC ACTTCTACGA GCTGGCCGAC CGCGCCGGGC TGATGATCGC CGACGAGGTC GACACCGAGA CGCACCACCA CGACAACTGC CCCACCGACT GCCTGGCCGA GCGCCCGGAG TGGCAGGACG CGTTCCTCGA CCGGTTCACC GGCATGATGC AGCGGGACAA GAACCACCCG AGCGTCGTCA TGTGGGACAC CGGGAACGAG GCCGGGCTCG GCAGGGCGCA CCACGCGATG GCCGAGCTGG CGCGGGCCCG CGACACCCGG CCGCTCTACC ACCAGCCGAA CGTGCCGGAC GGCGACGCGC CGTTCGCGGA CGTGGCGGGG CCGCGCTACC CGTCGCCGTC GAGCCTGGAG GCGAAGGCGC GCACGACGAC CAAGCCGATC ATCATGGGCG AGTACGCGCA CGCGATGGGC AACAGCCTGG GCAACTTCAA GGAGTTCTGG GACGTCGTGC GCGCCTACCC GCAGGTGCAG GGCGGCTTCA TCTGGGACTG GGCCGAGCAG AACATCGCCC TGCCCCTCCA CACCACCCCG GACGACGCGA ACGGCATCCT GGCGTGGCTG TCCGGCAAAC CGTCCCGAGT GGACGGACCG CACGGGAAGG CGTTGCACCT GAGCGGGTTG GACGACTTCG TCGAGGTCTA CCGGGACCGG AGGTTCGACG AGGTGCGGGA CGGGCTCACC 
CTGGACGCCT GGGTCAAGCC GGACGCGTGG ACCGGCGACT TCACGGTCAT CGCCAAGGGC GATCACCAGT ACGCGCTGAA GATGTCCGAC GCCGCCACGC TGGAGTTCTT CGTGCACAGC GGGACCTGGC GCACCGTCCG GGCGAGGGTG CCCGCCGACT GGACCGGGAA CTGGCACCGC GTCACCGGCA CGTTCGACGG CGCGGCGCTG CGGTTGCTGA TCGACGGTGA GCAGGTCGCG GAGACCGCCT TCACCGGGAC GGTCGACCCG TCGCACTGGC CGGTGAACAT CGGGCGCAAC CCCGAGACCA TGCAGGAGAA CGTGCGCACC CGGATGGCGC ACGGCGCGAT CGACCAGGTG CGGATCTACC ACCGGGCGCT GACCGGGTCC GAGCTGGCGG CCGACCCCAA GGGCTCGGCG GTGCTGGCGC TGGACTTCGA GACCGTGGAG GACGAGGGGC GGCAGCAATC CTACGGCGCG GGCACCGGCG GCGTGGACGG GCTGGTGTGG GCGGACCGCA GGCCGCAGCC GGAGACCACC GAGCTGATGG CGGTGCACTC CCCCATCCGG TTCTCGTTCG CGGACAACCG GTTGACGGTC GTGAGCGAGC GGCAGTTCAC CGGCACCGAC GACCTGGAGC TGCGCTGGGA GGTCAGGGAC AACGGGCGCG TGGTGGACGA CCACCGGGGC CCGCTCGTGC CGGGCGTGGT CGAACTGCCG GACCGGTCGG CGGTCACCGA GCGCCTGCTG ACCGTGCGCG CGACCGACCG GGCGGGCGAT GACGTCGGCA TCGCCCAGTT CCCGCTCGGC GGCGAGCGGA TCGCCGGGCT GCACACGGGC CTCCGGTCCG GCAGCACCAC CACGACGCAG GACGAGAACG AGGTCGTGGT GTCCGGCCAG GGCTTCCGCT ACGCGATCAG CAAGAGGACC GGGACGCTCA CGTCGATGCG CGTGCGCGGG GACGAGCTGC TCACCGCCGG TCCGAAGCTG GACGCGTGGC GCGCGCCGCT GTCCAACGAG TTCATGAGCG AGGACGGCTC CTGGTACCGC AACGGCCTCG ACCGGTTGAC CACCACGCCG TCGAGCGTGG AGGTGGGCCG GGACTCGGTG GACGCGGTCG TCACCGTGAA GTCGACCGCG CAGGCGGTGG CGGGGTCGTC GTTCGGGCAG ACCTTCACCT ACCGGATCAC CGGCGACGGC GAGATCCACG TCGGGCACCG GGTCGCCGCG CAGGGCGCGA TGCGGGACCT GAGCTACCTG CCGGGCATCG GTTTCACGCT GAAGGTTCCG GAGCGGTACC GGCAGTTCAC CTGGTACGGG CGCGGGCCGG GCGAGAACTA CGACGACCGC AAGTCCGGCG ACCCGATCGG CCTGTACAAG TCCACTGTGG ACGGGCAGTT CCACGACTAC TACAAGCCGC AGGACTTCGG GAACCACGCG GACACCCGGT GGGCGACGCT GTCCGACGGG CGCTCCGGCC TGCTGGTGGC GGGCGACCTG GACGTGCGGG TGTCGCGGCA CGACGACCTG GACCGGGCCG CCTACCCGTT CGCGCTCAAG CAGAACGACG GCTGGACCAC GCTGCACGCC GCGCACCGGG TCACCGGTGT GGGCGAGACG TTCCACGAGC CGCTCCAGCC GTACCAGGTC GAGGCCGGGA CCGAGTACGC GTACTCGGTG CTGCTGCGCC CGCTCACCCC GGCCGAGGCG GCCACGGGCG AGCTGGGCGG CCAGGTGGAC TGCGTGCCCG CCGTCGAGCT GAACGCCCCC GACACCGCGC TGGAGCCGGG CGGGCGGGTG ACGGCAGAGC 
TGGTGGTCAC CGAGCCGTGC GCCTCGCCCG CGCGGGCCAG GGTGGGCGTG CCGGACGGCT GGACGGCGAC GCCCGCGTCG GTGGACCTGT CCGGCGGGTC GGCGCGGGTG GTGATCACCC GCGAGGGCGG GCCGACCGGG ACCAGGCCGG TGTTCGTGGA CGTGGTGGCG GGCAAGGGGA CCACGACGCT GAGCCGGGAC TTCACCGCCG TGCCGAGCGC GCCGCGCGGT GAGGCTCGGG TGTCCGCGCT GGAGTTCCTG GACGAGCGCA ACGGGTGGGG GCCGATCGAG CGCGACCGCA GCAACGGTGA GGACGTCGGC GGTGACGGCA ACCCGATCCG CTTGCGGGGC ACCGGGTTCG ACGCGGGCGT CGGGGTGCAC GCGGACTCGG AGTTCCGGGT GCACACCGGC GGGCGGTGCT CCCGGTTGAC GGCCGTGGTC GGGGTGGACG ACGAGACCGG CGGCACGGGC AGCGTCCGGT TCGAGGTGCT CGCGGACGGG CGGCAGGTGC ACCTGAGCCC GGTGCTGACC GGGCGGAGCG CGGCCGAGGC GATCAGCGTC GACACCTCCG GGGCGCGGGT GCTCTCGTTC CGGGTGACCG ACGGCGGTGA CGGCAACGCG CACGACCACG CCGACTGGGC GAACCCGGTG CTGAGCTGCG GGTGA
|
Protein sequence | MRRVVVATVV ALSLLTPATA RARPADPVDA LAYLEDPRMT GENQEPPHPD LKPEQRLSLN GEWRIRMFDK PEDVRETADG WRTVSVPHTW QTDFLDHPVF RNIPTEMYPD APPSVPRDVN PTGVYEKEFD LPPEWDRTLL RFEGVTSGYF VWVNGEYVGY DQGGYTPAEF DVTSRLKPGR NAVRVQVHRW GSGSHLEDFD QWRFSGIFRE VWLYSTPRTY LADVTIRTDL DADYRDATLS ADVVLGGPAE GHSVRTRLFD PSGTEVPVVD GRVSNPLKWT DETPNVHELR VELLRDGQVV QTGRQPVGFR EIEIVDRQLK VNGKRVLFRG VNRAETSVRG SRHVTREEQE EDVRLMKRFN VNAVRTSHYP SDPHFYELAD RAGLMIADEV DTETHHHDNC PTDCLAERPE WQDAFLDRFT GMMQRDKNHP SVVMWDTGNE AGLGRAHHAM AELARARDTR PLYHQPNVPD GDAPFADVAG PRYPSPSSLE AKARTTTKPI IMGEYAHAMG NSLGNFKEFW DVVRAYPQVQ GGFIWDWAEQ NIALPLHTTP DDANGILAWL SGKPSRVDGP HGKALHLSGL DDFVEVYRDR RFDEVRDGLT LDAWVKPDAW TGDFTVIAKG DHQYALKMSD AATLEFFVHS GTWRTVRARV PADWTGNWHR VTGTFDGAAL RLLIDGEQVA ETAFTGTVDP SHWPVNIGRN PETMQENVRT RMAHGAIDQV RIYHRALTGS ELAADPKGSA VLALDFETVE DEGRQQSYGA GTGGVDGLVW ADRRPQPETT ELMAVHSPIR FSFADNRLTV VSERQFTGTD DLELRWEVRD NGRVVDDHRG PLVPGVVELP DRSAVTERLL TVRATDRAGD DVGIAQFPLG GERIAGLHTG LRSGSTTTTQ DENEVVVSGQ GFRYAISKRT GTLTSMRVRG DELLTAGPKL DAWRAPLSNE FMSEDGSWYR NGLDRLTTTP SSVEVGRDSV DAVVTVKSTA QAVAGSSFGQ TFTYRITGDG EIHVGHRVAA QGAMRDLSYL PGIGFTLKVP ERYRQFTWYG RGPGENYDDR KSGDPIGLYK STVDGQFHDY YKPQDFGNHA DTRWATLSDG RSGLLVAGDL DVRVSRHDDL DRAAYPFALK QNDGWTTLHA AHRVTGVGET FHEPLQPYQV EAGTEYAYSV LLRPLTPAEA ATGELGGQVD CVPAVELNAP DTALEPGGRV TAELVVTEPC ASPARARVGV PDGWTATPAS VDLSGGSARV VITREGGPTG TRPVFVDVVA GKGTTTLSRD FTAVPSAPRG EARVSALEFL DERNGWGPIE RDRSNGEDVG GDGNPIRLRG TGFDAGVGVH ADSEFRVHTG GRCSRLTAVV GVDDETGGTG SVRFEVLADG RQVHLSPVLT GRSAAEAISV DTSGARVLSF RVTDGGDGNA HDHADWANPV LSCG
|
| |
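
As a quick consistency check between the two sequences, the first 30 bases of the gene sequence can be translated and compared with the start of the protein sequence. This is a minimal sketch: the `CODONS` dictionary is a deliberately partial codon table containing only the seven codons that occur in this fragment, and the `translate` helper is a hypothetical name, not part of any library.

```python
# Partial codon table: only the codons present in the fragment below.
# These assignments are the same in the standard code and in translation table 11.
CODONS = {
    "ATG": "M", "CGT": "R", "GTC": "V", "GTA": "V",
    "GCC": "A", "ACG": "T", "GTG": "V",
}

def translate(dna: str) -> str:
    """Translate a DNA fragment codon by codon, ignoring spaces."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))

# First three 10-base blocks of the gene sequence, as printed in the record.
first_30 = "ATGCGTCGTG TCGTCGTAGC CACGGTGGTA"
peptide = translate(first_30)
print(peptide)  # compare with the first 10 residues of the protein sequence
```

Translating the full gene would need the complete 64-codon table for translation table 11. The fragment here is only meant to show that the reading frame of the gene sequence lines up with the listed protein.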