Gene Amir_4612 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_4612 
Symbol 
ID8328810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp5493373 
End bp5495688 
Gene Length2316 bp 
Protein Length771 aa 
Translation table11 
GC content77% 
IMG OID644945059 
Productglycoside hydrolase family 3 domain protein 
Protein accessionYP_003102291 
Protein GI256378631 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCACCG CTTCCGCGCT GTCGCCGCAG AACCGGGTCG GCCAGGTCAA CCAGCGCCTC 
AAGGGCTGGG AGGCGCTGCG CTGGGTCGAC GGGGCGCCTC GGGTGACCGA CGTGCTCAAG
CGGGAGGTCG ACCGGTTCGG CGGCCTCGGC GCGATCTACG GCGTGCTGCG CGCCGACCCG
TGGTCGGAGG TGAACTGGCG CAACGGGATT CCGCCCGAGC GCAGCGCGGA GGCGTGCGCG
GCGGTGCAGG AGTACGTGAC GCGCAACGGT TGTGGCGTGC CGGTGCTGTT CGTCGAGGAG
GTCCCGCACG GGTTGCAGGC GCTCGGCGGG ACGACGCTGC CGGTGAACCT CGCGCTCGGG
GCGGGCATGG ACGCCGGGCT GACCGAGGAG CTGGCGGCGG CGGTGGCCGC CGAGGTCCGG
GCCAGGGGCA CGCACGTGGC GCTGGTGTCG GGGCTGGACG TGCTGCGCGA TCCGCGCTGG
GGGCGCGCCG AGGAGTGCTT CGGCGAGGAC CCGGCGCTGG CCGCGCTGCT GGTGGCCGCG
ACCGTGCGCG GGATGCAGGG GACGGCTCCG GGTCCGATCG GCGCGGGGCG CGTCGCGGTG
GTGGCCAAGC ACCTGGCGGC CCAGGGCGCC GGGATCGGCG GGCGCAACGG TTCCGGCGCG
CCGATCGGGC CCAGGGAGCT GGGCGAGGTC CACCTGCGGC CCGCGCACGC GGCGGCGCGC
GCCGGGGTGG CCGGGTTCAT GGCCGCCTAC AACGACGTCG ACGGCGTGCC GTGCACCGGG
AACCGGGAGC TGCTGACGGG CGTGCTGCGG GAGGACTGGG GCTGGGACGG GATCGTCATG
GCCGACGGCA CCGCGATCGA CCGGTTGCGC GACAGCACCC CCGACCCGGC GGCCGCGGCG
GCGCTGGCCC TGCGCGCCGG GGTCGACCTG AGCCTGTGGG ACGAGGCGTT CACGCACCTC
GGGGAGGCGC TGGACCGGGG GCTGGTGGCG GAGGCCGAGC TGGACCGCGC GGTCGACCGG
GTGCTCGCGC TCAAGCGGCG GGTCGGGCTG CTGGACGAGC CGGCGGCGTC CGGGCCTGCG
GCGTCGGGGC CGGCGGCGTC GGTGCCGGTG GCGTCCGGGC CGGTGGCGTC GGGGCCGGTG
GTGGACCTGC CCGCGTCGCG TAACGTGGCC CGGCTGGTCG ACCGGGCCGC CCGGCAGGCC
GTGGTCCTGG TGCGGGACGA CGGGGTGCTG CCGCTCGACC CGTCCGGCGT GGTCGCGCTG
ATCGGCCCGA ACGCCGACGA CCTGGACGCG CAGCTCGGCG ACTACACCCC GCCTCGGCCC
GCCGACGACC CCGGCGCGTC GACCGTGCGC TCGGCGCTCG TGGCGCGGCT GGGCGAGGAG
CGGGTGCCGC ACGCGCCCGG CTCGCGGGTG CGCAGCGCGC TCGGCCCGGA CGCGCTCGCG
GCGGCGCGGG ACGCGGTCGA CCGCGCCGAC GTCTGCGTGC TGGTGCTCGG CGGCACGAGC
AAGCGGAGCT ACGACGACGA GTTCGCCGAC AACGGGGCGG TCGCGGAGTC GGCGGCGGAC
ACGACCAACG GGGAGGGCGT CGACCTGGCC TCGATCGCGC TTCCCCTGCC GCAACTGGAA
CTCGCGCGTG CGGCCCGGTC GAGCGGCAAG CCGGTCGTGG CCGTGGTCGT CGACGGCCGA
CCGCGCGCGC TCACCGAACT GGCCGGGCTG GTGGACGCGC TGCTCGTGGT GCCCTTCCCC
GGCCCGAGCG GGGGTGCGGC GGTCGTGGGC GCGCTGCTCG ACGGCACCGC GTCGGGTCGG
CTGCCCGCGT CGTTCCCGGT GGCCGACGGG GTCTTCCCGG TGGCGCACGA CGAGCGGGTG
GAGACCGCGC GCGGGTACGC CGACCAGCGG CGCCCGGTGG GCATCCCCTT CGGCAGCGGT
TCGCCGCCCT CGGTCACCAC GCGGGTCCGC GAAGGCGAGC ACCGGATTTC CGCCGCCGCG
CTGGAGAGCG GCGGTTCGCT GCGGGTCGCG GTGGAGGTGG TGAGCACCGG CGGGCCGCGA
TCCGTGGCCG TGCCGCTCTA CGGCCGCCGC CACGAGCTGG GCGTGCGCCC GCGCCGCCGG
ACCCTGCTGG CCGTGCGGCG CGTGCTGTGC GAGCCGGGCG AATCCGTGGT GGAGTTCGCG
CTGGGCCTGG ACGAGCTGGG CTCGTGGGCC ACCGGGCGGC CGGTCGCGCT CCCGGTGGAG
ATCGGCGCGT GGAGCGGCGA CGAGGTGGAC GAACCGGCTG ACGCGGTGCG GATCAGCGTC
ACCGACGAGG GAGGGAGCAC GCTGTGGCGA CGGTGA
 
Protein sequence
MITASALSPQ NRVGQVNQRL KGWEALRWVD GAPRVTDVLK REVDRFGGLG AIYGVLRADP 
WSEVNWRNGI PPERSAEACA AVQEYVTRNG CGVPVLFVEE VPHGLQALGG TTLPVNLALG
AGMDAGLTEE LAAAVAAEVR ARGTHVALVS GLDVLRDPRW GRAEECFGED PALAALLVAA
TVRGMQGTAP GPIGAGRVAV VAKHLAAQGA GIGGRNGSGA PIGPRELGEV HLRPAHAAAR
AGVAGFMAAY NDVDGVPCTG NRELLTGVLR EDWGWDGIVM ADGTAIDRLR DSTPDPAAAA
ALALRAGVDL SLWDEAFTHL GEALDRGLVA EAELDRAVDR VLALKRRVGL LDEPAASGPA
ASGPAASVPV ASGPVASGPV VDLPASRNVA RLVDRAARQA VVLVRDDGVL PLDPSGVVAL
IGPNADDLDA QLGDYTPPRP ADDPGASTVR SALVARLGEE RVPHAPGSRV RSALGPDALA
AARDAVDRAD VCVLVLGGTS KRSYDDEFAD NGAVAESAAD TTNGEGVDLA SIALPLPQLE
LARAARSSGK PVVAVVVDGR PRALTELAGL VDALLVVPFP GPSGGAAVVG ALLDGTASGR
LPASFPVADG VFPVAHDERV ETARGYADQR RPVGIPFGSG SPPSVTTRVR EGEHRISAAA
LESGGSLRVA VEVVSTGGPR SVAVPLYGRR HELGVRPRRR TLLAVRRVLC EPGESVVEFA
LGLDELGSWA TGRPVALPVE IGAWSGDEVD EPADAVRISV TDEGGSTLWR R