Gene Amir_1044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_1044 
Symbol 
ID8325216 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp1157476 
End bp1159557 
Gene Length2082 bp 
Protein Length693 aa 
Translation table11 
GC content74% 
IMG OID644941588 
ProductAlpha-galactosidase 
Protein accessionYP_003098846 
Protein GI256375186 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGACGAGT CGACGAACGC GGCCGGGGAC GCCCCGGTCG ACCAGGACGG CACGACGGGC 
GACGGCGCGG TGAGCGGCGG GACGGCAGGC GGCAGCTCAG CGGACGACAG CGCGGCGGGC
AGCGGCGCGG CGGGCAGCGG CGCGGCGGAA ACCGGTGCGC CGCAGGAGAT CAGGCCGCCC
GCGCTGGCGC CGACTCCCCC GATGGGCTTC AACAACTGGA ACTCCACCCA GTGCGGGCCC
GAGTTCACCG ACTCGATGAT CCGGGGCATC GCCGACCTCT TCCTCAGCCT CGGCCTCAAG
GACGCGGGCT ACGAGTACGT CAACATCGAC GACTGCTGGG CGCTGCCGCA GCGCGACGCG
GACGGCGACC TGGTGCCCGA CCCGGTCCGG TTCCCCGAGG GCATGAAGCC GCTCGTGGAC
TACGTGCACT CCAAGGGCCT GAAGTTCGGC ATCTACACCA GCGCGGGCAC CAGGACGTGC
AGCGAGCGCG GGTTCCCCGG CGCGCTCGGG CACGAGCGGC AGGACGCCGC GCTGTTCGCC
TCCTGGGGCG TGGACTACCT GAAGTACGAC AACTGCCACA ACCAGGGCGT CGACGCGAGG
CTGCGCTACC GGGCCATGCG CGACGCGATC GCCGCCACCG GCCGCCCGAT CGTGCTGAGC
GTGTGCGAGT GGGGCGAGAA CCGGCCGTGG GAGTGGGCGT TCGAGGTCGG GCAGCTGTGG
CGCACCACCC CGGACATCCG GGACAGCTGG GACTCGGTGC TGGAGATCGC CAAGGCGAAC
ATGGCGCTGG CCGAGCACGC CGGGCCGAAC CGGTGGAACG ACCCGGACAT GCTGGAGGTC
GGCAACGGCG GCCTGACCTG GGAGGAGTGC CGCACGCACT TCAGCCTGTG GGCGATGATG
GCCGCTCCCC TGCTGATCGG CGTGGACCTG CGGTCGGTGG CGCCCGAGGC GGTGGAGATC
CTGACCAACC GCGAGGTGAT CGCGCTCGAC CAGGACCCGC TCGGCGAGCA GGCGCGGGTG
GTGCGCTCCG AGGACGGGCT GCACGTGCTG GTGAAGCGGC TGCAGGACGG CGGGCGCGCG
GTGGCGCTGT TCAACGAGAA CGACGTGCCC GCGCGGATCT CGACCAGCGC CGCCGAGGCC
GGGCTCCCCC GGTCGACCGG GTACCGGCTG CGGGACGTGT GGGCGCGGAC CGACGCGCAC
TCGGCGGGCG ACGTCACGGC GTGGGTGCCG CCGCACGGCG CGGTCGTCTA CCGGGTGACG
CCCGAACCGG CGTGGCTGCT GCTGCCGCCC GCCGTGGACG CGGGGGTGGA GCCGGTGCTG
TCCCGGCCCG GCGCGCTGCC GCTGGTCGAC CCGGACGCCC CGTCGCTGGT GACCACCTCG
CTGGGCGACA ACGGCTGGCT GCCGGTGCTC GGCGCGCGGG TGGACCTGGA GGCCCCCGCC
GGGTGGCGGG TGCGGCCGAG GGGGCAGCGG GCCCGGTCGG TGCTCGCGGG CGGGGACCGG
CTGGACACCA CCTGGGAGGT GCTGCCGCCC GCCGGGCTCG AACCGGGCCG GTACCGGCTG
ACGGCGCTGT TCGCCTACCT GTACGGGTGG GGGCGGCGGG TGAGCGCGGA CCTGGAGGTG
GTGGTGCCGC ACCGCCTGCC GTCCGGCACC TCGTACCTGA GCGACGCGCC GTGGCTGCGG
GCGAGCAACG GGTTCGGGCC GGTCGAGGTC GACACCAGCA ACGGCGAGGC CGAGGCCGGG
GACGGCGGGC CGCTCACGGT CAACGGGAGG GTGTTCGAGA AGGGCCTCGG GGTGCACGCG
CCCAGCTCGG TCGAGTACTT CACCGGTGGC CGCTGCACGT CCGTGTCGGC GTTCGTGGGC
GTGGACGACG AGAAGCCCGC GGCCGGGTCG GTGGTGTTCC AGGTGTGGGC GGATGAGCGG
AAGGTCGCCG ACAGCGGGGC GCTGACCACG CGGGACGACG CGGTCGAGCT GGTCGCGGAC
GTGACCGGGG CGCGGACCGT GCGGCTGGTG GTGACCGACG CGGGCAACGG CGTCGACAGC
GACCACGGTG ACTGGTGCGA CCTGAAGGCC ACCTGCGAGT GA
 
Protein sequence
MDESTNAAGD APVDQDGTTG DGAVSGGTAG GSSADDSAAG SGAAGSGAAE TGAPQEIRPP 
ALAPTPPMGF NNWNSTQCGP EFTDSMIRGI ADLFLSLGLK DAGYEYVNID DCWALPQRDA
DGDLVPDPVR FPEGMKPLVD YVHSKGLKFG IYTSAGTRTC SERGFPGALG HERQDAALFA
SWGVDYLKYD NCHNQGVDAR LRYRAMRDAI AATGRPIVLS VCEWGENRPW EWAFEVGQLW
RTTPDIRDSW DSVLEIAKAN MALAEHAGPN RWNDPDMLEV GNGGLTWEEC RTHFSLWAMM
AAPLLIGVDL RSVAPEAVEI LTNREVIALD QDPLGEQARV VRSEDGLHVL VKRLQDGGRA
VALFNENDVP ARISTSAAEA GLPRSTGYRL RDVWARTDAH SAGDVTAWVP PHGAVVYRVT
PEPAWLLLPP AVDAGVEPVL SRPGALPLVD PDAPSLVTTS LGDNGWLPVL GARVDLEAPA
GWRVRPRGQR ARSVLAGGDR LDTTWEVLPP AGLEPGRYRL TALFAYLYGW GRRVSADLEV
VVPHRLPSGT SYLSDAPWLR ASNGFGPVEV DTSNGEAEAG DGGPLTVNGR VFEKGLGVHA
PSSVEYFTGG RCTSVSAFVG VDDEKPAAGS VVFQVWADER KVADSGALTT RDDAVELVAD
VTGARTVRLV VTDAGNGVDS DHGDWCDLKA TCE