Gene Amir_3216 details


Gene Information

Locus tag: Amir_3216
Symbol:
ID: 8327406
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 3748226
End bp: 3750175
Gene Length: 1950 bp
Protein Length: 649 aa
Translation table: 11
GC content: 72%
IMG OID: 644943730
Product: glycoside hydrolase family 5
Protein accession: YP_003100970
Protein GI: 256377310
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
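
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3748226, 3750175)
print(length, "bp")                    # compare with "Gene Length"
print(protein_length(length), "aa")    # compare with "Protein Length"
```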


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCTGTCAC CCACCCACGG GCGCGGACGG CGGTGGCTGG CCGCGCTCGC GGCCGTCCCG 
CTGGTCGCCG CCACCCTCGC GGCCTCCGCG CTCGCCCCGC CCGCCGCCTC CGCCGCACCG
CCCGCCAAGG ACTGGCTGCA CGTCCAGGGC AACCGGATCG TCGACGCGGC GGGCAACCGC
GTCCAGCTCA CCGGAGCCAA CTGGTTCGGC TTCAACGCCA CCGAGCGCGT CTTCCACGGC
CTGTGGTCGG CCAACATCAC CGAGATCACC AAGGCGATGG CCGATCGTGG CGTCAACCTG
GTGCGCGTGC CCGTGTCCAC CCAGCTGCTG CAGGAGTGGA AGGAGGGCAG GACCGTCGCC
AAGCCGAACA TCAACGACTA CGCGAACCCC GAGCTGGCGG GGATGAACAA CCTCCAGATC
TTCGACTTCT GGCTGCGGCT GTGCGAGCGG TTCGGGCTCA AGGTGCTGCT GGACGTGCAC
AGCGCCGAGG CCGACAACTC CGGCCACGTG CACCCCGTCT GGTACAAGGG CTCGGTCACG
CCGGAGGTGT TCTACTCCAC CTGGGAGTGG GTCACCGCGC GGTACGAGGA CAACGACACG
ATCATCGCCG TCGACGTCAA GAACGAGCCC CACGGCGGTC CCGGCGACTC GCCGCGCGCC
AAGTGGGACG GCTCGACCGA CGTCGACAAC TGGAAGCACA CCTGCGAGAC CGCGGGCCGC
CGCATCCTCG CGATCAACCC CGAACTGCTG GTGCTGTGCG AGGGCAACGA GGTCTACCCG
CGCCCCGGCA AGGGCTGGGA CGCGCCGGAC ACCAACCCCG ACCGCACGCC CAACTACTTC
CACACCTGGT GGGGCGGCAA CCTGCGCGGC GTCGCCGAGC ACCCGGTGAA CCTGGGCGCG
AACCAGGACC AGCTCGTCTA CTCCCCGCAC GACTACGGCC CCCTGGTGTT CAACCAGCCG
TGGTTCGACA AGCCGTTCAC CAAGGAGTCG CTGATCACCG ACGTGTGGCG GCCCAACTGG
CTGTACCTGC ACGAGCAGGG CGCCGCGCCG CTGCTGATCG GCGAGTGGGG CGGGCGGCTC
GGCGAGGACG AGCGGCAGGA CCGGTGGATG GCCGCGCTGC GCGACCTGAT CGTGGAGGAG
GGGCTGCACC AGACGTTCTG GGCGCTCAAC CCGAACTCCG GCGACACCGG CGGGCTGCTG
CTCGACGACT GGAAGACCTG GGACGCGGCC AAGTACGCGC TGCTCAAGCC CGCGCTGTGG
CAGCACGCAG GCAAGTTCGT GAGCCTGGAC CACCAGGTGC CGCTCGGCGG CGAGGGCTCC
ACCACCGGCA TCAGCCTCGC CCAGCGCTAC GGCGACGGCG GCGGCCCCGG CGACACCGCG
GCCCCGACCG CGCCGACCGG GCTGGCGATC GGCACGACCA CCGCCTCGTC GGTCGCGCTG
AGCTGGCAGG CCGCGACCGA CGACGTCGGC GTCACCGGGT ACGACGTGTA CCGGGGCGGC
GCGAAGGTCG GCACGAGCGC CACGACCTCC TACGTGGACA CCGGGCTCAG CGGCGGCACC
ACCTACAGCT ACTCGGTGCG GGCCAGGGAC GCGGCGGGCA ACACCTCGGC GGCCTCGGCC
TCCCGCAGCG CGACGACCCC GCCGGGCGGT GGTGGCGGTG ACGCCGGGTG CGCGGCGGTG
CTGCGCGTGG TCAACAGCTG GCAGGGCGGC TACCAGGGGG AGGTCACCGT GACCAACTCC
GGGACGGCGG CCACGCGGGG CTGGAAGGTC GCGCTGACGA CCGCGTCCGG CACCACGATC
TCCAGCGTGT GGAACGGGAC CTACGCCTCG GGGACGGTCG TGAACGCCGC GCACAACGGC
GCGCTGGCCC CGGCGGGCAG CACCACGTTC GGGCTGACCG GGACGGGGCA GGCGACCGGG
GTGACGATCA CCGGCTGCAC GGCTTCCTGA
 
Protein sequence
MLSPTHGRGR RWLAALAAVP LVAATLAASA LAPPAASAAP PAKDWLHVQG NRIVDAAGNR 
VQLTGANWFG FNATERVFHG LWSANITEIT KAMADRGVNL VRVPVSTQLL QEWKEGRTVA
KPNINDYANP ELAGMNNLQI FDFWLRLCER FGLKVLLDVH SAEADNSGHV HPVWYKGSVT
PEVFYSTWEW VTARYEDNDT IIAVDVKNEP HGGPGDSPRA KWDGSTDVDN WKHTCETAGR
RILAINPELL VLCEGNEVYP RPGKGWDAPD TNPDRTPNYF HTWWGGNLRG VAEHPVNLGA
NQDQLVYSPH DYGPLVFNQP WFDKPFTKES LITDVWRPNW LYLHEQGAAP LLIGEWGGRL
GEDERQDRWM AALRDLIVEE GLHQTFWALN PNSGDTGGLL LDDWKTWDAA KYALLKPALW
QHAGKFVSLD HQVPLGGEGS TTGISLAQRY GDGGGPGDTA APTAPTGLAI GTTTASSVAL
SWQAATDDVG VTGYDVYRGG AKVGTSATTS YVDTGLSGGT TYSYSVRARD AAGNTSAASA
SRSATTPPGG GGGDAGCAAV LRVVNSWQGG YQGEVTVTNS GTAATRGWKV ALTTASGTTI
SSVWNGTYAS GTVVNAAHNG ALAPAGSTTF GLTGTGQATG VTITGCTAS
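
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch. It assumes translation table 11 shares the standard amino-acid assignments and differs only in its set of permitted start codons, so that an initiating GTG is read as M, as in the record above. The helper names (`clean`, `translate_cds`, `gc_fraction`) are hypothetical; whitespace is stripped so that sequences grouped in 10-base blocks can be pasted in directly.

```python
# Amino-acid assignments in NCBI "TCAG" codon order. Translation table 11
# is assumed here to use the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)
# Start codons permitted by table 11 (assumption based on the NCBI listing).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a complete CDS; the initiator codon is always read as M."""
    seq = clean(seq)
    if not seq or len(seq) % 3 != 0:
        raise ValueError("CDS must be non-empty and a multiple of 3 bases")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    residues = ["M"]
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First seven codons of the gene sequence above.
print(translate_cds("GTGCTGTCAC CCACCCACGG G"))
```

Applied to the full gene sequence, `translate_cds` can be compared against the listed protein sequence, and `gc_fraction` against the reported GC content.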