Gene Amir_2191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_2191 
Symbol 
ID8326380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp2422379 
End bp2424196 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content69% 
IMG OID644942741 
Productglycoside hydrolase family 6 
Protein accessionYP_003099982 
Protein GI256376322 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCTGC GCACTTTTGC CCGGTCACGG CGTTTCGCCG GGACCGCGGC AGCGCTCTCG 
GTGACCGCTC TGGCCGCGAC CGCGGTCGTG GTGGGCGGCC CGAGCGCCAA CGCCGCACCG
GGGTGCCGGG TGAACTACAG CGTCACGAAC CAGTGGTCCG ACGGCTTCGG CGCCACGGTC
ACGGTCACCA ACCTCGGTGA TGCCATCACC GGCGGTTGGA CCCTGGAGTG GGACTTCGCC
GCAGGTCAGC GAGTCGGCCA GGGGTGGAAC GGCACGTTCG CCCAGTCGGG CGCGAAGGTG
ACCGTCACCA ACCCGACCTG GAGCCCCGGC CTCGGCAGCA ACGCGTCGGT GTCCCCCGGC
TTCAACGGCA CCTGGAGCGG CAGCAACCCC GTGCCGACCC AGTTCAAGCT GAACGGCACC
GTCTGCACCG GCTCGGTCGG CCCGACGTCC ACCACGACGA CGACCACGAC CACGACCACC
GGTGGCAACA CGACGACCAC GACCTCCGGG AACAACCAGC CCGGCACCAG GGTCGACAAC
CCGTACGTCG GCGCAGGCGT CTACGTCAAC CCGCAGTGGT CGGCCCGCGC CGCCGCCGAG
CCGGGCGGGT CGCGCATCGC GAACCAGCCC ACCGGCGTGT GGATGGACCG CATCAGCGCG
ATCGACGGCA ACGGCTCGCC CACCACGGGC AGCATGGGCC TGGTCGACCA CCTGGACGAG
GCGGTCAAGC AGGCCCGCAC CGCGCCCGGC GGCAACCTGG TCTTCCAGGT CGTCATCTAC
AACCTGCCCG GCCGCGACTG CGCCGCGCTC GCCTCGAACG GCGAGCTCGG GCCGAACGAC
CTGCCGCGCT ACAAGACCGA GTACATCGAC AAGATCGCGG GCATCCTCGC CCGGCCCGCC
TACGCGAGCC TGCGCATCGT GGCCGTGATC GAGATCGACT CGCTGCCCAA CCTGGTCACC
AACGTCTCGC CGCGGCCGAC CCAGACGCCG AACTGCGACA CGATGAAGGC CAACCAGAAC
TACCAGAACG GCGTGGCGTA CGCGGTGTCC AAGCTCGGCG ACATCGGCAA CGTCTACAAC
TACCTCGACT CCGGCCACCA CGGCTGGATC GGCTGGGGCG ACCCGATCCC CGAGTACGAC
AACTTCCACG CCTCGGCGAA GATGATGGCC TCGATCCTGG GCCGCGAGGG CGCCACCAAG
GCCGACGTGC ACGGCTTCAT CACCAACACG GCGAACTACT CCGCGCTGGA GGAGCCGTTC
TGGACGGTGG ACGACGTGGT CGGCGGCCAG GCGGTCAAGG AGAAGTCGAA GTGGGTCGAC
TGGAACGACT TCAACGGTGA GCTCGGCTTC GCCACCGCGT TCCGCCAGGA GCTCGTCGCC
AACGGCTTCG ACGCCGGCGT CGGTATGCTG ATCGACACTT CGCGCAACGG CTGGGGCGGC
TCCGGCAGGC CCACCGCCAA GTCGAGCTCG ACCGACCCGT CGGTCTACGT CGACCAGTCG
CGCATCGACA AGCGCATCCA GAAGGGCAAC TGGTGCAACC AGTCCGGCGC CGGTCTCGGT
GAGCGGCCCA AGGCCGCTCC CAAGCCGAAC ATCGACGCCT ACGTCTGGAT CAAGCCGCCG
GGCGAGTCCG ACGGCTCCAG CACCCAGATC CCGAACAACG AGGGCAAGGG CTTCGACCGG
ATGTGCGACC CGACCTACGG CGGCAACCCG CGCAACGGCA ACAACCCGTC CGGCGCGCTG
GCCAACGCCC CCATCTCGGG CCACTGGTTC TCCGCGCAGT TCCAGGAGCT CATGCGCAAC
GCCTACCCGA CGCTCTGA
 
Protein sequence
MSLRTFARSR RFAGTAAALS VTALAATAVV VGGPSANAAP GCRVNYSVTN QWSDGFGATV 
TVTNLGDAIT GGWTLEWDFA AGQRVGQGWN GTFAQSGAKV TVTNPTWSPG LGSNASVSPG
FNGTWSGSNP VPTQFKLNGT VCTGSVGPTS TTTTTTTTTT GGNTTTTTSG NNQPGTRVDN
PYVGAGVYVN PQWSARAAAE PGGSRIANQP TGVWMDRISA IDGNGSPTTG SMGLVDHLDE
AVKQARTAPG GNLVFQVVIY NLPGRDCAAL ASNGELGPND LPRYKTEYID KIAGILARPA
YASLRIVAVI EIDSLPNLVT NVSPRPTQTP NCDTMKANQN YQNGVAYAVS KLGDIGNVYN
YLDSGHHGWI GWGDPIPEYD NFHASAKMMA SILGREGATK ADVHGFITNT ANYSALEEPF
WTVDDVVGGQ AVKEKSKWVD WNDFNGELGF ATAFRQELVA NGFDAGVGML IDTSRNGWGG
SGRPTAKSSS TDPSVYVDQS RIDKRIQKGN WCNQSGAGLG ERPKAAPKPN IDAYVWIKPP
GESDGSSTQI PNNEGKGFDR MCDPTYGGNP RNGNNPSGAL ANAPISGHWF SAQFQELMRN
AYPTL