Gene Amir_4152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_4152 
Symbol 
ID8328345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp4885000 
End bp4886349 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content69% 
IMG OID644944616 
Productcellulose-binding family II 
Protein accessionYP_003101853 
Protein GI256378193 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAGGG TATTCGCCTT GCTCGGCGCC ACGCTGCTGG CCATCGCCGG CCTCGTGGTC 
GTCGCCCAAC CGCCGCAGGC GCAAGCCGCC GTCGGCCTGC ACGTCAGCGG CACGAAGATC
GTGGAGGCCA ACGGGCAGCC GTTCGTCATG CGCGGGGTCA ACCACCCGCA CGTCTGGTAC
ACCGGCCGCA CCAGCTCGTT CGCCGACGTC AAGGCGCTCG GCTCGAACAC GGTGCGCGTG
GTGCTGGGCA GCGGCAAGCG GTGGGGGCCG TCCAGCGACA CCGCCGCGGT CATCGCGCTG
TGCAAGCAGA ACAAGCTGAT CTGCGTGCTG GAGGTGCACG ACACCACCGG GTACGGAGAG
GAGGCCGCGG CCGCCTCGCT CGACGAGGCG GTGGACTACT GGATCTCCCA GAAGTCCGCG
CTGGTCGGCC AGGAGGACTA CGTCGTCATC AACATCGGCA ACGAGCCCAT CGGCAACACC
AACGCCGCCC AGTGGACCGA CGCGACCGTC AACGCGGTCA AGAAGATGCG GACCAACGGG
TTCCAGCACC TGCTGATGGT CGACGGCCCG AACTGGGGCC AGGACTGGCA GTACGTCATG
CGCGACAACG CGCAGCGCGT CCTGGACGCC GACACCCAGC GCAACACCGT GCTGTCGATC
CACATGTACG CGGTGTTCAG CACGCCCGCC AGCATCATCG ACTACCTGGA CCGCTTCCAG
GCCAACGGGT GGCCGCTGGT GATCGGCGAG TTCGGCTGGA AGTTCGCCTC CGGCGAGGTC
GACCACGAGA CCATCCTCGC CCAGGCGCAG GCCAGGGGGA TGGGCTACCT GGGCTGGTCG
TGGAGCGGCA ACACCGATCC GATCCTGGAC ATGGCCACCG ACTTCGACCC GGCCAGGCTG
ACCACGTGGG GGCAGCGCAT CTTCAACGGC GCCAACGGGA TCAAGGCCAC CTCGCGCGAG
GCGTCGATCT ACGGCGGCGG CAACCAGCCC ACCAGCACGA CGACGACGAC CACCACCACG
ACGACCACCA CGACCACGGT CCCGCCCACC GGCTCGTGCA CCGCGAGCTA CACCACCACC
GGCCAGTGGC AGGGCGGGTT CCAGGCGGAG GTGCGGGTCA CGGCGGGGTC CAAGGCGATC
AAGTCGTGGA CGGTGACCTG GACGTTCGCG GACGGCCAGA CCGTGAGCAA CGCGTGGAAC
GCCGACGTCA CCTCGTCCGG CGCGACCGTG ACCGCGCGCA ACGCGTCGTA CAACGGGTCG
CTGGGGGCGG GCGCGAGCAC CGCGTTCGGC TTCCTCGGCA CGACGCGCGG CGCGAACACC
GCGCCGACGC TGAGCTGCGC GGCGTCCTAG
 
Protein sequence
MTRVFALLGA TLLAIAGLVV VAQPPQAQAA VGLHVSGTKI VEANGQPFVM RGVNHPHVWY 
TGRTSSFADV KALGSNTVRV VLGSGKRWGP SSDTAAVIAL CKQNKLICVL EVHDTTGYGE
EAAAASLDEA VDYWISQKSA LVGQEDYVVI NIGNEPIGNT NAAQWTDATV NAVKKMRTNG
FQHLLMVDGP NWGQDWQYVM RDNAQRVLDA DTQRNTVLSI HMYAVFSTPA SIIDYLDRFQ
ANGWPLVIGE FGWKFASGEV DHETILAQAQ ARGMGYLGWS WSGNTDPILD MATDFDPARL
TTWGQRIFNG ANGIKATSRE ASIYGGGNQP TSTTTTTTTT TTTTTTVPPT GSCTASYTTT
GQWQGGFQAE VRVTAGSKAI KSWTVTWTFA DGQTVSNAWN ADVTSSGATV TARNASYNGS
LGAGASTAFG FLGTTRGANT APTLSCAAS