Gene Amir_2166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_2166 
Symbol 
ID8326355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp2389972 
End bp2391369 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content73% 
IMG OID644942716 
ProductCellulase 
Protein accessionYP_003099957 
Protein GI256376297 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGATCAA CGACCACGGC GCGCGGCAAC GGCGCCACGC CGTCGAAGTC CCGTCGCGGG 
AAGGTGGCAG CGGGAGCGCT GTCCAGCGCG CTGGTCGCCG CGGCCACCGC CATCGCCACG
GGGACGGCTT CCCCCGCGGC GGTGGCGGCG GACTCGGAGT TCTACTCCGA CCCGGCGACC
AGCGCGGCCA GGTGGGTGGC GGCGAACCCG AACGACAGCA GGGCCGCCGT CATCCGCGAC
CGCGTCGCCT CGGTACCCCA GGCGAAGTGG TTCACCACGA CCAACACGTC CACGATCCGC
GCCGAGGTGG ACGCCCACAC CTCGGCGGCG GCGTCGGCGG GCAAGACCCC GATCCTGGTG
GTCTACAACA TCCCCAACCG CGACTGCGGC GGCGCGAGCG GCGGCGGCGC GCCCTCGCAC
GGCGCCTACC GGCAGTGGGT CGACCAGTTC GCGGCGGGAC TCGCGGGCCG CCCGGCCGCG
ATCATCCTCG AACCCGACGT CCTCCCGATC ATGAGCACCT GCCAGAGCGC GTCCCAGCAG
GCCGAGACCC GCGCGTCGAT GGCCTACGCG GGCAAAGCGC TCAAGGCCGC GTCGAGCCAG
GCGAAGGTGT ACTTCGACAT CGGCCACTCG GCCTGGCTGA CCCCGGCCGA GGCGGCGAAC
CGCCTGCGCG CGGCGGAGGT CTCCACCAGC GCGGACGGCA TCGCGACCAA CGTCTCCAAC
TACCGCCGCA CCGCCGACGA GGTGGCGTTC GCCAAGGCCA CCCTGAACGC GCTCGGCGAC
GGCAGGCTCA AGGCCGTCGT CGACACCAGC CGCAACGGCA ACGGACCGCT CGGCAGCGAG
TGGTGCGACC CGCCCGGCCG CGCGATCGGC ACGCCCAGCA CCAGGAACAC CGGCGACCCG
CAGATCGACG CCTTCCTGTG GGTGAAGATC CCCGGCGAGG CGGACGGCTG CATCGCGGGC
GCGGGCCAGT TCGTGCCGCA GCGCGCGTAC GACATGGCGG TGGCCGCAGG TCCCGCCCCG
ACGACGACAA CGACGACCAC CACGACCACG CGCGTCACCA CGACCACCAC CACGCCCCCG
CCGAACGGCG CGGCCTGCGT GGTGCGGCAC CGGGTGGTCA GCTCGTGGTC GGGCGGCCAC
ACCGGCGAGG TGGTGATCGA GAACCGGGGT CCGGCGCTCC AGAACTGGAC CTTGGAGTTC
TCCGCCCCCG GCGTGGCCGT CTCCCAGGGC TGGAACGGGA CGTGGACGGA CCTGGGCGAC
ACCGTCCGGG TCACGAGCGC GTCCTGGAAC GGCGGGATCG CCACCGGTGG AACCGCGACC
ACCGGCTACT CGGCGAGCTT CAGCGGCGGC ACGCCCCCGT TCACGTCTCC CGTGCTGAAC
GGAACGGCCT GCGCCTGA
 
Protein sequence
MRSTTTARGN GATPSKSRRG KVAAGALSSA LVAAATAIAT GTASPAAVAA DSEFYSDPAT 
SAARWVAANP NDSRAAVIRD RVASVPQAKW FTTTNTSTIR AEVDAHTSAA ASAGKTPILV
VYNIPNRDCG GASGGGAPSH GAYRQWVDQF AAGLAGRPAA IILEPDVLPI MSTCQSASQQ
AETRASMAYA GKALKAASSQ AKVYFDIGHS AWLTPAEAAN RLRAAEVSTS ADGIATNVSN
YRRTADEVAF AKATLNALGD GRLKAVVDTS RNGNGPLGSE WCDPPGRAIG TPSTRNTGDP
QIDAFLWVKI PGEADGCIAG AGQFVPQRAY DMAVAAGPAP TTTTTTTTTT RVTTTTTTPP
PNGAACVVRH RVVSSWSGGH TGEVVIENRG PALQNWTLEF SAPGVAVSQG WNGTWTDLGD
TVRVTSASWN GGIATGGTAT TGYSASFSGG TPPFTSPVLN GTACA