Gene Amir_1868 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_1868 
Symbol 
ID8326053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp2062832 
End bp2065066 
Gene Length2235 bp 
Protein Length744 aa 
Translation table11 
GC content72% 
IMG OID644942417 
Productcellulose-binding family II 
Protein accessionYP_003099662 
Protein GI256376002 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTCACCA ACTCGTTCCG CGCGCGGTTG GGCGCGGCCG TCGCCGCCGC CACCGCGCTG 
AGCGGCGGAC TGGTCGTCGT GGGCACGGCT TCGGCCGCCC CAGCCTGCAA GGTCGACTAC
ACCGTGCAGA ACCAGTGGCA GGGCGGCTTC TCCGCCAACG TCACCGTCAC CAACCTCGGC
GACCCCGTCG ACGGGTGGTC CCTCGCCTGG ACCGCCGCCT CCGGCGAGAA GGTCGACCAG
GGGTGGAACG CGGCGTTCTC CTCCTCGGGC ACGTCGATCA CCGCGTCCAA CGTGGACTGG
AACAGGGCCA TCCCCACCGG TGGCTCGGTC TCGTTCGGGT TCAACGGTTC CCGGACCGGC
GAGCCGGTCG TCCCGTCGAG CTTCGCGCTC AACGGCACCA CGTGCACCGG CGGCGTCGCG
CCCACCACGA CCACCACCCC CGGCGGCCCC ACCACCACGA CCGGGCCGAC CACCACGACC
ACCGGCGGCG GCACCGCGCC GCTGCCCGAC CCGACCGGGC GCAAGCAGGT CGAGCGGCTC
GACCGCGGCC TGATCAGCGT CCGCTCCGGC AGCGGCAACC TGGTCAGCTG GCGGCTCCTG
GGCGGCGACC CGCAGAACGT GGCGTTCACC GTCTACCGGG GCGGCGCCAA GCTCACCGCC
TCGCCGATCA CCGGGTCCAC GAACTACCTG GACAGCGGCG CGCCGGCCGA CGCCTCCTAC
ACGGTCCGCG CGGTGGTCAA CGGCGTCGAG CAGCCCGCGT CCCCGGCCTC GCTGGCCTTC
GGCAACGGCT ACCGGGACGT GCCGATCCAG CCCCCGTCCG GCTCCTACGC CGCGAACGAC
GCGAGCGTCG GCGACCTGGA CGGCGACGGG GTCTACGAGT TCGTCCTCAA GTGGGAGCCC
AACAACGCCA AGGACAACTC CCAGTCCGGC GTCACCGACG TGGTCTACGT CGACGCCTAC
AAGCTCAACG GCGCCAGGCT GTGGCGCGTC AACCTGGGCC GCAACATCCG GGCGGGCGCG
CACTACACCC AGTTCCAGGT CTACGACTAC GACGGCGACG GCAAGGCCGA GGTGGCCATG
AAGACCGCCG ACGCCACCGT GGACGGGCGC GGCACGGTGA TCGGCAACGC CTCCGCCGAC
CACCGCAACT CCTCCGGCTA CGTGCTGGCG GGCCCCGAGT ACCTGACCAT GTTCAACGGC
CAGACCGGCG CCGCGATGTC CACGGTCGAC TACGACCCGC CGCGCGGCAC CGTCGCCGAC
TGGGGCGATA ACTACGGCAA CCGGGTCGAC CGGTTCCTCG CCGGGACCGC CTACCTGGAC
GGCGCACGCC CCTCGCTGAT CATGGCTCGT GGGTACTACA CCCGCGCGGT GATCGCGGCC
TGGGACTTCC GCAACGGGGC GCTCACCAAG CGCTGGGTCT TCGACTCCAA GGCCTCCGGC
AACAGCGGCT ACGCGGGGCA GGGCAACCAC TCGCTCACCA TCGGCGACGT CGACGCGGAC
GGCCGGGACG AGATCGTCTA CGGCGCGGCG GCCGTCGACG ACAGCGGCAA GGGCCTGTGG
AACACCGGGC TCGGGCACGG GGACGCCGCG CACCTGGGCG ACCTCGACCC GTCGCGGGCG
GGCCTGGAGT ACTTCAAGGT CGACGAGGAC ACCTCCAAGC CCGGCTCGTT CTTCGCCGAC
GCCCGCACCG GTTCCCGCGT GTGGACCACG GCGTCCGGCG GCGACAACGG GCGCGGCGTC
GCGGGCGACA TCTGGGCGGG CAGCCCCGGA GCCGAGGCGT GGTCGGCGCG GGACGCGAAC
CTGCGCAGCG CCAAGGGCGC GGAGATCGGG CGCAAGCCCG GCTCCATCAA CTTCCTGGCC
TGGTGGGACG GCGACCCGGT GCGGGAGCTG GTGGACCAGA CCCGGATCGA CAAGTACGGC
ACCGGCGGCG ACACCCGCCT GCTGACCGCC TCCGACGTGC ACAGCAACAA CGGCACCAAG
GCCACCCCGT CGCTGTCCGG CGACCTGTTC GGCGACTGGC GCGAGGAGGT GGTGTGGCCC
ACCACGAACA ACACCGCGCT GCGGATCCAC TCCACCCCGC ACCAGACCGA CCGGCGCATC
CCGACCCTGA TGCACGACAC CATGTACCGG GTCGCGATCG CCTGGCAGAA CACGGCCTAC
AACCAGCCGC CGCACCCGAG CTTCTTCATC GGCGACGGCA TGGCCACCCC GCCCTGGCCG
GACGTCCACT ACTGA
 
Protein sequence
MLTNSFRARL GAAVAAATAL SGGLVVVGTA SAAPACKVDY TVQNQWQGGF SANVTVTNLG 
DPVDGWSLAW TAASGEKVDQ GWNAAFSSSG TSITASNVDW NRAIPTGGSV SFGFNGSRTG
EPVVPSSFAL NGTTCTGGVA PTTTTTPGGP TTTTGPTTTT TGGGTAPLPD PTGRKQVERL
DRGLISVRSG SGNLVSWRLL GGDPQNVAFT VYRGGAKLTA SPITGSTNYL DSGAPADASY
TVRAVVNGVE QPASPASLAF GNGYRDVPIQ PPSGSYAAND ASVGDLDGDG VYEFVLKWEP
NNAKDNSQSG VTDVVYVDAY KLNGARLWRV NLGRNIRAGA HYTQFQVYDY DGDGKAEVAM
KTADATVDGR GTVIGNASAD HRNSSGYVLA GPEYLTMFNG QTGAAMSTVD YDPPRGTVAD
WGDNYGNRVD RFLAGTAYLD GARPSLIMAR GYYTRAVIAA WDFRNGALTK RWVFDSKASG
NSGYAGQGNH SLTIGDVDAD GRDEIVYGAA AVDDSGKGLW NTGLGHGDAA HLGDLDPSRA
GLEYFKVDED TSKPGSFFAD ARTGSRVWTT ASGGDNGRGV AGDIWAGSPG AEAWSARDAN
LRSAKGAEIG RKPGSINFLA WWDGDPVREL VDQTRIDKYG TGGDTRLLTA SDVHSNNGTK
ATPSLSGDLF GDWREEVVWP TTNNTALRIH STPHQTDRRI PTLMHDTMYR VAIAWQNTAY
NQPPHPSFFI GDGMATPPWP DVHY