Gene Amir_3536 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_3536 
Symbol 
ID8327726 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp4109560 
End bp4110729 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content74% 
IMG OID644944034 
Productcellulose-binding family II 
Protein accessionYP_003101274 
Protein GI256377614 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0870492 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCACCC CGTTCCGGCT CGCGGCGCTC GCCGCGGCCC TGGTCGCGGT CACCGGGTTG 
GTCGTGCTCG GCGGCGGTGG AGGCGGCGGC ACCGCCGTCG CCGCGCCCGC CTGCGCCGTC
GACTACACGC CCAACCAGTG GGCCACCGGC TTCACCACCG AGGTCGCGGT CACCAACCGC
GCCGCGCCCG TCACCACCTG GACGCTGACC TGGACCTGGG CGGGCAACCA GACCGTCACC
TCGGCCTGGA ACGCCCAGGT CACCCAGACC GGCGCCAAGG TCACCGCCCG CGACGTCGGC
TGGAACGGCC AGCTCGGCAC CGGCGCGACC GCCCGCTTCG GGTTCCAGGG CGCCTACTCC
GGGACCAGCG CCGCCCCCAC CGACTTCGCC CTCAACGGCA CCCCCTGCAA CGGCGACACC
CCGCCCACCA GCACCACCAC CCCGCCGACC ACCACGCCGC CGACCACGAC CACCACCCCG
CAGCAGCCCA CCGGCTGCGG CACCGCGACC CTGTGCGACG ACTTCGAGTC CCAGACCGGG
ACCACCCCCT CCGGCAAGTG GAGCGTCGGC GCGGCCAACT GCACCGGCAC CGGCACGGTC
ACCGTCGACA GCTCCGTCGC GCGCAGCGGC TCCAAGTCGG TCCGGGTCAA CGGCGGCGTC
GGCTACTGCA ACCACATCTT CTTCGGCGCC AGCGTGAGCG GACCGGTCGT GCACGGCCGG
TTCCACGTCC GGCACACCAC CGCGCTGCCC GCCGCGCACG TGACGTTCAT GGCGCTCAAG
GACTCCGCCG ACGGCGGCAA GGACCTGCGC ATGGGCGGCC AGAACGGCGC GCTGCAGTGG
AACCGCGAGT CCGACGACGC CACCCTGCCC GCGCAGAGCC CCCAGGGCGT CGCGCAGAGC
ATCCCGCTGC CCACCGGCCG CTGGACCTGC GTGCAGTTCA CCCTCGACGG CACGAGCGGC
AGGCTCAGCA CCAGCGTCGA CGGGGTCGCC GTCCCCGGCC TGCAGGTCGA CGGCGCCCCC
ACGCCCGACA TCGACCAGCA GTGGCTCGCC CGCGCCAACT GGCGGCCCAA CACGGTGGAC
GTGCGGCTGG GCTGGGAGAG CTACGGCGAC GGCGGCGACA CCCTTTGGTA TGACGATGTC
GCCTTCGGGA GTGCACCTCT GGCCTGCTAG
 
Protein sequence
MRTPFRLAAL AAALVAVTGL VVLGGGGGGG TAVAAPACAV DYTPNQWATG FTTEVAVTNR 
AAPVTTWTLT WTWAGNQTVT SAWNAQVTQT GAKVTARDVG WNGQLGTGAT ARFGFQGAYS
GTSAAPTDFA LNGTPCNGDT PPTSTTTPPT TTPPTTTTTP QQPTGCGTAT LCDDFESQTG
TTPSGKWSVG AANCTGTGTV TVDSSVARSG SKSVRVNGGV GYCNHIFFGA SVSGPVVHGR
FHVRHTTALP AAHVTFMALK DSADGGKDLR MGGQNGALQW NRESDDATLP AQSPQGVAQS
IPLPTGRWTC VQFTLDGTSG RLSTSVDGVA VPGLQVDGAP TPDIDQQWLA RANWRPNTVD
VRLGWESYGD GGDTLWYDDV AFGSAPLAC