Gene Amir_6238 details


Gene Information

Locus tag: Amir_6238
Symbol:
ID: 8330449
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 7316946
End bp: 7318604
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 70%
IMG OID: 644946669
Product: cellulose-binding family II
Protein accession: YP_003103888
Protein GI: 256380228
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A)
TIGRFAM ID:
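
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the listed gene length agrees with that reading) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(7316946, 7318604)
print(length)                              # 1659
print(expected_protein_length_aa(length))  # 552
```

Both results match the Gene Length (1659 bp) and Protein Length (552 aa) fields of this record.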


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGACCA GAACGGGCGT GCTGCTCGCC GTCGCGGCTC TCGCCGCAGG TTCCTACGCG 
TTGATCCCGT CCGCGAGCGC GGCCACGAAC CTCACGGCCA CCTTCGCCAA GACCCAGGAC
TGGGGCAGCG GCTTCGAGGC GAAGTTCACC GTCGCCAACG GCGGGTCGTC GGCCTCGAAC
AACTGGAAGA TCGAGTTCGA CCTGCCCTCC GGCACCACGG TCGGCTCCTT CTGGGACGCG
CAGGTCACCC GGAACGGCGA CCGCTACACG GCGACCAACC GGGACTGGAA CGCGGCGGTC
GGCGCGGGCT CCTCGGTGGC GTTCGGGTTC ATCGGCGCGG GCGGCGGCGC CCCCACGAAC
TGCACGATCA ACGGCGCCCC CTGCACCGGC ACCGGCACCG GCAACCCCGG CGACACGGCC
GCGCCGAGCG TCCCCGGCGG CCTGAAGGCC ACCGCCACCA CGGCCGACTC GGTCACGCTG
GCCTGGAACG CGTCGGCCGA CAACGTCGGC GTGGTCGCGT ACGACGTGTA CAAGGGCGGC
GACAAGGCCA CCACCGTCGC GAGCCCCACC GCGATCGTGT CCGGCCTGAC CGCCGACACC
TCGTACCAGT TCAGCGTCGT GGCGCGCGAC GCGGCGGGCA ACGCCTCGGC GAAGAGCCCG
GCGCTGACCG CGAAGACCGC GAAGAAGGCG GGCACCACCC CGGAGCCCTC GCCCGAGCCG
TCGCCGAACC CCAACCCCAC CCCGCAGCCG AGCCCGGACC CCACCCCGGA CCCGCAGCCG
TCCCCGGCGG GCGGGCGCGG CGCCCCCTAC CTGTTCCTGG GCTGGGGCAA CCCGCAGTCC
GCGACCGCGG TGATGCAGCA GACCGGCGTC AAGTGGTTCA CGATGGCGTT CATCCTGTCC
TCGGGCGGCT GCACCCCCTC GTGGGACGGC ACCCGACCGC TGACCGGCAG CGTGGACGAG
ACCACGATCA AGGCGATCCG CGCGGCGGGT GGTGACATCG TGCCGTCGTT CGGCGGCTGG
AGCGGCAACA AGCTCGGCCC GAACTGCTCG ACCCCCGAGG CCCTGGCGGG CGCGTACCAG
AAGGTCATCG ACGCCTACCA GCTCAAGGCG ATCGACATCG ACATCGAGAA CTCCGACGAG
TTCGAGAACG AGGTCGTGCA GGACCGCGTG CTGTCCGCGC TGAAGATCGT CAAGCAGAAG
AACCCGAACG TGCAGACCAT CGTCACGTTC GGCACCGGCA CCACCGGCCC GAACTTCTGG
GGCAACCGCC TCATCGAGCG GGCGGGCGCG CTGGACGCCA AGATCGACGT CTTCACGATC
ATGCCGTTCG ACTTCGGCAG CTCCAACATC GCGACCGACA CCATCAGCGC GGCCACCGGG
CTGAAGAACA AGGTGAAGTC GACCTTCGGG TACAGCGACG CCGACGCCTA CAAGCACATC
GGCATCTCGG GCATGAACGG CCTGTCCGAC CAGAAGGAGC TGACCACCGC CGCGGACTGG
ACCAAGATCC GCGACTGGTC GAAGAACAAC GGCCTCGGCC GCCTCGCGTT CTGGGCGGTC
AACCGGGACC GCGGCGGCTG CGACGGCCAG GTGTCGGCCA GCTGCTCAGG CATCTCGCAG
GCCGACCTGG AGTTCACCCG CATCACCGCG GGCTTCTGA
 
Protein sequence
MKTRTGVLLA VAALAAGSYA LIPSASAATN LTATFAKTQD WGSGFEAKFT VANGGSSASN 
NWKIEFDLPS GTTVGSFWDA QVTRNGDRYT ATNRDWNAAV GAGSSVAFGF IGAGGGAPTN
CTINGAPCTG TGTGNPGDTA APSVPGGLKA TATTADSVTL AWNASADNVG VVAYDVYKGG
DKATTVASPT AIVSGLTADT SYQFSVVARD AAGNASAKSP ALTAKTAKKA GTTPEPSPEP
SPNPNPTPQP SPDPTPDPQP SPAGGRGAPY LFLGWGNPQS ATAVMQQTGV KWFTMAFILS
SGGCTPSWDG TRPLTGSVDE TTIKAIRAAG GDIVPSFGGW SGNKLGPNCS TPEALAGAYQ
KVIDAYQLKA IDIDIENSDE FENEVVQDRV LSALKIVKQK NPNVQTIVTF GTGTTGPNFW
GNRLIERAGA LDAKIDVFTI MPFDFGSSNI ATDTISAATG LKNKVKSTFG YSDADAYKHI
GISGMNGLSD QKELTTAADW TKIRDWSKNN GLGRLAFWAV NRDRGGCDGQ VSASCSGISQ
ADLEFTRITA GF
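
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon table from the conventional TCAG ordering; translation table 11 (bacterial, archaeal and plant plastid code) assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, which does not matter here because this gene begins with ATG. The `translate` helper is a hypothetical illustration, not the tool used to produce this record.

```python
from itertools import product

# Codons in TCAG order (third base varies fastest) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAAGACCA GAACGGGCGT GCTGCTCGCC"))  # MKTRTGVLLA
# Last nine bases of the gene sequence above, including the TGA stop codon.
print(translate("GGCTTCTGA"))                         # GF
```

The two outputs correspond to the first ten and the last two residues of the protein sequence listed above.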