Gene Amir_1831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_1831 
Symbol 
ID8326016 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp2014885 
End bp2016009 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content72% 
IMG OID644942380 
Productcellulose-binding family II 
Protein accessionYP_003099625 
Protein GI256375965 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.783622 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGGAC CCGATGCGGT CAAGCCCGGA GTGGCGCGCA GGGCCGCCCT GGCCGCCGTG 
CTCTCCGCGC TGACCCTGCT GTGCGGCTTC CTGATCACGG TTCCCACCTC GCAGGCCGCC
GCCGCCGCGC CCGTGCGCAT CATGCCGCTC GGCGACTCCA TCACCGGCTC GCCCGGCTGC
TGGCGCGCGC TGCTCGACGT CGACCTCAAG GCCGCCGGCT ACACGAACAT CGACTTCGTG
GGCACGCTCC CGTCGCAGGG GTGCGGCATC CCCCACGACG GCGACAACGA GGGCCACGGC
GGGCTCCTGG TCACCAACGT GGCCGGACAG AACCAGCTCG TGCCGTGGTT GCAGGCCACC
AGCCCCGACG TCGTGGTGAT GCACTTCGGC ACCAACGACG TGTGGAGCAA CATCGCGCCC
GCGACGATCC TGGCGGCGTA CGGCAAGCTC GTCGACCAGA TGCGGGCGCA GAACCCGGCG
ATGCGCATCC TGATCGCCAA GCTCATCCCG ATGGGCACCC CGCAGTGCGC CGACTGCGGG
CAGCGCGTGG TGGCGTTCAA CGCGGCGATC CAGCCGTGGG CGACGTCGAA GACCACGGCC
GCCTCGCCGA TCGTCGTGGT GGACCAGTGG ACCGGGTTCG ACACCGCCAC CGACACCTAC
GACGGCGTGC ACCCGAACGC CTCGGGCGAC CGGAAGATCG CGAACCGGTG GTTCCCGGTG
CTGCGGGACG CGCTCGGCGG CGTCCTGCCG ACGACGACGA CGACGGGTGG CACGACGACC
ACGACCGGTG GCACCACCAC GACCACCATC CCGCCGCCCA CCGGCAGCTG CGCCGCGTCG
GTGACCGTGG TGAACCGGTG GCAGGGCGGG TTCCAGGCCG ACGTGGTCTT CCGCAACACC
AGCACCGTCG CGTCCACGGC GTGGAGCGTG CGGTTCTCGC TGGGCGGCGG GATGAGCGTC
GCGCAGGCGT GGAACGGGAC GGCGACCACC AGCGGTTCGG TGGCGACCGT CGCGAACGCG
GCGTGGAACG GCTCGGTGGC GCCCGCGGGT TCGGCGACGG CCGGGATCAT CGTCAACGGC
GACCCGACGT CCTGGACGCC GTCGGCGAGC TGCACGAAGA GCTGA
 
Protein sequence
MTGPDAVKPG VARRAALAAV LSALTLLCGF LITVPTSQAA AAAPVRIMPL GDSITGSPGC 
WRALLDVDLK AAGYTNIDFV GTLPSQGCGI PHDGDNEGHG GLLVTNVAGQ NQLVPWLQAT
SPDVVVMHFG TNDVWSNIAP ATILAAYGKL VDQMRAQNPA MRILIAKLIP MGTPQCADCG
QRVVAFNAAI QPWATSKTTA ASPIVVVDQW TGFDTATDTY DGVHPNASGD RKIANRWFPV
LRDALGGVLP TTTTTGGTTT TTGGTTTTTI PPPTGSCAAS VTVVNRWQGG FQADVVFRNT
STVASTAWSV RFSLGGGMSV AQAWNGTATT SGSVATVANA AWNGSVAPAG SATAGIIVNG
DPTSWTPSAS CTKS