Gene Amir_3215 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_3215 
Symbol 
ID8327405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp3746657 
End bp3747973 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content72% 
IMG OID644943729 
Productglycoside hydrolase family 6 
Protein accessionYP_003100969 
Protein GI256377309 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAGTA TCGGGAAAGC AGCAGCGTGC GCGTTCTCCG CCGCGCTCGT GGCGGGCGCG 
GCCGTCCTGG TGACCGGGGG CGGGCAGACC GCCACCGCCG CCGACTCGGC GTTCTACACC
GATCCGGGGA GCTCCAGCGC GAGGTGGGTG GCGGCCAACC CGAACGACTC GCGCGCGGCG
GTCATCCGCG ACCGGGTCGC CTCGGTGCCG CAGGCGAAGT GGTTCACCAC CACCAACACC
TCGACCGTGC GCTCGGAGGT CAGCGCCTTC GTGGGCGCGG CGGCCTCGGC CGGGAAGATC
CCGATCCTGG TCGTCTACAA CATCCCCAAC CGGGACTGCG GCGGCGCCAG CGGCGGCGGC
GCGCCGTCCC ACCAGGCCTA CCGGGCGTGG GTGGACGAGG TCGCGGCCGG GCTCGGCGGA
CGACCGGCGT CGATCATCCT GGAACCGGAC GTGCTGCCGA TCATGTCCAA CTGCCAGAGC
GCGGACCAGC AGAACCAGAC CAAGGCGTCG ATGTCCTACG CGGGCCGCAA GCTCAAGTCC
GGCTCCGGGC AGGCGAAGGT CTACTTCGAC ATCGGCAACT CCGACTGGCT CGCGCCCGCC
GAGGCCGCGA ACCGGCTGCG CGGGGCCGAC GTGTCCGGCA GCTCCGACGG CATCGCCAGC
AACGTCTCCA ACTACCGCGC CACCCAGGCC GAGGTCTCCT ACACCAAGGC GATCCTGAAC
GCGCTGGGCG ACGGCAGGCT CAAGGCCGTC ATCGACACCA GCCGCAACGG CAACGGGCCG
CTCGGCAGCG AGTGGTGCGA CCCGCCCGGT CGCGCCATCG GCACGCCCAG CACGAAGAAC
ACCGGCGACT CGCAGATCGA CGCCTTCCTG TGGGTCAAGA TCGTCGGCGA GGCGGACGGG
TGCATCGCGA GCGCGGGGCA GTTCGTGCCG CAGCGCGCGT ACGACCTGGC GGTCGCGGCG
GGGCCCGTGC CCACGACGAC GACCACCACG CCGGGGGGCA ACCCCGGCGG CGGCTGCGCG
GTGACGCACC GCGTGGTCAG CCAGTGGAAC GGCGGGTTCA CCGGAGAGGT CGTCGTGGAG
AACAGGGGGC CCGCGATCTC CTCGTGGACC CTGGAGTTCT CCGCGCCCGG TGTGACCGTC
ACGCAGGGGT GGAACGGGAC GTGGACCGAC ACCGGCGACG GGGTCCGGGT CGTGAACACC
GCCTGGAACG GAGCGCTCGC GTCCGGCGGA CGGGTGACCG CCGGGTACAA CGCGAACTAC
GGTGGCGGCG CACCGCCGTT CTCGTCGCCG ACGCTGAACG GCGCCGCCTG CTCGTGA
 
Protein sequence
MASIGKAAAC AFSAALVAGA AVLVTGGGQT ATAADSAFYT DPGSSSARWV AANPNDSRAA 
VIRDRVASVP QAKWFTTTNT STVRSEVSAF VGAAASAGKI PILVVYNIPN RDCGGASGGG
APSHQAYRAW VDEVAAGLGG RPASIILEPD VLPIMSNCQS ADQQNQTKAS MSYAGRKLKS
GSGQAKVYFD IGNSDWLAPA EAANRLRGAD VSGSSDGIAS NVSNYRATQA EVSYTKAILN
ALGDGRLKAV IDTSRNGNGP LGSEWCDPPG RAIGTPSTKN TGDSQIDAFL WVKIVGEADG
CIASAGQFVP QRAYDLAVAA GPVPTTTTTT PGGNPGGGCA VTHRVVSQWN GGFTGEVVVE
NRGPAISSWT LEFSAPGVTV TQGWNGTWTD TGDGVRVVNT AWNGALASGG RVTAGYNANY
GGGAPPFSSP TLNGAACS