Gene Amir_3335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_3335 
Symbol 
ID8327525 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp3911164 
End bp3912900 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content73% 
IMG OID644943846 
ProductCellulase 
Protein accessionYP_003101086 
Protein GI256377426 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.59527 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACTCGC ACCCGCGCGT AACCCGGTGG TCCACCCGCC TGCTCGCCCC AGCCCTCGCG 
CTGCTGATGG CGGCGGGCTC GCTCACCGCC ACCGCCACCG CCTCCCCGGC GCCGGCGCCC
GCGCCCGCGT CCCGCGACGG CGTCGCGCCC GCCGACGCGC GGGCGACGGT CGCGGCCATG
CAGCCGGGGT GGAACCTCGG CAACACCCTG GACGCCATCC CCGACGAGAC CTCGTGGGGC
AACCCCCGCA CCACCAGGGC GCTGCTGCGC CACGTGCGCG AGCAGGGCTA CCGCAGCGTC
CGCCTGCCGA TCACGTGGAG CAACCACCAC GGTCCCGCGC CGGACTTCAC GATCGACCCG
GCGTGGCTGG CCCGCGTCCG CGAGGTCGTG GACTGGTCGC TGGCCGAGGG CCTGCACGTG
ATGGTGAACC TGCACCACGA CTCGTGGCAG TGGCTCAACG CCTACCCGAC CGACCGCGCG
AACGTGCTGG CCCGCTACAC CGCGCTGTGG AAGCAGATCG CCACCGCCTT CCGCGAGCAC
CCCTCGAAGC TGGTGTTCGA GAACATCAAC GAACCCCAGT TCGCGGGCAC CTCCGGCGAC
GAGGAGGACT ACCGGGCGCT GCACGAGCTG AACGCCGAGT TCCACCGCGT CGTGCGCGGC
TCCGGCGGCG GCAACACCAC CCGCCTGCTG GTGCTGCCCA CCCTGCACAC CAGCGCCGAC
CAGGGCCGCC TCGACGCGTT CCTGGCGTCC TACGACCTGC TGCGGGACAC CAACATCGCG
GCGACCACGC ACTTCTACGG GTTCTGGCCG TTCAGCGTGA ACATCGCAGG CCACACCCGG
TTCGACGCGG AGGTGGAGCG GGACCTGGTC GACACGTTCG ACCGGGTGCG GCGGAGCCTG
GTGGACCGGG GCATCCCGGT GATCGTCGGG GAGTGGGCGC TGCTCAACCA CGACCACACC
CGGCCGGGGG TGATCGAGCG GGGCGAGTTC CTGAAGTTCC TCGAAGCCGT GGGCCACCAC
GCCCGCACCC GCGGCCTGAC CACGATGCTG TGGGACGCCG GGCAGTTCCT CGACCGCACC
GCGCTGCGCT GGCGCGACCA GGGCCTGCAC GACCTGGTCG AGTCGAGCTG GACCACCCGC
TCCGGAACGG CCGCGAGCGA CCGGATCCAC CTGGCGCGCA GCCAGCCGAT CACCGCGCGG
TCGCTCGCGC TCAACCTCAA CGGCACGGCG CTGCGCGAGG TGCGCTCGGG CGGGCGGACC
CTGGCCAGGG GCTCGGACTA CACCGTCTCC GGCAGCACCC TGACCTTCAG CGCCGCCGCC
CTCACCAGGT TGGCGGGCAA CCGGGCCCAC GGCGTCAACG CGACCGTCGA GCTCCGGTTC
TCTCAGGGCG TCCCGTGGCC GGTCGACGTG ATCAGCAGCG ACCGGCCGGT CCAGTCCGCG
GCCGAGGGCA CGACCGCCGC GTTCGCGATC CCCACCCGGT TCCTGGGCGA CCAGCTCGCC
ACCATGGAGG CCGCGTACGC CGACGGCAGC CCGGCCGGGC CGGCGAACTG GACGACGTAC
AAGGAGTTCT GGCAGCACTT CCAGCCCGAC CCGGCGGGGA ACCGGATCAT CCTCAAGCCG
GAGTTCTTCG CCGAGGTCGC GGACGGGGTG GTCGCCCTGA CGTTCCACTT CTGGAGCGGC
GAGCGCGTCG CGTACCGCGT CACCAAGGCC GGGGACCGGG TCACCGGGAC CCCGTGA
 
Protein sequence
MNSHPRVTRW STRLLAPALA LLMAAGSLTA TATASPAPAP APASRDGVAP ADARATVAAM 
QPGWNLGNTL DAIPDETSWG NPRTTRALLR HVREQGYRSV RLPITWSNHH GPAPDFTIDP
AWLARVREVV DWSLAEGLHV MVNLHHDSWQ WLNAYPTDRA NVLARYTALW KQIATAFREH
PSKLVFENIN EPQFAGTSGD EEDYRALHEL NAEFHRVVRG SGGGNTTRLL VLPTLHTSAD
QGRLDAFLAS YDLLRDTNIA ATTHFYGFWP FSVNIAGHTR FDAEVERDLV DTFDRVRRSL
VDRGIPVIVG EWALLNHDHT RPGVIERGEF LKFLEAVGHH ARTRGLTTML WDAGQFLDRT
ALRWRDQGLH DLVESSWTTR SGTAASDRIH LARSQPITAR SLALNLNGTA LREVRSGGRT
LARGSDYTVS GSTLTFSAAA LTRLAGNRAH GVNATVELRF SQGVPWPVDV ISSDRPVQSA
AEGTTAAFAI PTRFLGDQLA TMEAAYADGS PAGPANWTTY KEFWQHFQPD PAGNRIILKP
EFFAEVADGV VALTFHFWSG ERVAYRVTKA GDRVTGTP