Gene Amir_2048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_2048 
Symbol 
ID8326237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp2271015 
End bp2273054 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content72% 
IMG OID644942599 
Productcellulose-binding family II 
Protein accessionYP_003099840 
Protein GI256376180 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3534] Alpha-L-arabinofuranosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCGCTC CCCGCACCGC TTTGTTGGCA CTCGCCCTGA TCGCCGCACC GCTCGCCACC 
CCCGCGGCGG CGTCCCCGAG CGCCGATCCC CCGACGTCCC AGGCCGTCGA CACCCGCGTC
TCGGTGAACA CCCGCGCGGG CCTGGAAACC GCCCCCGAGC TGGGAATGGG CGTCAACCAC
GCCATCTGGG ACTCCCAGCT CGGCACCCCG CAGGTCGCCG ACCTGCTCAA GGACGCAGGC
GTCCGCGCCA TGCGCTACCC CGGCGGCTCG TACTCGGACA TCTACCACTG GCGCGACCAC
ACCGCCCCCG GCGGCTACGT GGCCCCGAAC ACCGACTTCG ACACGTTCAT GGCAGGCGTG
CGGCGGGCGG GCGGTCAGCC GATCGTCACC GCCAACTACG GCACCGGGAC GCCGCAGGAG
GCCGCCGACT GGGTGCGCAG CGCCAACGTC GACAAGGGCT ACGGCGTCAA GTACTGGGAG
ATCGGCAACG AGCTCTACGG CAACGGCCAC TACGGCACCG CGTGGGAGGA GGACCACCAC
GAGGACAAGA GCCCGGCCGG GTACGCGGGC CTGGTGCGCG ACTACTCGCG GGCGATGAAG
GCCGTCGACC CGAGCATCAA GATCGGTGCG GTGCTGACCA CGCCCGGCGA GTGGCCGGAC
GCCATCACCG CGGCCGGTGA CGCGGGCTCG TGGAACCAGG TCGTGCTGTC CACGGCGGGC
GCGGACATCG ACTTCGTGGT CCTGCACTGG TACCCGCGCG GCACGAACGC CCCGGACCTG
CTGAGCAGCG CGGACACCAT CCCCGAGATG ATGGCCATGG CCAAGGAGCA GATCCGCAGG
CACACCGGGC GGGACCTCGG CATCGCGTTC ACCGAGCTGA ACACGAACTT CGCCCGCAAC
ACCCAGCCGA CCGCGCTGTT CGCCGCCGAC GCCTACGCGT CGCTGCTGGA GCGCGGCGCG
TTCACCGTCG ACTGGTGGAA CGTGCACAAC GGCCTGGGCA AGGTCACGAC GGTCGCCGGT
CAGACCGACT ACGACGACTT CGGCCTGCTC TCCAGCGCCA ACTGCACCGA GGACGGCTCG
GTGTGCCAGC CCGCGCTGAA CACGCCGTTC GCGCCGTACC ACGCGCTCGC GCTGACCTCC
CGGTTCGCGC GCCCCGGCGA CCAGTTCGTG GCCGCCTCCA GCACCGACCC GCTGGTGCGC
GCGCACGCCG CCCGACGTCC GGACGGCGGG CTGTCGGTGC TGCTGGTGAA CCGGGACCCG
GACGCGGCGA GGTCGGTCGC GCTGGACTAC GCCGGGTTCA CCCCGAAGTC CTCGGCGGTG
GTGCGGACCT ACGGCAACGG CGACACCGCG ATCACCACGT CCACCGGGAC GTCCGGCGCG
GTCACGCTCG CCCCGTACTC GATCACCGCG CTGGAGCTGG CGCCCACGTC CGCGACCACC
GGCCCGGCGA CCCCCGCCCG GCCGACCGCG TCCTCGGTCA CCGACCGCGG CGCCGTGATC
ACCCTGCCCA CTGCCTCTCC CGGTCTGAAG TACGAGGTGC ACCGCCAGGT CGGCACGCGC
ACCGACCAGT GGGGCGAGAC CACCGGCGGC AGCTTCACCG CCACGAACCT GTCCCCCAGC
ACCGAGTACA CCGTCAACGT CATCGCGCGC GACAACGCGG GCCGGGTGTC GTGGGCTTCG
CCGCCGCTGG TGTTCCGCAC CACCGCGCCC GCCTCGTCCA CCTGCGCGGT GCGGCTGGCC
AACACCAACG ACTGGGGCAA CGGCTGGGTC GGCGACGTGC AGGTCACCAA CACCGGGACC
ACCCCGGTGG ACGGCTGGGT GCTGGCGTTC GACTGGCCGA GCACCTGGCA GAAGGTCGAC
AGCGGCTGGA GCGGCACGTG GGCGCAGTCC GGCCGGTCGG TGACGGTGAC CGCCGACGCG
GGCAACCGGC TGCTGGCCCC CGGCGCGACG GTCTCGACCG GTTTCGTGGG CTCCTACCAG
GGCCCGAACC GGCTGCCGAC CGCGTTCTCG CTGAACGGCG TGCCGTGCTC GCTGCGGTGA
 
Protein sequence
MRAPRTALLA LALIAAPLAT PAAASPSADP PTSQAVDTRV SVNTRAGLET APELGMGVNH 
AIWDSQLGTP QVADLLKDAG VRAMRYPGGS YSDIYHWRDH TAPGGYVAPN TDFDTFMAGV
RRAGGQPIVT ANYGTGTPQE AADWVRSANV DKGYGVKYWE IGNELYGNGH YGTAWEEDHH
EDKSPAGYAG LVRDYSRAMK AVDPSIKIGA VLTTPGEWPD AITAAGDAGS WNQVVLSTAG
ADIDFVVLHW YPRGTNAPDL LSSADTIPEM MAMAKEQIRR HTGRDLGIAF TELNTNFARN
TQPTALFAAD AYASLLERGA FTVDWWNVHN GLGKVTTVAG QTDYDDFGLL SSANCTEDGS
VCQPALNTPF APYHALALTS RFARPGDQFV AASSTDPLVR AHAARRPDGG LSVLLVNRDP
DAARSVALDY AGFTPKSSAV VRTYGNGDTA ITTSTGTSGA VTLAPYSITA LELAPTSATT
GPATPARPTA SSVTDRGAVI TLPTASPGLK YEVHRQVGTR TDQWGETTGG SFTATNLSPS
TEYTVNVIAR DNAGRVSWAS PPLVFRTTAP ASSTCAVRLA NTNDWGNGWV GDVQVTNTGT
TPVDGWVLAF DWPSTWQKVD SGWSGTWAQS GRSVTVTADA GNRLLAPGAT VSTGFVGSYQ
GPNRLPTAFS LNGVPCSLR