Gene Amir_1971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_1971 
Symbol 
ID8326156 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp2183048 
End bp2184484 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content68% 
IMG OID644942520 
Productalpha-L-arabinofuranosidase B 
Protein accessionYP_003099765 
Protein GI256376105 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000055654 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTCAGTA CCCGGTCCCT GGGCCGCCGG GTGGCGCTGC TCGTCGCGAC CCTGCTGCTA 
GCCGTCGCGC CGCACCAGGC CGCCGCGCGG CAAGCCATCG CGCAGCAAGC CGACCAGCCC
GCCACCGAGT CCTCGGACCC CACCACGCAG GCGACCTACG CCGCGTACGT GATGGGCTAC
TTCACCGAGT CCCCCAGCAC CACCGGCGCG AACTACGGCC TGCACCTCGC GGTCAGCGGC
GACGGCCTCA ACTGGACCCC GCTGGGCCAG AACAACCCCG TCGTCACCCC CACCGCGGGC
ACCAGGGGCC TGCGCGACCC GTTCATCCTG CGCAAGCAGG ACGGCACGTT CGTGGTCATC
GCCACCGACC TCAACGGCAC CGACTTCACG CAGAAGAACC AGTACATCCA CGCCTGGGAC
TCCACGAACC TGACCAGCTT CAGCAACTAC CGCAGGCTGA AGATGCACTC GATGGACACC
CACACCTGGG CCCCCGAGGC GTTCTACGAC GCGGCGCGCG GCCAGTACGG CATCCTCTAC
TCCGCGCACA ACGGAACCCG CGACGTCTTC ATGGTCAACT ACACCACCGA CTTCGTGGAC
GTCGGCTCCC CGCAGGTGTT CTTCGACCCC GGCTTCAACG TCCTCGACGG CACCGTCCTC
ACCAGCGGCG GCACGAACTA CCTGTACTAC AAGAACATGG CCGACGGGAA CCTGTACGGC
GCGCGCTCGT CCTCGTTGAA CCCCAACAGC TTCAGCACCT ACACGAGCCC GCTCAAGCAG
GCGAGCGGCA TCGAGGCGCC GATCCTGGTC AAGTCCAACA CCTCGGACAC CCACTACCTG
TGGGGCGACT CGTACTCCCC GGTGAACGGC GAGTTCTACG CCTGGTCCAC CACCAACCCC
GGCGCGAACT CCTGGTCGGT GCTGAACCAG CGCGCCTACA CCCAGCCGCT GAACTCCAAG
CACGCCACCA TCTCCCCGAT CACGGCGGCC GAGCAGTCCG CGCTGCTGTC CCGCTGGGGC
GCCCCGTCCT GGAACCGCCT GAAGTCCTCG AACTTCCCGG ACCGATTCGT GCGCCACCAG
AACTACCTCG GCCGCATCGA CCCGTACCCG TTCGACCCGT ACACCGACCA GCTCTGGAAG
CTCGTGCCGG GCCTGTCCGA CTCCTCGGGC GTCTCGTTCC AGTCGGTGTC CGACCCGACC
CGCTACCTGC GGCACTACGA GTACGCGATC CGCTTGGACG CCAACGACAA CACCGCCGCC
TTCCGCGCCG ACGCGACCTT CCACCGCGTC CCCGGCCTTG CCGACTCGTC CTGGTCCTCG
TTCCGCTCCG CGAACCTCCC GGACCGCTAC CTGCGGCACT CCGGCTACGC GCTGCGCGTC
GACCCGATCA GCACGGCCAC CGACCAGCAG GACGCGACCT TCCGCGTCGG CTCCTGA
 
Protein sequence
MVSTRSLGRR VALLVATLLL AVAPHQAAAR QAIAQQADQP ATESSDPTTQ ATYAAYVMGY 
FTESPSTTGA NYGLHLAVSG DGLNWTPLGQ NNPVVTPTAG TRGLRDPFIL RKQDGTFVVI
ATDLNGTDFT QKNQYIHAWD STNLTSFSNY RRLKMHSMDT HTWAPEAFYD AARGQYGILY
SAHNGTRDVF MVNYTTDFVD VGSPQVFFDP GFNVLDGTVL TSGGTNYLYY KNMADGNLYG
ARSSSLNPNS FSTYTSPLKQ ASGIEAPILV KSNTSDTHYL WGDSYSPVNG EFYAWSTTNP
GANSWSVLNQ RAYTQPLNSK HATISPITAA EQSALLSRWG APSWNRLKSS NFPDRFVRHQ
NYLGRIDPYP FDPYTDQLWK LVPGLSDSSG VSFQSVSDPT RYLRHYEYAI RLDANDNTAA
FRADATFHRV PGLADSSWSS FRSANLPDRY LRHSGYALRV DPISTATDQQ DATFRVGS