Gene Amir_1872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_1872 
Symbol 
ID8326057 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp2068096 
End bp2071038 
Gene Length2943 bp 
Protein Length980 aa 
Translation table11 
GC content73% 
IMG OID644942421 
Productalpha-L-arabinofuranosidase B 
Protein accessionYP_003099666 
Protein GI256376006 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCAGAT CCCTGTCCCG CGCCCTCTTA GCCCCGCTGG CGGCCCTGCT GGTCGTCGGC 
TCGGCCGCCC CGGCGCTCGC CCAACCGGCC GCCCCGGCGC AGTCCGCGGA CGCCGCGCGA
CCGGCCGACG TGGCGCAGCC AGCCGACGTG GCCCAGCCGG CCGCCGCCGC CCAGCCGGCC
GCCGCCGCCC AGCCGGCCGA CGTGGCTCAA CCGGCCGCCG CAGCCCAACC GGCCGCCGCA
GCCCAGTCCG TCGACACCGC CCAGTCCGCC GCCCCCGAGC AGTCCGCCGA CGTGGTGTGG
CAGCCCAAGC CCGCGCCGCT GACCACGCCG TGGACGAGCC AGGTCTCGCC CACCAACGCC
CTGCCCGAGT ACCCGCGCCC CCAGCTGGTC CGGCCGGACT GGCAGAACCT CAACGGGGTG
TGGGAGTTCG CGGGCGCGGC GAACCTGGAC GACCCGCCGA TCGGCCGCCC GCTCGCCGAG
GGCGTCCTGG TGCCCTACGC GATCGAGTCG GCGCTGTCCG GCATCAAGCG GCACGAGGAC
AGCATGTTCT ACCGGCGGAC CTTCACCGTC CCCGCGGGCT GGGACGGGCG GCGGGTCAAG
CTCAACTTCG GCGCCGTCAC CTGGGAGAGC CGGGTCTGGG TCAACGGGAC GCAGGTCGGC
ACGCACACCG GCGGGTTCGA CCCGTTCTCG TTCGACGTCA CCGGCGCGCT CCGCAGTGGT
GGCAACGAGA TCGTGGTCGG GGTCAACTCG CCCGTCGACG GCCAGCGCTA CCCGATCGGC
AAGCAGCGGC GGAACCCGGG CGGCATCTGG TACACGCCCG CGTCCGGGAT CTGGCAGACC
GTGTGGCTGG AACCCGTGGC CACCAACCAC ATCACGCGCC TGGACACCAC GCCCGACGTG
CCCGCCGGGG TGCTGGACCT GGTCGTGCGG GGCAGCGCGG GCCAGCAGGT GCAGGCCCAG
GTGCTCAGCG GCGGGCAGGT CGTCGGCACG GCGTCCGGGA CCGTCGGGCA GCACCTGCGG
GTGCCGGTGC CGAACGCGCG GCTGTGGTCG CCGGACGACC CGTTCCTGTA CGACCTGCGC
GTCACGATGG CGGGCGGCGA CGCGGTCACC GGCTACTTCG GGATGCGCTC GCTCGGCAAG
GCCGTCATCG GCGGCGTGAC CAGGCCGCTG CTGAACGGGG ACTTCGTGTT CCAGCTCGGC
ACCCTGGACC AGGGCTACTG GCCGGACGGG GTCTACACCG CGCCCACCGA CGCGGCGCTG
CGCTCGGACC TGGAGCAGCA GAAGGCGTTG GGCTTCAACA TGGTCCGCAA GCACATCAAG
GTCGAACCGG CCCGCTGGTA CTACTGGGCG GACAGGCTCG GGCTGATGGT GTGGCAGGAC
ATGCCGTCGG TCGACTCGGT GGACGAGGCC CCGAACAGCC ACGCCAACTA CGAGTCCGAG
CTGCGCCGGA TGATCGAGAA CCTCAAGGGG ATCACGTCGA TCGTGCAGTG GGTGCCCTTC
AACGAGGGCT GGGGCGAGTA CGACGCCGGG CGCATCACGG ACCTGGTGCG GTCGCTGGAC
AGCACCAGGT TGATCAACCA CAACTCCGGG TCCAACTGCT GCGTCTCCGA CCCCGACCCC
GGCAACGGGG ACGTGATCGA CGACCACGCC TACCAGATGT CGTCCGGCAC CAGGCAGCCC
GACGGGCGGA TCGCGGTGCT CGGCGAGTAC GGCGGGCTCG GGCGGCGGAT CAGCGGGCAC
GAGTACCAGC CGGGCCAGGG CTTCGCCTAC GGCGACCTGT TCCCCGACGA GAACTCGTTG
ACCAGCCGGT ACGTCACGAT CACCGAGGAG GTCGGGCGGT TCGTGCAGAC GCGCGGGCTG
TCGGCGTCGG TGTACACCGA GCCGTACGAC GTGGAGAACG AGGTCAACGG CTTCCACACC
TACGACCGCC AGGTGCTGAA GATGAACGCG GCGCAGGTGC GGGCGGTCAA CCAGCGGGTC
CTGGCGCGCG CCAGGGGGAC CGAGGTCGGG CGGGACGAGC TGGTCTCGCT CAAGGTCGCC
ACCCCCGGCT TCACCACCCG CTTCCTGCGG CACCAGAACT CCCTCGCCCG CACCGACGTG
CTCACCCCCG GCAGCGGCGA GGGCGCGACC AAGGACGCCA CCTACCGGTT GCGCCCAGGG
CTGGCCGACC CGGCCTGCTA CTCCCTGGAG TCGCGGAACT TCCCCGGCAG CTACCTGCGG
CACGCCTCGT CGCGGGTCCG GCTGGACGCC AACGACGGCA GCGCGCTGTT CGCCGGGGAC
GCGACGTTCT GCGCCCGCGA CGGCTGGGGC GCGACGGCGT TCGAGTCCAA GAACCTGCCC
GGCCACTTCC TGCGGCACTA CAACGAGGGC GTGTACGTCG CCCGCAGCGG CGGCCCGAAC
CCGTGGGACG GCGCGGCGAG CTTCACCCAG GACACCACCT GGAACGTCAC GATCCCGCTG
TGGCGCAGCG GGGTCGACCT GCCCGTCGAC CAGGCGCGCT CGCTCCGGGT GACCACCCCC
GGCTTCACCG ACCGGTTCCT GCGGCACCGG GACGGCCTGG CCCGCACGGA CGTCGTCGGC
TCCGCGAGCG ACGCGACCAC CAAGGCCGAC GCGACGTTCG TGGTGCGGCG CGGGCTGGCC
GACCCGTCCT GCTACTCGTT CGAGTCGCGG AACCTGCCCG GCCGGTTCCT GCGGCACGCC
TCGTACCGGC TGCGCCTGGA CACCAACCCG AACACCGACC TCTTCCGCCG GGACGCCACG
TTCTGCGCCC AGCCGGGCGC CGGCGGCACG CGGCTGGCCT CGGTGAACGA GCTGGGGGCG
AACGTCCGGC ACTACAACGC GGAGGTGTGG GCGGCCAGCG ACGGCGGGGC CCACGCGTAC
GACAACCCGG TCTCGTACGC CCAGGACGTG AGCTGGAGCT TCGACCAGCC CTGGACGCCC
TGA
 
Protein sequence
MRRSLSRALL APLAALLVVG SAAPALAQPA APAQSADAAR PADVAQPADV AQPAAAAQPA 
AAAQPADVAQ PAAAAQPAAA AQSVDTAQSA APEQSADVVW QPKPAPLTTP WTSQVSPTNA
LPEYPRPQLV RPDWQNLNGV WEFAGAANLD DPPIGRPLAE GVLVPYAIES ALSGIKRHED
SMFYRRTFTV PAGWDGRRVK LNFGAVTWES RVWVNGTQVG THTGGFDPFS FDVTGALRSG
GNEIVVGVNS PVDGQRYPIG KQRRNPGGIW YTPASGIWQT VWLEPVATNH ITRLDTTPDV
PAGVLDLVVR GSAGQQVQAQ VLSGGQVVGT ASGTVGQHLR VPVPNARLWS PDDPFLYDLR
VTMAGGDAVT GYFGMRSLGK AVIGGVTRPL LNGDFVFQLG TLDQGYWPDG VYTAPTDAAL
RSDLEQQKAL GFNMVRKHIK VEPARWYYWA DRLGLMVWQD MPSVDSVDEA PNSHANYESE
LRRMIENLKG ITSIVQWVPF NEGWGEYDAG RITDLVRSLD STRLINHNSG SNCCVSDPDP
GNGDVIDDHA YQMSSGTRQP DGRIAVLGEY GGLGRRISGH EYQPGQGFAY GDLFPDENSL
TSRYVTITEE VGRFVQTRGL SASVYTEPYD VENEVNGFHT YDRQVLKMNA AQVRAVNQRV
LARARGTEVG RDELVSLKVA TPGFTTRFLR HQNSLARTDV LTPGSGEGAT KDATYRLRPG
LADPACYSLE SRNFPGSYLR HASSRVRLDA NDGSALFAGD ATFCARDGWG ATAFESKNLP
GHFLRHYNEG VYVARSGGPN PWDGAASFTQ DTTWNVTIPL WRSGVDLPVD QARSLRVTTP
GFTDRFLRHR DGLARTDVVG SASDATTKAD ATFVVRRGLA DPSCYSFESR NLPGRFLRHA
SYRLRLDTNP NTDLFRRDAT FCAQPGAGGT RLASVNELGA NVRHYNAEVW AASDGGAHAY
DNPVSYAQDV SWSFDQPWTP