Gene Amir_3944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_3944 
Symbol 
ID8328137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp4624226 
End bp4625626 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content71% 
IMG OID644944420 
ProductAlpha-N-arabinofuranosidase 
Protein accessionYP_003101657 
Protein GI256377997 
COG category[R] General function prediction only 
COG ID[COG3940] Predicted beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCACCGG CAGCCTCGGC GGCGATCGTC GACACCTCCG CCTCCTACGC GCTGGTCAAC 
CGGCACAGCG GCCGGGCGCT CGACGTCTAC GACCTGGCCA CGGGCGACGG CGCCCGCATC
GCCCAGTTCA CCCGCAACGA CGGGGCCTGG CAGCAGTGGC AGTTCGTCGA CTCCGGCGGC
GGCTACTACC GGCTCAAGTC CCGGCACAGC GACAAGGTCC TGCAGATCTC CGGCGGGTCG
ACCACCGACG GGGCCGAGGT CGTGCAGTGG ACCGACTCCA ACGCGACCAG CCAGCAGTTC
CGGCTCGTGG ACTCGGCGTC CGGCCACGTC CGGCTGCTGA ACCGGGCCAG CGGCAAGGCG
GTGGAGACCT ACGAGTGGTC GACCGCCGAC GGCGCGCGCG TCGTGCAGTG GCCGGACCTG
GACGGCGCGA ACCAGCAGTG GCAGCTGATC AGGCTGGGCG CGCCGGGCGC GGTCGTGAAC
CCCGTCAAGC GGGGCGGCCC CGATCCCTGG CTCCAGCACC ACAACGGCTA CTACCACCTG
GCGACGACGA CCTGGAACTC CACGGTCACC ATGCGCCGCT CCCGCACCCT GGCCGGGTTG
TCGAGCGCGG CCGACCAGGT CGTGTTCAGC CTCTCCGGCC GCCCCAACGG GTGCTGCACC
ATGTGGGCGC CGGAGTTCCA CCTGCTGAAC GGCCGCTGGT ACCTGTACTA CGTCGCCGGG
CAGAACGTGC CGGACTTCAA CCCCACGCAG CGCCTGCACG TGCTGGAGTC CGCGGGCGGC
GACCCCATGG GCCCCTACAG CTTCAAGGCC GACCTCGGGA ACACCTGGGA GCTCGACCCG
AGCATCCTGC AGGTGGGCGG GAAGCTGTAC CTGTTGGGCA GCGCGATGGA CGGCACGCAG
TCGCTGACCA TCACGCCGAT GAGCAACCCG TACACGCTCA GCGGGGCCCG CCGCACGATC
AGCCAGCCGA CCCTGGCCTG GGAGCGGCAG ACCGCCGCCG TCAACGAGGG CGCGGAACCG
CTGCACCGCA ACGGGAAGAC CATGATCGTG TACTCGGCGA GCGCGTGCTG GGGCCCGGAC
TACAAGCTGG GCCTGCTCAC CCTCACCGGC GGCGACCCGC TCAACCGGGC GCACTGGACC
AAGTCGCCGA ACCCGGTGTT CCAGCGCGAC GACGCGAACG GCGTGTTCGC CCCCGGCCAC
AACGGGTTCT TCAAGTCCCC CGACGGCACG GAGGACTGGA TCGTCTACCA CGCCAACGAG
AGCGCGTCCG GCGGCTGCGA CATGAACCGC TCGGCGCGGG CGCAGAGGTT CACCTGGAAC
GCCGACGGCA CGCCGAACTT CGGCCCGCCG GTCCGCCTGG GCGTGCAGCT CCCGCCGCCC
TCGGGCGAGC CCGCGTCCTA G
 
Protein sequence
MSPAASAAIV DTSASYALVN RHSGRALDVY DLATGDGARI AQFTRNDGAW QQWQFVDSGG 
GYYRLKSRHS DKVLQISGGS TTDGAEVVQW TDSNATSQQF RLVDSASGHV RLLNRASGKA
VETYEWSTAD GARVVQWPDL DGANQQWQLI RLGAPGAVVN PVKRGGPDPW LQHHNGYYHL
ATTTWNSTVT MRRSRTLAGL SSAADQVVFS LSGRPNGCCT MWAPEFHLLN GRWYLYYVAG
QNVPDFNPTQ RLHVLESAGG DPMGPYSFKA DLGNTWELDP SILQVGGKLY LLGSAMDGTQ
SLTITPMSNP YTLSGARRTI SQPTLAWERQ TAAVNEGAEP LHRNGKTMIV YSASACWGPD
YKLGLLTLTG GDPLNRAHWT KSPNPVFQRD DANGVFAPGH NGFFKSPDGT EDWIVYHANE
SASGGCDMNR SARAQRFTWN ADGTPNFGPP VRLGVQLPPP SGEPAS