Gene Amir_1979 details


Gene Information

Locus tag: Amir_1979
Symbol:
ID: 8326164
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 2193564
End bp: 2195192
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 68%
IMG OID: 644942528
Product: extracellular solute-binding protein family 1
Protein accession: YP_003099773
Protein GI: 256376113
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
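
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon (the gene sequence in this record ends in TGA). The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2193564
end_bp = 2195192
gene_length_bp = 1629
protein_length_aa = 542


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons should hold if the assumptions above are right.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_codons(gene_length_bp) == protein_length_aa)
```

If either comparison fails for another record, the likely causes are a different coordinate convention or a partial or pseudo gene.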


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCACGA AGTGGAAGAG GCCGTTCACC GCACTGGTGG CGGGCGGTCT GGTCGCGGGC 
CTGGTGGCCT GCAGCGGGGA CGACCCCGGT TCGGAGCCGG TGAACCTGGA CGGCAAGCGG
GTGGGCGCCA TGGCCGACTT CGCCGCGGGG ACGCCGTTCA AGGCCACCGA GCCGCTGGAC
TTCGGCATCC TCTACAACGA CCACGCCAAC TACCCGATCA AGAACGAGTG GATGCTGTGG
TCGGAGCTGA CCTCGCGCAC CAACGTCACG CTCAAGCCGA CGGTGGTGCC CAACAGCGAC
TACGACCAGA AGCGCAGCCT GGTCATCGGG GCCGGTGACG CGCCGGAGAT CATCTCCAAG
ACCTACCCCG GCATGGAGCG GCCGTTCGTC GCCTCCGGCG CGATCCTGCC GGTCAGCGAC
TACCTCGACC TGATGCCGAA CTTCAAGGCC AAGGTCGAGA GGTGGGGGCT GAAGAAGGAC
CTGGAGACGC TGCGCCAGGA CGACGGCAAG TTCTACGTGC TGCCCGGCCT GCACGAGGCC
CCGTGGCAGG ACTACACGCT GGCGTTCCGC ACCGACGAGC TGACCCGGCT GGGCATCGCG
ACGCCGAAGA ACTGGGACGA GGTCCACACC GCGCTCAAGG CGATCAAGGC GGCGCACCCG
GACAGCTACC CGCTGTCGGA CCGGTTCGAG GGCAAGGCGC TGCTGAACTA CCTCGGCATG
ACCTACGGCA CCGCCGCCGG GTGGGGCTAC GTCAACGCGA CCTGGGACGA GAAGGCGCAG
AAGCTGGAGT ACAGCGCCTC CAGCCCGCAG TACAGGCAGA TGCTGGAGTA CCTGCGCAAG
CTCGTCGACG AAGGGCTGCT CGACCCGGAG AGCTTCACCC GCAAGGACGA CCAGATCGCG
ATCCGCAAGT TCGCCAACGG CGAGTCGTTC GCCATCAGCG CCAACGCGCA GAACGTGGTC
AACGACTACC GGCCCGCGAT CTCCGGCATC CCCGGCGCGA CCGTGGCGAA GATCCCGCTG
CCCGCGGGGC CCGCCGGGAA CCTGATCACC GGGTCGCGGC TGGAGAACGG GCTGATGATC
CCGGCCAAGG CCGCCGAGAG CGAGCACTTC GTCGCGATGA TGCAGTACGT CGACTGGGTC
TGGTACTCCG AGGAGGGCCA GGAGCTGGTC AAGTGGGGCG TGCCCGGCAC GACCTACGAC
AAGGACGCCT CGGGCAAGCG GGTGCTGAAC CCCGGCGTCG CCTACGCGGG CACGAACACC
GGCGCGCCCA AGCACCTGCA GAAGGACTTC GGGTTCGCGG GCGGCGTGTT CGCCTACGGC
GGCCCGACGG AGCTGCTGCG CTCGACGTTC ACCGACGAGG AGATCGCGTT CCAGGAGGAG
ATGGCGAAGA AGACGTCCGC CCCGCTGGTG CCGCCGTACC CGTTCACCGA GGAGGAGCGC
GAGCAGTCCT CGCTCTGGGA GACCCCGCTC AAGGACTACG CGAACCAGTC GGCGCTGCAG
TTCATCCTGG GCGACCGGGA CCTGTCGACC TGGGACGCGT ACGTGAGCGA GCTGGACGCC
AAGGGGCGGA CGCAGTTCGT GGACCTGGCC AACAGCGCCC ACCAGCGCTA CAAGCAGAAG
AACGGCTGA
 
Protein sequence
MRTKWKRPFT ALVAGGLVAG LVACSGDDPG SEPVNLDGKR VGAMADFAAG TPFKATEPLD 
FGILYNDHAN YPIKNEWMLW SELTSRTNVT LKPTVVPNSD YDQKRSLVIG AGDAPEIISK
TYPGMERPFV ASGAILPVSD YLDLMPNFKA KVERWGLKKD LETLRQDDGK FYVLPGLHEA
PWQDYTLAFR TDELTRLGIA TPKNWDEVHT ALKAIKAAHP DSYPLSDRFE GKALLNYLGM
TYGTAAGWGY VNATWDEKAQ KLEYSASSPQ YRQMLEYLRK LVDEGLLDPE SFTRKDDQIA
IRKFANGESF AISANAQNVV NDYRPAISGI PGATVAKIPL PAGPAGNLIT GSRLENGLMI
PAKAAESEHF VAMMQYVDWV WYSEEGQELV KWGVPGTTYD KDASGKRVLN PGVAYAGTNT
GAPKHLQKDF GFAGGVFAYG GPTELLRSTF TDEEIAFQEE MAKKTSAPLV PPYPFTEEER
EQSSLWETPL KDYANQSALQ FILGDRDLST WDAYVSELDA KGRTQFVDLA NSAHQRYKQK
NG
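
The sequences above are printed in blocks of 10 residues, so they need to be cleaned before any length or composition check. The following is a minimal sketch of that step, with a GC-percentage helper that could be compared against the GC content field. The helper names are hypothetical, and only the first line of the gene sequence is used as a sample.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group residues into blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence in this record, used as a short sample.
sample = "ATGCGCACGA AGTGGAAGAG GCCGTTCACC GCACTGGTGG CGGGCGGTCT GGTCGCGGGC"
print(len(clean_sequence(sample)))
print(round(gc_percent(sample), 1))
```

Passing the full gene sequence to `gc_percent` gives a figure to compare with the reported GC content, and `len(clean_sequence(...))` can be compared with the Gene Length and Protein Length fields.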