Gene Amir_0403 details

Gene Information

Locus tag: Amir_0403
Symbol:
ID: 8324562
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 442348
End bp: 443628
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 70%
IMG OID: 644940947
Product: extracellular solute-binding protein family 1
Protein accession: YP_003098216
Protein GI: 256374556
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
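
The Start bp, End bp, Gene Length and Protein Length fields are mutually consistent if the coordinates are read as 1-based and inclusive, and if the stop codon is counted in the gene length but not in the protein. The following is a minimal sketch of that check. The variable names are illustrative, and the inclusive-coordinate reading is an assumption inferred from the numbers rather than something the record states.

```python
# Values copied from the record above.
start_bp = 442348
end_bp = 443628

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is the stop and yields no residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```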


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.512139
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAGGGCT GGAGAGGACT GGCAGTCGGC GCCACCGCGC TGTCGGTGGC ATTGGCCGGT 
TGCGGCACGA GCAGTGACAG CGGGAGCGGC GAGGGCGGCG ACAAGACGCT CACCGTGTGG
CTGATGGACG GCTCGGCGGC GCCTGCGCTG ACCGACGCGC TGCACCAGGA GTTCGAGAGC
GCCCACCCCG GCGTGAAGGT CAAGTACGAG GTGCAGAAGT GGGGTGGCAT CCAGGAGAAG
CTCACCACCG CGCTGACCGG GAAGACCCCG CCGGACGTCA TCGAGCTGGG CAACACCCAG
ACCGCGAAGT TCGCGTCCGA GGACACCCTG GAGGACCTGT CCGGCGACGT CTCGTCGCTC
GCGGGCGACC AGTGGCTCGC GGGTCTCGAG GACTCGCTCA CCTGGGACGG CAAGCAGTAC
GGGATGCCGT TCTACGCCGC GAACCGGACG GTCGTCTACC GCACCGACCT GTTCCAGGCC
GCGGGCATCA CCGCCGCGCC GACCTCGCGC GACGAGTGGG TCGCGGCCGT CGAGAAGCTC
AAGGCCGCCA ACGCGTCCGA CCCCGAGTTC CAGTCGCTGT ACCTGCCGGG CCAGTCCTGG
TACGTGCTGC TGTCGTTCAT CTGGGACGAG GGCGGCGACG TCGCCAAGAA GGACGGCGAC
AAGTGGGCGG GCGCGCTCGA CAGCGCCGAG GCCAAGGCTG GCCTGGAGTT CTACAAGAAG
CTCGTGGACG CCTCCGGCAC CAAGGCCCCG AAGGACACCG ACGAGGCCAC GCCGCAGCAG
ATGGAGGTGT TCGGCACCGG CAAGGTCGGC ATGATGATCG GTCTGCCGTG GGAGCTGGCG
GGCGCGGTCA AGGCCGACCC GACGCTGGAG GGCAAGCTCG GCGCGTTCCC GATCCCGTCC
AAGACGGCGG GCAAGACCGC GCCGGTGTTC CTGGGCGGCT CGAACCTGGC CATCCCGGCA
GGCAGCGCCA ACCCGGACCT GGCCAAGGAC TACCTGAAGC TGCTGTCCTC CAAGAAGTAC
CAGGACCTGC TGGCCGAGGG CGGCTCGGTG CCCGGCACCT CCACCGACAC CACGAAGCTG
GAGACCACCC CGGTCGGCAA GGCGCTCGCC GCCGCCGCCC CCAACGGCAA GGTCACCCCG
ACCACGCCGA CCTGGGCCGG TGTCGAGGCG GGCCAGAACC CGCTGAAGGA CATGCTGACC
GCGTACCTGA CCGGCGCGAA GTCGCTGGAC CAGGCGACGG CGGACGCCAA CGCGGCGCTC
ACGAAGACGC TGGCGGGCTG A
 
Protein sequence
MKGWRGLAVG ATALSVALAG CGTSSDSGSG EGGDKTLTVW LMDGSAAPAL TDALHQEFES 
AHPGVKVKYE VQKWGGIQEK LTTALTGKTP PDVIELGNTQ TAKFASEDTL EDLSGDVSSL
AGDQWLAGLE DSLTWDGKQY GMPFYAANRT VVYRTDLFQA AGITAAPTSR DEWVAAVEKL
KAANASDPEF QSLYLPGQSW YVLLSFIWDE GGDVAKKDGD KWAGALDSAE AKAGLEFYKK
LVDASGTKAP KDTDEATPQQ MEVFGTGKVG MMIGLPWELA GAVKADPTLE GKLGAFPIPS
KTAGKTAPVF LGGSNLAIPA GSANPDLAKD YLKLLSSKKY QDLLAEGGSV PGTSTDTTKL
ETTPVGKALA AAAPNGKVTP TTPTWAGVEA GQNPLKDMLT AYLTGAKSLD QATADANAAL
TKTLAG
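
The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG can act as an initiator codon and is then read as methionine. The sketch below shows one way to relate the two sequence blocks and the GC content field. It is a minimal illustration, not the method the database uses. The function names (`clean`, `gc_fraction`, `translate_cds`) and the `has_start` parameter are hypothetical. The codon table is the standard one, whose amino-acid assignments table 11 shares. Treating the first codon as the initiator is an assumption.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); translation
# table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str, has_start: bool = True) -> str:
    """Translate a coding sequence, dropping one trailing stop codon.

    Assumption: when has_start is True, the first codon is the initiator
    and is reported as M even if it is an alternative start such as GTG.
    """
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if has_start and residues:
        residues[0] = "M"
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# First and last lines of the gene sequence shown above.
first_line = "GTGAAGGGCT GGAGAGGACT GGCAGTCGGC GCCACCGCGC TGTCGGTGGC ATTGGCCGGT"
last_line = "ACGAAGACGC TGGCGGGCTG A"

print(translate_cds(first_line))                  # MKGWRGLAVGATALSVALAG
print(translate_cds(last_line, has_start=False))  # TKTLAG
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.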