Gene Amir_4241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_4241 
Symbol 
ID8328434 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp4998389 
End bp4999657 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content76% 
IMG OID644944705 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003101942 
Protein GI256378282 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGCCG GGGGACTGGA CCGACGCTCG TTCCTCGCGG CAGCCGGGCT GCTCGGCCTC 
GGGCTGGGCG GGCCGTCGCT GGCCGCCTGC GGGTCCAACA CCGGGCGCGA CGGGTCCGCG
CCGGGCACGC TCAGCCACTG GTACCACGCC TACGGCGAGG ACGGCGTGCA GGACGCCGTG
CGCCGGTACG CCGCCGACTA CCCCGACGCC CGCGTCGAGG TGCAGTGGAA CCCCGGCGAC
TACGAATCCA AGATCGCCAC CGCGCTCCAG GGCGGCCGGG TGCCCGACGT GTTCGAGGCC
CAGGTCAAGG TCGACTGGGT CCGGCAGCAG CGGGTCGTCC CGCTCGACGA CCTCCTCGGC
GACCAGCGCC CCGACTTCGT CAAGACCCTC CTGGACTCGC AGACCGTCGA CGGCAGGCTC
TACGGCGTCC CCCAGGCCAT CGACACCCAG GTCCTGTTCT ACCGCCCCAG CCTCCTGCGC
GAGGCGGGCG TCACCCCGCC CACCACCGTG GACGAGCTGG TCGACGCCAC CCGCCGCCTG
TCCGGCAACA CCGCGCGCGG CTTGTTCGCG GGCAACGACG GCGGCGTCGC CGTGCTCACC
GCGCCGCTGC TGTGGTCCGC CGGGCTCGAC CTGCTCAGCC CGGACGGCGA GTCCCCCGGC
TTCGACGACC CGCGCGCCGC CACCGCCGTC GGCAAGCTCC GCGAGCTGCA CGCCACGGGC
GGCCTCCTGC TCGGCGCCCC TGCCGACTGG GCCGACCCCG GCGCGTTCAC CGAGGGCCTG
ACCGCCATGC AGTGGACCGG CCTGTGGAAC CTGCCCAAGA TCGTCGAGGC GCACGGCGAC
GACGTCGGCG TCCTGCCCTT CCCGCGCCTG GACGCCGACG GCGCCGAGTC CGTGCCCGTG
GGCGCCTACA GCGCGATGGT CAACGCCCGC GCCGCCGACG TCGAGCGCGC CAAGGACTAC
GTGCGCTGGC TGTGGGTCGA GCGGACCGAC CACCAGGCCG AGTTCGCCAC CGCCTTCGGC
GCGCACCTGC CCGCCCGCGC CAGCCTGCGC CCCGCCGCCG ACCGGCTCAG CGGCGGGCTC
GGCGCGGACG TCGCCCAGCT CGTCGCCGAC GTCGGCCGCG TCGCCAGCCC GGCCCGCTGG
AGCGCCGCCG CGAACACCGC CCTGTCCGAC GCCGTCTCCC GCGTCGCCCG CGAGGGCGCC
GACCCCGCCG AGGAGCTGCG CGCCGCCGTC GCCACCGCCC GCGACGAGCT CACCAGGCTG
GACCGGTGA
 
Protein sequence
MGAGGLDRRS FLAAAGLLGL GLGGPSLAAC GSNTGRDGSA PGTLSHWYHA YGEDGVQDAV 
RRYAADYPDA RVEVQWNPGD YESKIATALQ GGRVPDVFEA QVKVDWVRQQ RVVPLDDLLG
DQRPDFVKTL LDSQTVDGRL YGVPQAIDTQ VLFYRPSLLR EAGVTPPTTV DELVDATRRL
SGNTARGLFA GNDGGVAVLT APLLWSAGLD LLSPDGESPG FDDPRAATAV GKLRELHATG
GLLLGAPADW ADPGAFTEGL TAMQWTGLWN LPKIVEAHGD DVGVLPFPRL DADGAESVPV
GAYSAMVNAR AADVERAKDY VRWLWVERTD HQAEFATAFG AHLPARASLR PAADRLSGGL
GADVAQLVAD VGRVASPARW SAAANTALSD AVSRVAREGA DPAEELRAAV ATARDELTRL
DR