Gene Amir_2574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_2574 
Symbol 
ID8326763 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp2906250 
End bp2907599 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content67% 
IMG OID644943116 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003100357 
Protein GI256376697 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.316894 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTACCA CCAGCCGCCG CGCGCGGGTG GCCGTGGCGA TGGCCGCCGC GCTCACCACC 
GCGCTCACCG CCTGCTCCTC CGGCGACACC GCCGCGAACC AGCCGGCGGG CATGACCGGC
ACCGCTGACG CCGTCGACGC CGCCCTGAAG GCCGGTGGCG AGATCACCTA CTGGAGCTGG
ACCCCTTCGG CGCAGGCGCA GGTCGACGCC TTCATGAAGG AGTACCCGCA GGTCAAGGTG
AACTACGTCA ACGCGGGCAC CAACAAGGAC CAGTACACCA AGCTGCAGAA CGCGATCAAG
GCCGGCTCGG GCGCGCCCGA CGTGGCCCAG GTCGAGTACC AGGCCCTGCC GCAGTTCGCG
ATGACCGACT CGCTGCTGGA CCTCGGCCAG TTCGGGTTCA ACGAGTACGA GAAGGACTAC
ACCGCCTCCA CCTGGAACTC GGTGAAGGTC GGCGGCGGCC TGTTCGGCCT GCCGCAGGAC
TCGGGCCCCA TGGCGATGTT CTACAACAAG GAGGTGTTCG ACCAGTACCA GATCCAGGTC
CCCAAGACCT GGGACGAGTA CGTCGCCGCC GCCGAGAAGC TGAACGCCGC GGACCCCACC
AAGTTCATCA CCGCGGACTC CGGTGACGCC GGCTTCGCCA CCAGCATGAT CTGGCAGGCG
GGCGGCAAGC CGTTCACCGT CGACGGCACC AACGTCAAGG TGAACCTGCA GGACGAGGGC
GCCAAGAAGT GGACCGAGAC CTGGAACAAG CTGGTCTCCA AGAAGCTCAC CGCGCCCACC
ATCACCGGCT GGTCGGACGA GTGGTACCGG GGCCTGGGCA ACGGCAGCAT CGTCACGATG
ATCAACGGCG CCTGGATGCC CGGCATCCTC GAGGCCTCCG TGCCTGACGG CAAGGGCAAG
TGGGCCGTGG CCCCGATGCC GACCTACGAC GGCAAGCCCG CGACCGCCGA GAACGGCGGC
GGCGGCCAGT CCGTCATCAA GCAGAGCGCC AACCCCGCGC TCGCGGCGGG CTTCGTGCGC
TGGCTGAACC ACGAGCAGGG CGGCATCGAC AAGTTCATCG AGTTCGGCGG CTTCCCGGCC
ACCACCAAGG AGCTCGAGTC GGACGCGTTC CTGAACGCCG AGTCCGCGTA CTTCGGCGGC
CAGAAGATCA ACCAGGTCCT CTCGCAGGCG GGCAAGGACG TCGTGAAGGG CTGGGAGTAC
CTGCCGTTCC AGCTGTACGC CAACAGCATC TTCAACGACA ACGCCGGTAG CGCCTACGCC
AACGCCAGCG ACCTGAACGC GGGTCTGGCC TCCTGGCAGA AGGCCATCAC CGAGTACGGC
AACCAGCAGG GCTTCACCGT CACCAACTGA
 
Protein sequence
MSTTSRRARV AVAMAAALTT ALTACSSGDT AANQPAGMTG TADAVDAALK AGGEITYWSW 
TPSAQAQVDA FMKEYPQVKV NYVNAGTNKD QYTKLQNAIK AGSGAPDVAQ VEYQALPQFA
MTDSLLDLGQ FGFNEYEKDY TASTWNSVKV GGGLFGLPQD SGPMAMFYNK EVFDQYQIQV
PKTWDEYVAA AEKLNAADPT KFITADSGDA GFATSMIWQA GGKPFTVDGT NVKVNLQDEG
AKKWTETWNK LVSKKLTAPT ITGWSDEWYR GLGNGSIVTM INGAWMPGIL EASVPDGKGK
WAVAPMPTYD GKPATAENGG GGQSVIKQSA NPALAAGFVR WLNHEQGGID KFIEFGGFPA
TTKELESDAF LNAESAYFGG QKINQVLSQA GKDVVKGWEY LPFQLYANSI FNDNAGSAYA
NASDLNAGLA SWQKAITEYG NQQGFTVTN