Gene Amir_2534 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_2534 
Symbol 
ID8326723 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp2855727 
End bp2856941 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content75% 
IMG OID644943076 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003100317 
Protein GI256376657 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.478006 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCCTT CCACCCGCCA CGCCGTCCTG CGCGCGCTGG TCCCGCTCCT GCTCCTGACC 
TGCCTCCCCG GCTGCGCCCC GCCCGCCGAC CGCACCCTCG TGCGCGTCCT CGGCCCGTGG
ACCGGAGCCG AGGAGGACCG GTTCCGCGCC GTGCTCGACC GGACCGGAGT GCCCTACGAC
TACACCGGCA GCCGCGCCCT CGGCCAGCTG CTGCGCTCCA GGGTGCAGCA GGGCGACCCG
CCCGACGTCG CCGTCCTGCC CGGCCTCGGC GAGCTGGCCG ACTACGCGCG GGGCGGCTAC
CTGCGCGAAC TGCCCTCGCT GCCCGAGGCC GACTACGCGC CCCTCTGGCG GGACGTGGCG
CGCGTGGGCG CCGAGGGCAC GCACGCCGTC GTCGTCAAGG CCGACCTCAA GAGCCTCATC
TGGTTCGACC CCGGCTCGGA CGTGCGCCCG CCCGCGAACG CCGAGCAGCT CCTGGCGGAC
GGCGCGCCGT GGTGCCTGGG TCTCGGCTCG TCGCCGGACG CGGGCTGGCC GGGCACCGAC
TGGGTCGAGG ACCTGCTGCT GCACCGGTCC GGCCCCGAGG TCTACCGGCG GTGGGCCTCC
GGGGAGCTCG CCTGGAGCTC ACCGGAGGTG CGCGGGGCCT GGCAGACCTG GGGCGCGCTG
GTGTCCGGCG TCCCCGCCGA GCGGGCGCTG CTCACCGACT TCGACGACGC GGGCCTGGCC
ATGTTCACCC GGCCCCAGGG CTGCGGGCTC GACCACCTCG GCTCGTTCGC GGGCGCCGTC
TACCGGGAGC GCGGGCACCG CGGCGACTTC GCCCCCTTCC CCGACCTGGG CGCGAGCGGC
TGGGAGGTGT CCGCCGACCT GGCCGGGCTC TTCACCGACT CGCCCGCCGC GCGCAGGCTC
CTGACGCACC TGGCGGACGC CGAGGGCCAG CGGGTCTGGC CCGCTGCGGG CGGTGCATAC
TCCGCCCACA AACGGGTACC CCCCTCCGGC TACGCCGATC CGGTGGACCG GCGGATCGCC
GAAGTCCTCA CCGAGGGCGC GTCCCTGTGC CTCGACGCCT CGGACCTCAT GCCCCCGAGC
CTGCGCTCCG CCTTCTACCG GGGCGTAATC AACTACCTCG AAGCCCCGGA GTCCCTCGAC
GGGGTGCTGG ACGGTCTCGA CCACATCGCC GATTCGGTCG ACCGAACGGA GTGGATTACG
CTACCTTGCG GCTGA
 
Protein sequence
MSPSTRHAVL RALVPLLLLT CLPGCAPPAD RTLVRVLGPW TGAEEDRFRA VLDRTGVPYD 
YTGSRALGQL LRSRVQQGDP PDVAVLPGLG ELADYARGGY LRELPSLPEA DYAPLWRDVA
RVGAEGTHAV VVKADLKSLI WFDPGSDVRP PANAEQLLAD GAPWCLGLGS SPDAGWPGTD
WVEDLLLHRS GPEVYRRWAS GELAWSSPEV RGAWQTWGAL VSGVPAERAL LTDFDDAGLA
MFTRPQGCGL DHLGSFAGAV YRERGHRGDF APFPDLGASG WEVSADLAGL FTDSPAARRL
LTHLADAEGQ RVWPAAGGAY SAHKRVPPSG YADPVDRRIA EVLTEGASLC LDASDLMPPS
LRSAFYRGVI NYLEAPESLD GVLDGLDHIA DSVDRTEWIT LPCG