Gene Amir_2920 details


Gene Information

Locus tag: Amir_2920
Symbol:
ID: 8327109
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 3369915
End bp: 3371189
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 70%
IMG OID: 644943446
Product: extracellular solute-binding protein family 1
Protein accession: YP_003100687
Protein GI: 256377027
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
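
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the stated gene length) and that the CDS ends in a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the gene record above.
START_BP = 3369915
END_BP = 3371189
GENE_LENGTH_BP = 1275
PROTEIN_LENGTH_AA = 424


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Coordinates should reproduce the reported gene length.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # Gene length should reproduce the reported protein length.
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both checks hold for this record: 3371189 - 3369915 + 1 = 1275 bp, and 1275 / 3 - 1 = 424 aa.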


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTCCGA ACCGCCGCGC CGGGTCGCGG GCACGACGCT GGGCAGGTCT GGCCGCCGCC 
CTCTCCCTGG CGGTCACCGG CTGCGGCCCC GGCGCCGCCG CCGACCCGAA CACGCTCACC
GTGTCGGTGT GGAACCTGGC CAGCACCCCG GAGTTCCAGG CCCTGTTCGA CGCCTTCGAG
CAGGCCAACC CCGGTGTCAC GGTCAAGCCG GTGGACATCG TCGCCGCCGA CTACCCGGAG
AAGGTCACCA CCATGCTCGC GGGTGGCGAC CGCACCGACG TGATCACCAT GAAGAACGTC
ACCGACTACG CCCGCCACTC CAGCCGGGGC CAGCTCCAGG ACGTCACCGA CCTCGTGGAG
CCGCTCGAGC CCCAGGGCCT GCAAGGACTT GAGGCGTTCG AGTCCGAGGG CAGGTACTTC
GCCGCCCCGT ACCGCCAGGA CTTCTGGGTG CTGTACTACA ACAAGACCCT GTTCCAGCAG
GCAGGCGTCC CCGCTCCCGA GAACTGGACC TGGGCCGAGT ACGCCGACGC GGCGAAGAAG
CTCACCACCG GCAGCGGCGG CGACAAGACC TTCGGCGCCT ACCACCACGT GTGGCGCTCG
GTGGTCCAGG CCATCGCCGC GGCCCAGAAC GACGGCGACC AGCTCAGTGG CGACTACGGC
TTCTTCACCG AGCAGTACAC CACCGCGCTG CGGATGCAGC AGGACGGCTC GGTCCTGGAC
TACGGCGCCA TCACCACCCA GCAGGCCGAC TACCGCAGCA CGTTCGAGCG GGCCAGGACC
GCCATGATGC CCATGGGCAC CTGGTACACC TCCAGCATCA TCAAGGCCAA GCGGGAGGGC
GACTCCCAGG TCGACTGGGC CATCGCCCCG CTCCCCCAGC GCGCCGAGGG CGGCGAGGTC
GCCACCTTCG GCTCCCCCAC CGCGTTCGCC GCCAACCGCA ACGCCGACAA CAGCGACCTG
GCCCGCGAGT TCGTCGCCTT CGCCGCGGGA GAAGAAGGCG CCAAGGCCAT CACGAAGGTC
GGCGTCGTGC CCGCCCTGCG CAGCGAGGCG CTGACCGACG CCTACTTCCA GCTCGACGGC
ATGGCGAAGG ACGAGCTGTC CACCGCCGCG TTCCGCCCGG ACCACGTCGC CCTCGAGATG
CCGGTGAGCA AGCGCAGCTC CGCCGTCGAC ACCATCCTCA AGGAGGAGCA CGAGCTGATC
ATGGTCGGCG AGAAGTCCGT CGCCGACGGC ATCCGCGACA TGGGCGAGCG CGTGCGCACC
GAGGCCCCCG AGTAG
 
Protein sequence
MIPNRRAGSR ARRWAGLAAA LSLAVTGCGP GAAADPNTLT VSVWNLASTP EFQALFDAFE 
QANPGVTVKP VDIVAADYPE KVTTMLAGGD RTDVITMKNV TDYARHSSRG QLQDVTDLVE
PLEPQGLQGL EAFESEGRYF AAPYRQDFWV LYYNKTLFQQ AGVPAPENWT WAEYADAAKK
LTTGSGGDKT FGAYHHVWRS VVQAIAAAQN DGDQLSGDYG FFTEQYTTAL RMQQDGSVLD
YGAITTQQAD YRSTFERART AMMPMGTWYT SSIIKAKREG DSQVDWAIAP LPQRAEGGEV
ATFGSPTAFA ANRNADNSDL AREFVAFAAG EEGAKAITKV GVVPALRSEA LTDAYFQLDG
MAKDELSTAA FRPDHVALEM PVSKRSSAVD TILKEEHELI MVGEKSVADG IRDMGERVRT
EAPE
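
The relationship between the two sequences above can be sketched in code. This is a minimal illustration, not the database's own pipeline: it strips the 10-base grouping spaces, translates codon by codon, and computes GC percentage. The codon table is built from the standard NCBI base ordering; translation table 11 uses the same amino acid assignments as the standard code and differs only in which codons may act as starts, which this sketch does not model. All names (`clean`, `translate`, `gc_percent`) are hypothetical helpers.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove grouping spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 bases and last 15 bases of the gene sequence above.
    print(translate("ATGATTCCGA ACCGCCGCGC CGGGTCGCGG"))  # MIPNRRAGSR
    print(translate("GAGGCCCCCG AGTAG"))                  # EAPE*
```

The two fragments reproduce the start (MIPNRRAGSR) and end (EAPE, followed by the stop codon) of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field.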