Gene Amir_2457 details


Gene Information

Locus tag: Amir_2457
Symbol:
ID: 8326646
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 2726601
End bp: 2728289
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 70%
IMG OID: 644943002
Product: extracellular solute-binding protein family 5
Protein accession: YP_003100243
Protein GI: 256376583
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
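
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. Neither convention is stated in this record. The function names are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_len = span_length(2726601, 2728289)
protein_len = expected_protein_length(gene_len)
print(gene_len, protein_len)
```

Under these assumptions the computed values agree with the Gene Length (1689 bp) and Protein Length (562 aa) fields.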


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCAGAA GTCGGACCCC AAGGGTGGTC GCCACGGCCA CCGCACTCGT CGGCGCGCTC 
GCGCTGACCG CGTGCGGCGC CAACGACCAG CCCCAGCAGG GCGCGGCCGA CCAGGCCAAG
GCCGAGCGCG GGGGCGTGCT GCGCATCCTC GGCGAGAGCC AGTCGCTGAA CTTCGACCCG
GCCAAGAGCT CCTCCCTGCC GATCACCTCG CTCGCGCTGG TGCACCGCAG GCTCACCGGC
TGGCTGAACG AGCCGGGCCA GGACGCCAAG GTCGTCCCCG ACCTCGGCGA CACCGGCAAG
ACCGACGACG GCGGCAAGAC GTGGACCTTC ACCCTCAAGG AGGGCCTGAA GTACGCGGAC
GGCACGCCCA TCAAGGCCGA TGACGTCAAG TGGGGCGTCG AGCGCTCGTA CGCGCCCGCG
TTCTCCGGCG GCCTCGGCTA CCACAAGCAG CTGATCGAGG GCGGCGCCGA CTACCGGGGC
CCGTTCGAGG GGCAGCGCCT GGACTCCATC AAGGTCCCGG ACGACCGCAC GATCGTGTTC
ACCCTGTCGC GGCCTTACGG TGACTGGCCG TGGGTGGCGA GCACCCCGGC GTTCGCGCCG
GTGCCCAGCG GCAAGGGCGC GGAGGCCGAC TACAGCGACC ACGCGGTCGC GTCCGGCCCG
TACAAGGTGG AGAAGTACCA GAAGGGCGTC GAGGCGCGCC TGGTGCGCAA CGAGAACTGG
GACTCGGCCA CGGACGAGCT GCGCGGCGGC CTCCCGGACG AGATCCTGTT CCAGCTGGGC
CAGGACACCT CGGTGATCTC GCAGCGGCTC ACCGCCGACT CCGGCGACGA CAGGTTCGCG
TTCGGCTCCT CGTTCGTGTC CCCGGCGCAG CTCGCGCAGC TGGCGGGCAA CCCGTCGGCC
AAGCAGCGCC TGGTGACGTC GAAGACCGGC GCGCTCGCCT ACCTGGCGCT CAACACCCAG
AAGGCGCCGT TCGACAACCC CAAGGTGCGG CAGGCCGTGC AGTACGCGGT CGACAAGACC
TCGTACCAGA TCGCGTCGGC GGGCAACGCC GAGCTGGCCG GTGACGTCGC CACCACGCTG
ATCACCCAGG GCATCGCGGG CCGCGAGCAG TTCGACCTCT ACCAGACCAA GCCGTCCGGC
GACCCGGAGA AGGCCAAGTC GCTGCTGGCC GAGGCGGGCT TCCCGAACGG GATCTCGGGC
CTGGAGTTCC TGGTGTCGCA GACCAACAAC TACCCGGAGA AGGCCGAGGC GGTCCAGGCC
GCGCTGGTCA AGGCGGGCAT CCAGTCCGAG ATCCGGGTGC TGGAGAGCGA CGCCTACACC
GCCGAGTCGC GCGCGGACAA CCCGAACTAC GCGCTGACCC TGTCCTCGTG GCAGCCGGAC
TTCCCCAGCG CCAACGCGAA CATCCAGCCG CTGTTCGACT CCAAGGAGAT CGGCCAGGGC
GGGTACAACC TGTCCCGCTA CAGCAACCCC GAGGTCGACC AGCTGATCGC GGCGGCGCAG
GCCACCGTCG ACCCGGTCGA GGCGGGCAAG AAGTGGGTCG AGCTGGACAG GAAGATCCTG
GCCGACTCGC CCGTCGTGCC GCTGATCTAC ACCCGCAACT CGTTCCTGCA CGGGTCCAAG
GTCGCCGACT TCCGCATCGC CGACTTCCCC GCGTACCCGA ACTACGCCCG AGTCGGGCTC
CTGAAGTGA
 
Protein sequence
MTRSRTPRVV ATATALVGAL ALTACGANDQ PQQGAADQAK AERGGVLRIL GESQSLNFDP 
AKSSSLPITS LALVHRRLTG WLNEPGQDAK VVPDLGDTGK TDDGGKTWTF TLKEGLKYAD
GTPIKADDVK WGVERSYAPA FSGGLGYHKQ LIEGGADYRG PFEGQRLDSI KVPDDRTIVF
TLSRPYGDWP WVASTPAFAP VPSGKGAEAD YSDHAVASGP YKVEKYQKGV EARLVRNENW
DSATDELRGG LPDEILFQLG QDTSVISQRL TADSGDDRFA FGSSFVSPAQ LAQLAGNPSA
KQRLVTSKTG ALAYLALNTQ KAPFDNPKVR QAVQYAVDKT SYQIASAGNA ELAGDVATTL
ITQGIAGREQ FDLYQTKPSG DPEKAKSLLA EAGFPNGISG LEFLVSQTNN YPEKAEAVQA
ALVKAGIQSE IRVLESDAYT AESRADNPNY ALTLSSWQPD FPSANANIQP LFDSKEIGQG
GYNLSRYSNP EVDQLIAAAQ ATVDPVEAGK KWVELDRKIL ADSPVVPLIY TRNSFLHGSK
VADFRIADFP AYPNYARVGL LK
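
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce this record. It assumes the sequence is pasted in as shown, with spaces and line breaks between 10-base blocks, and it uses the standard codon assignments, which translation table 11 shares with the standard code (the tables differ in permitted start codons). The helper names clean, translate, and gc_fraction are hypothetical.

```python
from itertools import product

# Standard codon assignments, ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First two blocks of the gene sequence above; the trailing partial codon is ignored.
print(translate("ATGACCAGAA GTCGGACCCC"))
```

Passing the full gene sequence to translate should reproduce the protein sequence listed above, and gc_fraction on the same input can be compared against the GC content field.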