Gene Amir_0206 details


Gene Information

Locus tag: Amir_0206
Symbol:
ID: 8324362
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 228577
End bp: 230262
Gene Length: 1686 bp
Protein Length: 561 aa
Translation table: 11
GC content: 67%
IMG OID: 644940751
Product: extracellular solute-binding protein family 5
Protein accession: YP_003098023
Protein GI: 256374363
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
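
The coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(228577, 230262)
print(length)                  # 1686
print(protein_length(length))  # 561
```

Both values agree with the Gene Length and Protein Length fields in the record.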


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.646254
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTCATT CCCGATCGGC CCAGCGACGC TGGGTGGCTG CGATCGGTCT CACCAGTGCC 
GCCGCGCTCG TCCTCACCTC CTGCGTGCAG TCCGAGCGCA GCCAGGACTC CGGGACCGGC
GGCGCCGCGG GCGGGACGTT CACCTTCGGG GCCGCGGGAG CCCCGAAGCT GTTCGACCCG
ATGTTCGCGA CCGACGGTGA GACCTTCCGC GTCGCCAGGC AGATGTTCGA CGGCCTGACC
ACGTTCAAGC CCGGCACCGC CGAGCCCGCG CCCTCGCTGG CCAAGGAGTG GTCGAGCACT
CCCGACGGCC TCACGTGGAC GTTCAAGCTC GAGACGGGCG TGAAGTTCCA CGACGGCAGC
GACTTCAACG CCGAGGCCGT CTGCTTCAAC TTCGACCGCT GGTACAACCT GAAGGGCGAC
GCGCAGTCCG ACGCCGTCTC CCAGTACTAC GTCGACAACT TCGGCGGCTT CTCCGACAAG
CCGGAGACGT CGCTCTACAA GAGCTGCGCC GCCGAGGGCG CCGACACCGC CGTGGTCACC
CTGACCAAGA CGACGTCCAA GTTCCCGGAC ATCCTGGGCC TGCCGTCGTT CTCCATGCAG
AGCCCGAAGG CCCTGAAGGA GTTCAACGCC GACGACGTGA AGGCCCAGGG CGACAGCTTC
GTCTTCCCGG CCTACGCGAA CGAGCACCCG ACCGGCACGG GCCCGTTCAA GTTCGGCAAG
TACGACAAGG CGAACAACGT CGTCGAGCTC GTGCGCAACG ACGACTACTG GGGCGAGAAG
ACCAAGCTGG ACAAGCTGGT CTTCCGGATC ATCCCCGACG AGACCGCCCG CAAGCAGGCG
CTGCAGTCCG GTGACATCGA CGGCTTCGAC TTCCCGAACG CCGCCGACTG GGACAGCCTG
AAGAGCGGTG GCTTCAACGT CGAGGTCCGC CCGGCGTTCA ACGTCTTCTA CGTGGGCATC
AACCAGAAGC GGAACCCGAA GCTCCAGGAC CTCAAGGTCC GCCAGGCGCT GCTGCACGCG
ATCAACCGCG AGCAGCTGGT CAAGTCCCAG CTGCCCGAGG GCGCCGAGGT CGCGACGCAG
TTCATCCCGA AGACCGTGGG CGGCTACGCC GACGACGTGC AGAAGTACGA GTACTCGGTC
GACAAGGCCA AGTCGCTGCT CGCCGAGGCG GGCGCGTCCG ACCTGACGCT GAAGTTCTAC
TGGCCCTCCG AGGTCAGCCG CCCGTACATG CCCAGCCCGA AGGACCTCTA CGGCGCCATC
GCCGCGGACC TGCAGGCCGC GGGCATCAAG GTGGAGGCGG TCACCAAGCC GTGGAACGGC
GGCTACCTGA CCGACGTCGA CCAGGGCGCG CAGGCCGACC TGTTCCTGCT CGGCTGGACC
GGCGACACCG GTAGCGCGGA CAACTGGGTG GGCACCTTCT TCGGCAACCC GGCCAACCGC
TTCAACACCG GGGCCTCCGC GTGGGGCGCC GACCTGTCCG CGCAGCTGAA GACGGCTGAC
GCGGAGCCGG ACCGGACCAA GCGGTACGAC CTCTACAAGG AGATCAACCG CAAGATCATG
GCCGAGTACG TGCCGGCGCT GCCGATCTCG CACTCGCCGC CCGCGCTGGT CGTGAAGAAC
ACCGTCAAGG GCATCACTCC GAGCCCGCTG ACCGACGAGA AGTTCGTCGA CGTCACGGTC
AACTGA
 
Protein sequence
MVHSRSAQRR WVAAIGLTSA AALVLTSCVQ SERSQDSGTG GAAGGTFTFG AAGAPKLFDP 
MFATDGETFR VARQMFDGLT TFKPGTAEPA PSLAKEWSST PDGLTWTFKL ETGVKFHDGS
DFNAEAVCFN FDRWYNLKGD AQSDAVSQYY VDNFGGFSDK PETSLYKSCA AEGADTAVVT
LTKTTSKFPD ILGLPSFSMQ SPKALKEFNA DDVKAQGDSF VFPAYANEHP TGTGPFKFGK
YDKANNVVEL VRNDDYWGEK TKLDKLVFRI IPDETARKQA LQSGDIDGFD FPNAADWDSL
KSGGFNVEVR PAFNVFYVGI NQKRNPKLQD LKVRQALLHA INREQLVKSQ LPEGAEVATQ
FIPKTVGGYA DDVQKYEYSV DKAKSLLAEA GASDLTLKFY WPSEVSRPYM PSPKDLYGAI
AADLQAAGIK VEAVTKPWNG GYLTDVDQGA QADLFLLGWT GDTGSADNWV GTFFGNPANR
FNTGASAWGA DLSAQLKTAD AEPDRTKRYD LYKEINRKIM AEYVPALPIS HSPPALVVKN
TVKGITPSPL TDEKFVDVTV N
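
The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before use. The following is a minimal Python sketch of how the gene sequence might be joined, translated, and checked against the protein sequence. It assumes translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which codons may serve as start codons; this gene begins with ATG). The helper names clean_sequence, gc_content, and translate are hypothetical.

```python
# Standard codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence as displayed in the record.
first_line = "ATGGTTCATT CCCGATCGGC CCAGCGACGC TGGGTGGCTG CGATCGGTCT CACCAGTGCC"
dna = clean_sequence(first_line)
print(translate(dna))  # MVHSRSAQRRWVAAIGLTSA
```

The printed peptide matches the first 20 residues of the protein sequence. Applying clean_sequence to the full gene sequence and passing the result to gc_content and translate would allow the same comparison against the GC content and Protein Length fields.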