Gene Amir_0761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_0761 
Symbol 
ID8324924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp841988 
End bp843802 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content67% 
IMG OID644941304 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003098569 
Protein GI256374909 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.952848 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAGAA AGACCGTCTC GGCGTTCGCG CTGATGACCA GTGCGGCGCT CTTGCTGAGC 
GCCTGTGGCG GCGGCGGCGC CGGCAGCGAG GGCGAAGGCG GCCAGCAGGA CTCGAACCCC
GGTGCGATCG GGGGCCAGGA CGAGATCTTC AAGCGTCCGG CCGTCGACGA CATCGGCGAG
GTCGCGATCG CCGTCGAAGA GGGCTTCACC AACTACAACA ACTTCACGGG TGCGACGAAC
AACTTCGCCA GCACCATGGC GTTGTCGAAC GTGCAGCCCT CGCCGTACAT CGTCGACCTG
GTCGACGGCA AGGTCGTCAT CAAGGTCGAC GGCGACCTGA TGGAGTCCAT CAAGGTCACC
TCGAACGACC CGCAGGTCAT CGAGTGGAAG GTCCGCAAGG AGGCGGTCTG GTCCGACGGC
CAGCCGATCG ACTGCAAGGA CTTCCACCTG AAGTGGCTGG CCGCGACCAG CCAGGCCAGG
ACGACGACCT CCGACGGCGA GTCCGCCTCG ATCTTCGACG CGACCCCCAC CGGGTACGAG
GACATCGAGA AGCTCGAGTG CGCGGACGGC AACAAGACGA TCACCACGAC GTTCAAGAAG
CCGTACGCCG ACTACCGCGG CCTGTTCTCG CAGCCCGGTA GCGACGGCCT CCTCCCGGCC
CACGTGCTGG AGCAGAAGAC CGGCATCGAG GACATCACCA AGATCACCCC GGCGCAGAAC
GACGAGACGG TCAAGAAGGC CGCCGAGTTC TTCACCAAGG GCTGGAACGG CTGGAGCGCC
GACGTCGCCC TCTCCGGTGG CCCGTACGTC ATCACCTCGG CCGACCTGAG CGACCAGACG
GTCCTCGAGC GCAACCCGAA GTGGTGGGGC AACAAGGGCG GCCCGGCCAA GGTCATCCTG
AAGACGAACC GCGACGCCCA GTCCGCGGCC CAGCAGCTGC AGAACAAGGA AGTCCAGGCG
ATCGCGCCGC AGGCCGACAA CGCCGTGGCG CAGCAGCTCC GGGGCAGCGA CGCCTACACG
GTCTTCGCCA GCGGTGGCCA GACCTACGAG CACATCGACC TGAACATGGC CAACCCGCTG
TTCGGCCAGA ACAAGGAGCT CCGCGAGGCG TTCGCGATCT GCACCCCGCG GACCGAGATC
GTCGAGAAGC TCGTCCAGGA CGTGCAGCCG GGCGCCAAGC CCCTGGGCAG CCTGACCTTC
ATGCCCAACG AGGTCGGCTA CGAGGACCAC TACTCCGACC TGGCCGACGG TGACGCCGAG
GCCGCCAAGA AGGTCATGGA GGCCGGTGGC TGGACCCTGG GTGGCGACAA CGTCTACACC
AAGGGCGAGT TCCGCGCGTC CTTCAAGCTG AGCCACAAGA CCGTGACCCG TCGCGCGCAG
ACCGTGCGCC TGGTCCAGGC CTCCTGCGCC AAGGCGGGCA TCGAGGTCAT CGCCGACGAG
GCCGCCGACT TCAACGACAA GCGCCTCCCG GCCTCCGAGT TCGAGGCCGC CCTGTTCGCG
TGGGTCGGCG CCCCGCTGAA GGCAGGCGCG TTCGGCAACT ACGCCCAGAA GGCCAAGGGC
GGCTCGGCGA ACTACAACAA CTACGACTCG GCGACCGTCA CCGACACGTG GGCGAAGGCG
AACAGCGAGC TCGACTACGA GAAGCGCATC ACGCTGATGA ACGACGTCGA CAAGGCGATG
CGCGCCGACC TGGCGAGCAT CCCGCTGTTC CAGCACACCG ACTTCACCGC CTCCTCCTCG
GAGTACGGCC CGGTGAGCTA CATCGGTGTC GCGGGTGGCA TCACCTGGAA CCTGTACGCC
TGGCAGAAGA AGTAG
 
Protein sequence
MRRKTVSAFA LMTSAALLLS ACGGGGAGSE GEGGQQDSNP GAIGGQDEIF KRPAVDDIGE 
VAIAVEEGFT NYNNFTGATN NFASTMALSN VQPSPYIVDL VDGKVVIKVD GDLMESIKVT
SNDPQVIEWK VRKEAVWSDG QPIDCKDFHL KWLAATSQAR TTTSDGESAS IFDATPTGYE
DIEKLECADG NKTITTTFKK PYADYRGLFS QPGSDGLLPA HVLEQKTGIE DITKITPAQN
DETVKKAAEF FTKGWNGWSA DVALSGGPYV ITSADLSDQT VLERNPKWWG NKGGPAKVIL
KTNRDAQSAA QQLQNKEVQA IAPQADNAVA QQLRGSDAYT VFASGGQTYE HIDLNMANPL
FGQNKELREA FAICTPRTEI VEKLVQDVQP GAKPLGSLTF MPNEVGYEDH YSDLADGDAE
AAKKVMEAGG WTLGGDNVYT KGEFRASFKL SHKTVTRRAQ TVRLVQASCA KAGIEVIADE
AADFNDKRLP ASEFEAALFA WVGAPLKAGA FGNYAQKAKG GSANYNNYDS ATVTDTWAKA
NSELDYEKRI TLMNDVDKAM RADLASIPLF QHTDFTASSS EYGPVSYIGV AGGITWNLYA
WQKK