Gene Amir_1678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_1678 
Symbol 
ID8325863 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp1833856 
End bp1835517 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content67% 
IMG OID644942228 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003099473 
Protein GI256375813 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.495361 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGACTCA GGCGAACGGC CGCCCTCGCC CTCGCGGCGA TCACGGCCTT CACCCCGCTC 
GCCGCCTGCT CCACGAGCAA CCAGGGCGCG CAGTCCCAGG CCGGTGTGCT GAACGTGGGC
AAGCCGGACG GCCCGCAGAC CGAGAACCAC AACCCGTTCC TCAACTCGTC CGCCGCGACG
ATCATGGGCT ACCGCAGGCT GATCTTCGAG CCGCTGACCA TGGTCAACGA GACCGACGCG
ACCCAGGAGC CCACGCCCTG GCTGGCCAGC GAGTGGGACT GGCAGGAGAA CTACTCCAAG
CTCGTGCTGA CCGTGCGCGA GAACGTCACC TGGTCCGACG GCAAGCCGCT CACCCCGGCC
GACGTCGCGT ACACGTTCAC CTTGCTGAAG AACAACCCCG GCCTGAACAT CCAGGGCCTG
CCGATCGACG GCGCGAGCGT CGACGGCGGC AAGGTGACGG TGAGCTTCCC GCGCTCCCAG
TTCACCAACC GCAACAAGCT CCTGGAGCAG TTCGTCGTCC CCGAGCACAT CTGGTCGACC
TACGCGAACC CGTCAACGGA GACGGTCAAG AACCCGGTCG GCAGCGGCCC GTACACGCTG
AAGTCCTTCA CCCCGCAGAC CCAGACCCTC GTCGCGCGCG ACAGCTACTG GCAGGAGCTG
CCGCAGGTCA AGGAGGTCCG GTACACCGCG TACGCCGACA ACAACGCGCA GACCACCGCG
CTGGCCAACG GCACCACCGA CTGGAGCTTC GTGTTCATCC CGAACTACGA GGCCGTCTAC
ACCAGCAAGG ACCCGCAGCA CAACAAGCTC TGGTTCCCGC CGGTGCTGGG CATCCACGGC
CTGTGGTTCA ACACCAAGAG CGCCCCGTGG GACAACCCGG CGCTGCGCCG CGCGGTGAAC
CAGGTGGTCA ACCGGCAGGA CATCTTCGTG CAGGGCGAGG GCGGCTACTT CTACCCGAAG
GTCGACAACA TCACCGGCAT CCCCACGCCC GCCGGTGACC CGTTCATCGC CGACGAGTTC
AAGGGCAGGA CCGTCGAGGT GGACGTCGCC GCGGCCAAGA AGGAGCTGAC CGACAACGGC
TTCAGCTACG ACGGCGACAA GCTCAAGGAC CCGTCCGGCA AGCCCGTGAC GCTGAAGATG
ACCGTGCCGT CCGGCTGGTC CGACTACGTC ACCAACGTCG AGATCATCAA GGACAACGTC
TCCGACATCG GCGTCGAGGC CACCGTCGAG CTGCAGAACG TCGACGCCTG GACCAAGGCG
CTGGACACCG GCGACTTCCA GGCCGCGCTG CACTGGACCA ACAACGGTCC CACGCCGTAC
GACATCTACC AGTCCATCAT GGACGGCGCG CTCTACAAGC CGGTCGGCCA GGGCGGCATC
AACGGCAACT ACGGGCGCTA CGAGAACCCC GAGGCCACCG CCGCGCTGGA GCAGTACGCC
ACCGCGCCCG ACGAGGCCTC CCGCACCGCC GCGATGACCC TGCTCCAGCA GATCTTCGTG
CGCGACATGC CGGTGGTCAT CACCTCGGCG GCCAACGGCG GCGGCGAGTA CACCACCCGC
AACTGGACCG GCTGGCCCGA CGCCGAGAAC CCCTACGCGC CCGCCCAGAT GACCCTGGAG
AACGCGCTGC AGATCGTCCT CAAGCTGAAG CCCGCCGCAT GA
 
Protein sequence
MRLRRTAALA LAAITAFTPL AACSTSNQGA QSQAGVLNVG KPDGPQTENH NPFLNSSAAT 
IMGYRRLIFE PLTMVNETDA TQEPTPWLAS EWDWQENYSK LVLTVRENVT WSDGKPLTPA
DVAYTFTLLK NNPGLNIQGL PIDGASVDGG KVTVSFPRSQ FTNRNKLLEQ FVVPEHIWST
YANPSTETVK NPVGSGPYTL KSFTPQTQTL VARDSYWQEL PQVKEVRYTA YADNNAQTTA
LANGTTDWSF VFIPNYEAVY TSKDPQHNKL WFPPVLGIHG LWFNTKSAPW DNPALRRAVN
QVVNRQDIFV QGEGGYFYPK VDNITGIPTP AGDPFIADEF KGRTVEVDVA AAKKELTDNG
FSYDGDKLKD PSGKPVTLKM TVPSGWSDYV TNVEIIKDNV SDIGVEATVE LQNVDAWTKA
LDTGDFQAAL HWTNNGPTPY DIYQSIMDGA LYKPVGQGGI NGNYGRYENP EATAALEQYA
TAPDEASRTA AMTLLQQIFV RDMPVVITSA ANGGGEYTTR NWTGWPDAEN PYAPAQMTLE
NALQIVLKLK PAA