Gene Amir_2666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_2666 
Symbol 
ID8326855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp2999243 
End bp3001096 
Gene Length1854 bp 
Protein Length617 aa 
Translation table11 
GC content69% 
IMG OID644943206 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003100447 
Protein GI256376787 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.955399 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACGG CACAGTCCGC CACCGACCTG GCCACGACCG ACCCGACCGC CGCCGACCCG 
ACCGCCGCAC CCGCCGGGGA CGTCCCCCGC AAGGGCGGGG TCGTCACCTG GGCCTGCGCG
CCGGGCTTCC CGCCCGCCGT GATCTTCCCG TTCACGCCCG CCGAGCGCAT GGGCACCCGC
AACATCCTGG AGTTCCAGGC CCTGATGTAC CGGACGCTGT ACTACTTCGG CAGCGACGGC
ACCCCGAACG TCGACTACCA CCAGAGCATC GGCGAGGAGC CGGTGTGGAG CGAGGACGGC
CGCACCGCGC GCGTGCGCAT CAAGCCGTGG AAGTGGTCCA ACGGCGAGAC CGTCTGCGCC
GACAACGTGC TGTTCTGGGT GAACCTGATG AAGGTCAAGG GCCCCAGGTA CGGCGAGTAC
GTCCCCGGCT ACTTCCCGGA CAACCTCACC GAGTACGGCA AGCTCGCCGA CGACGAGGTG
TTCTTCACCT TCGACAAGCC CTACTCCAAG CACTGGGTGC TGCACAACCA GCTCAGCACC
ATCACCCCGC TGCCCAAGGC GTGGGACCGC ACCGCCGACG GCCCGGCCAA CGCCTCCGGC
GACCTCGCCG ACGTCGAGGC CGTCTACGAG CACCTGATGG CCGAGCAGGG CGACATCATC
AACGAGGGCA ACGAGCACCG CACCAGGTGG GCCGACAGTC CCGTGTGGAG CGTCGTCTCG
GGGCCGTGGC GGCTCAAGAG CTACACCCTC GAAGGCGTCG TCACCTTCGT CCCCAACGAG
CACTACTCCG GCCCGAACAA GCCGCACCTG GACGAGTTCC GCCAGATCCC GACCTTCTCC
GACGAGGAGC AGTACGAGGT CCTCAAGAAG GGCCCGGACG CCGAGGGCGG CTTCCAGGTC
GGCTACCTGC CGCTCAGCTT CGCCACCGAG CCCGCCGTGG ACCCGGTCGT CGGCGGCCCG
AACCCGCTGG CCGAGCACTA CACGATGCAC CCGCAGACCG CGTTCTGCAT CCGGTACATC
TCGCTCAACT ACAACAACCC CACCGTCGTC GGGAAGATGT TCGCCCAGAC CTACCTGCGC
CAGGCGCTGC AGAGCGTCCT GGACCAGGAC ACCGCCGTCC GCGACATCTA CCAGGGCTAC
GCCTACCGGC AGAACGGCCC CGTCCCGATG TACCCGAGGA CCGAGTACGT CTCCCCGCGC
CAGCGCGAGG GCGCCTGGCC GCTCCCGTTC GACCCCAAGC ACGCCAAGGA GCTGCTGGAG
GCCAACGGCT GGGACACCAG CCGGACCCCT GCGGTGTGCG TGCGCGCCGG AACCGGACCG
GGCGAGGCGG GGGAGGGCAT CCCCGAGGGA ACCGAGCTCA CCCTGCTCAT GCGCTACGTC
GAAGGCAGGC CCGCGCTCAC CAGGCTCATG GAGGGCTTCC GCGACGCCGC CGCCGAGGCG
GGCATCGAGC TGCGCCTGCG CGAGATCTAC GGCTCCGTCC TGGTCGCCGA GGACGCGCCG
TGCGTGCCCA CCGAGGAAAC CCCCTGCCTG TGGGAGATGT GCTGCTGGAA CGGCGGCTGG
GCCTACCACC ACCCGACCGG CGAGATCCTC TTCTCCACCG GCGCGGGCGG CAACTTCGGC
TTCTACACCG ACCCCGAGGC CGACGCGCTC ATCGAGCGCA CCGTCACCAC CGACGACCTC
GACGTCCTCT ACGAGTACCA GGACTACATC GCCGAGCAGG TGCCGGTGAT CTTCACGCCG
AACTTCCCCA TCCGGCTCTT CGAGGTCGCC AACAACCTCA GGGGCTTCGG GCCGATCAAC
CCCTACGGCA TGATCAACCC GGAGAACTGG TACTACGCCG AGGACCCGGC GTGA
 
Protein sequence
MTTAQSATDL ATTDPTAADP TAAPAGDVPR KGGVVTWACA PGFPPAVIFP FTPAERMGTR 
NILEFQALMY RTLYYFGSDG TPNVDYHQSI GEEPVWSEDG RTARVRIKPW KWSNGETVCA
DNVLFWVNLM KVKGPRYGEY VPGYFPDNLT EYGKLADDEV FFTFDKPYSK HWVLHNQLST
ITPLPKAWDR TADGPANASG DLADVEAVYE HLMAEQGDII NEGNEHRTRW ADSPVWSVVS
GPWRLKSYTL EGVVTFVPNE HYSGPNKPHL DEFRQIPTFS DEEQYEVLKK GPDAEGGFQV
GYLPLSFATE PAVDPVVGGP NPLAEHYTMH PQTAFCIRYI SLNYNNPTVV GKMFAQTYLR
QALQSVLDQD TAVRDIYQGY AYRQNGPVPM YPRTEYVSPR QREGAWPLPF DPKHAKELLE
ANGWDTSRTP AVCVRAGTGP GEAGEGIPEG TELTLLMRYV EGRPALTRLM EGFRDAAAEA
GIELRLREIY GSVLVAEDAP CVPTEETPCL WEMCCWNGGW AYHHPTGEIL FSTGAGGNFG
FYTDPEADAL IERTVTTDDL DVLYEYQDYI AEQVPVIFTP NFPIRLFEVA NNLRGFGPIN
PYGMINPENW YYAEDPA