Gene Amir_3066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_3066 
Symbol 
ID8327256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp3540979 
End bp3542622 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content73% 
IMG OID644943588 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003100828 
Protein GI256377168 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTTACGAA TAGCGGGCAA CAAGGACGAC GAGGGGCGGA ACGTGCGCGT ACCTGCTTGG 
GCCCGAACGG CGGGAGCTCT GCTCTCCGCC ACCCTGCTGA CCACGGCGTG CTCGACCTCG
CCCACCGCGC AGGGCGGCCC CGCGCCGACC GAGCTGCGGC TGGCCATCGG CGGCGAGTCC
GAGGACGGCT ACGACCCGAC GCTCGGCTGG GGCCGCTACG GCTCCCCGCT GTTCCAGTCG
ACCCTGCTCG GCAGGGACGC CGACCTGAAC CTGACCAACG ACCTGGCCAC CGGCCACTCG
GTGAGCCCGG ACGGTCTGGT GTGGACGGTC GACCTGCGGG CCGACGCGAA GTTCAGCGAC
GGGCAGCCGG TCACCGCGAA GGACGTCGCC TACACCTTCA CCACCGCCGC CAAGAGCGGC
GGCCTCACCG ACGTGACCGT GCTCAAGGAG GCGGTCGTCG TCGACGAGGA CACCGTCGAG
CTGCGCCTGA CCCAGCCGCA GAGCACGTTC GTCAACCGCC TGGTGTCGCT GGGCATCGTC
CCCGAGCACG CGCACGGCGC CGACTACGCG CGCGAGCCGG TCGGGTCCGG GCCGTTCGTG
CTGGAGCAGT GGGACGAGGG CCAGCAGCTC ATCGTCAAGC GCAACGACGC CTACTACGGG
CAGAAGCCCG CGTTCGAGCG CGTCGTGTTC GTGTTCACCG GCGAGGACGC CACCCTGGCC
GCCGCCCGCT CCGGGCAGGT CCACGTGGCC TCGCTGCCGT CGTCGCTGGC CAGGACCGGG
CTGCCCGGCA TGGCGCTGCG GGACGTGGAC TCGGTGGACA ACCGGGGCAT CTCGTTCCCG
TACCTGCCCG CCGAGGGCCG CACCACCGCC GAGGGACGCC CGATCGGCAA CGCCGTCACC
TCGGACCGCG CCGTGCGCCA GGCCGTGAAC TACGCGCTGG ACCGGCAGGC GCTGGTCGAC
GGCGTGCTCG ACGGCTTCGG CAGCCCCGCC ACCGGCCCGG TCGACGGGAT GCCGTGGTTC
GAGCCGTCCG CCGCGATCGA GGACAACGAC CCGGAGCGCG CCAAGAAGCT CCTGGACGAG
GCGGGCTGGA CCGACCCGGA CGGTGACGGC GTGCGCGAGC GCGCGGGTGT GAAGGCCGAG
TTCCCCCTGC TCTACCCGGC CAGCGACTCG CTGCGCCAGG GCCTGGCGCT GGCCGTGGTC
GACATGCTCA AGCCGATCGG GATCGCGGTC TCCGCGCAGG GCGAGAGCTG GGAGGTCATC
CGCACCCGGA TGCACGCCGA GCCGGTCCTG TTCGGCTGGG GCAGCCACGA CCCGACCGAG
ATGCACACCC TGTACTCGTC GGCCCGCGCG GGCGTCGAGC TGTGGAACCC CGGCTTCTAC
GCCAACCCGG CGGTGGACGC GCACCTGGAC GCCGCGATGG CCACCACCGA CCCCGAGGTC
GCCACCCGCG AGTGGAAGGC CGCCCAGTTC GACGGGGCGC AGGGCTTCTC CGCGCAGGGC
GACGCCGCGT GGGCCTGGCT GGTCAACCTC AAGCACACCT ACTTCGCCAA CCAGTGCCTG
GACCTCGGCC CCGAGCAGGT CGAGCCGCAC GGCCACGGCT GGCCGGTCAC CTGGAACATC
GCCGCCTGGC GCTGGACTTG CTGA
 
Protein sequence
MLRIAGNKDD EGRNVRVPAW ARTAGALLSA TLLTTACSTS PTAQGGPAPT ELRLAIGGES 
EDGYDPTLGW GRYGSPLFQS TLLGRDADLN LTNDLATGHS VSPDGLVWTV DLRADAKFSD
GQPVTAKDVA YTFTTAAKSG GLTDVTVLKE AVVVDEDTVE LRLTQPQSTF VNRLVSLGIV
PEHAHGADYA REPVGSGPFV LEQWDEGQQL IVKRNDAYYG QKPAFERVVF VFTGEDATLA
AARSGQVHVA SLPSSLARTG LPGMALRDVD SVDNRGISFP YLPAEGRTTA EGRPIGNAVT
SDRAVRQAVN YALDRQALVD GVLDGFGSPA TGPVDGMPWF EPSAAIEDND PERAKKLLDE
AGWTDPDGDG VRERAGVKAE FPLLYPASDS LRQGLALAVV DMLKPIGIAV SAQGESWEVI
RTRMHAEPVL FGWGSHDPTE MHTLYSSARA GVELWNPGFY ANPAVDAHLD AAMATTDPEV
ATREWKAAQF DGAQGFSAQG DAAWAWLVNL KHTYFANQCL DLGPEQVEPH GHGWPVTWNI
AAWRWTC