Gene Amir_3466 details


Gene Information

Locus tag: Amir_3466
Symbol:
ID: 8327656
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 4029343
End bp: 4031133
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 72%
IMG OID: 644943966
Product: extracellular solute-binding protein family 5
Protein accession: YP_003101206
Protein GI: 256377546
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
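
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4029343
END_BP = 4031133
GENE_LENGTH_BP = 1791
PROTEIN_LENGTH_AA = 596


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming an unspliced CDS with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold for this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Here 4031133 - 4029343 + 1 gives 1791 bp, and 1791 / 3 gives 597 codons, of which the last is the stop codon, leaving 596 amino acids.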


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0853012
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGGTGTTT GGCAGCGTAT CGTCGCCGCG TCGGCGGCGG CAGGTCTGTC CGTGTTGCCC 
TTCGGGGTAC CGGCCCAGGC CCAGAACGAG GTCGTCTTAC GGGTGGCCAT CACCCAGCAG
GTCGACTCGC TCAACCCGTT CCTGGCCACC TTCCAGGCCA GCACCGAGGT CGGCAGGCTC
ATGTACGACT TCCTCACCGC CTACGACCAG CGCGACCAGA CCCCCGTCCC CGCCCTCGCC
GACCGGTGGT CCTCCAGCGA GGACCGGCTC ACCTGGACCT TCCACGTCCC CGAGGGCCGC
AAGTGGTCCG ACGGCCAGGA CATCACCGCC GACGACGTCG CCTTCACCTA CGACCTGATG
ATGCGCGACA CCACCGCCGC CACCGCCAAC GGCAGCTTCA CCGCCGACTT CGAGTCCGTC
ACCGCCTCCG CCGACGGCCG CGAGGTCGTC ATCAGGACCA AGCAGCCGCA GGCCACCATG
CTCGCCCTGG ACGTGCCCAT CGTCCCCGAG CACGTCTGGT CCGAGGTCAC CGACGTCGGC
GACTACACCA ACGACCAGGG CCCGGTCGTG GGCAGCGGCC CGTTCGTGCT CACCGAGCAC
AAGCCCAACG AGTTCATCCG CTTCGCCGCC AACAAGACCT ACTGGCGCGG CGCGCCCAAG
TACGACAGCC TCGTCTTCGT CTACTACAAG ACCACCGACG CCGCCGCCCA GGCCCTGGCC
AAGGGCGAGG TCGACCTGGT CAACCGCCTC GGACCCGCCC AGTTCGACTC GCTGGAAGGC
GCCGAGGGCG TCACCCGCAA CAAGGCCAAC GGCCGCCGCT TCAACGAGCT GGTCCTCAAC
TCCGGGGCCG CCACCAACAC CGGCGAGCCC ATCGGCGACG GCCACCCCGC GCTGCGCGAC
CTCGTCGTGC GCCGCGCCAT CGCCCAGGCC ATCGACCGCG ACGCCATCAT CGCCCGCGTC
AACAACGGCT ACGCCCAGCG CGGCACCGGC CCCATCCCGC CGGTCTTCCC CGCCTACCAC
CTGCCGCAGC CCGACCCCGA CCCGCTGCCG CACGACCCCG CCGCCGCCAA CGCCGCCCTC
GACGCCGCGG GCTACGCGCG CGGGGCCGAC GGCACCCGCG CCAAGGACGG CCGCCCGCTG
CGGCTGCGCC TGCTCGGCCA CGCCAGCCGC GCCTACGACG AGCAGGCCGC CGAGTTCGTC
AAGGGCGGCC TCGCGGCCAT CGGCGTCGCC GTCGACGTGC AGATCGTCTC CGACAACCAG
CTCAACGAGT CCGCCACCGC GGGCACCTTC GACCTGGTCT TCTCCGGCTG GGGCACCAAC
CCCGACCCCG ACTTCATCCT GTCGCTGCAC ACCTGCGCCC AGCGCCCCGG AGCCGACGGC
AAGGGCGGCA CCACCGACAC GTTCTTCTGC GACCCCGAGT ACGACGCGCT GCACGCCCGC
CAGCGCGCCG AGTTCGACCA GGCCAAGCGC GCCGACCTGG TCAAGCAGAT GCAGCGCCGC
TTCTACGAGC AGGTCCCGGC CGTCGTCCTC GGGTACGACA ACGTGCTGGA GGCCTACCGC
AGCGACAAGT TCACCGGCTT CCCCGTCCAG CCCGACCCCG GCGGCGTGAT CATGGCCCAG
AACGGCGTGT GGGGCTACTA CGGCGCCACC CCCGCCGCCG CCGGCGCGCA GCGGGACACC
GGCAACGGCA CCCTCGTCGT CATCCTCATC GTCGGCGTCG CGGTGGTCGT CGTGGCGGGC
GGGGTGCTGC TGGCCCGCAG GCGCGGCGCG GGCGCCGAGG ACCGCGAGTG A
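
The GC content reported in the Gene Information section can be recomputed from the gene sequence above. This is a minimal sketch, assuming GC content is defined as the percentage of G and C among the A/C/G/T bases; how the database rounds the figure is not stated here, and `gc_percent` is a hypothetical helper name.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Whitespace and any other characters (such as the spaces between the
    10-base groups in the listing) are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Paste the full gene sequence here to compare with the reported value.
    first_line = ("GTGGGTGTTT GGCAGCGTAT CGTCGCCGCG "
                  "TCGGCGGCGG CAGGTCTGTC CGTGTTGCCC")
    print(f"{gc_percent(first_line):.1f}%")
```

The example call covers only the first line of the sequence; the whole sequence is needed to compare against the reported percentage.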
 
Protein sequence
MGVWQRIVAA SAAAGLSVLP FGVPAQAQNE VVLRVAITQQ VDSLNPFLAT FQASTEVGRL 
MYDFLTAYDQ RDQTPVPALA DRWSSSEDRL TWTFHVPEGR KWSDGQDITA DDVAFTYDLM
MRDTTAATAN GSFTADFESV TASADGREVV IRTKQPQATM LALDVPIVPE HVWSEVTDVG
DYTNDQGPVV GSGPFVLTEH KPNEFIRFAA NKTYWRGAPK YDSLVFVYYK TTDAAAQALA
KGEVDLVNRL GPAQFDSLEG AEGVTRNKAN GRRFNELVLN SGAATNTGEP IGDGHPALRD
LVVRRAIAQA IDRDAIIARV NNGYAQRGTG PIPPVFPAYH LPQPDPDPLP HDPAAANAAL
DAAGYARGAD GTRAKDGRPL RLRLLGHASR AYDEQAAEFV KGGLAAIGVA VDVQIVSDNQ
LNESATAGTF DLVFSGWGTN PDPDFILSLH TCAQRPGADG KGGTTDTFFC DPEYDALHAR
QRAEFDQAKR ADLVKQMQRR FYEQVPAVVL GYDNVLEAYR SDKFTGFPVQ PDPGGVIMAQ
NGVWGYYGAT PAAAGAQRDT GNGTLVVILI VGVAVVVVAG GVLLARRRGA GAEDRE
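
The gene sequence begins with GTG while the protein begins with M. Under NCBI translation table 11, GTG is one of the permitted initiation codons and is read as methionine at the start of a CDS, although it encodes valine elsewhere. The sketch below illustrates this on the first 30 bases; the codon table is built from the standard amino acid assignments, which table 11 shares with table 1, and `translate_cds` is a hypothetical helper rather than a database or library function.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a recognised start codon as M.

    Whitespace is stripped first; translation stops at the first stop codon.
    Characters other than A/C/G/T raise KeyError in this simple sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate_cds("GTGGGTGTTT GGCAGCGTAT CGTCGCCGCG"))
```

The result for these 30 bases can be compared with the first ten residues of the protein sequence, MGVWQRIVAA.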