Gene Amir_4965 details


Gene Information

Locus tag: Amir_4965
Symbol:
ID: 8329163
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 5925378
End bp: 5926304
Gene Length: 927 bp
Protein Length: 308 aa
Translation table: 11
GC content: 75%
IMG OID: 644945405
Product: proline iminopeptidase
Protein accession: YP_003102637
Protein GI: 256378977
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID: [TIGR01249] proline iminopeptidase, Neisseria-type subfamily
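
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(5925378, 5926304)
    print(length, protein_length(length))
```

Under these assumptions the computed values agree with the Gene Length (927 bp) and Protein Length (308 aa) fields.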


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.328745
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCACCCGG TGACCGAACC ACACACGAGC GGGCGGCTGG CGGTCGGCGA CGGCCACGAG 
CTGCACTGGC AGATCCACGG GAACCCGACC GGGAAACCGG TGGTGGTCCT GCACGGCGGG
CCGGGGTCGG GCAGCCGCGC CAGGGCCACC AGGCTGCTCG ACCCGGCGGT GTACCGGGTG
GTGCTCTTCG ACCAGCGCGG CTGCGGGCGC TCGACGCCGC ACGCGGGCGA GCCGGAGGTC
GACCTGTCCA CCAACACCAC GGACCACCTG GTGGCGGACC TGGAACTGCT GCGCGCGTCC
CTGGACGTCG AGCGCTGGCT GGTGCTCGGG GGGTCCTGGG GCGCGGTGCT CGGGCTGGTC
TACGCGCAGC GGTACCCGGA GCGGGTGACC GGGCTGGTGC TCGCGGGCGT GGCGACCGGG
CGGCGGGCGG AGACGGACCT GCTGACGCGT GGGCTCGGGG AGGTGTTCCC GCGGGCGTGG
CAGGAGTTCA GCGACTTCGT CGGCGCGCCC GACGGCGACC TCTCGGCGGC CTACCTGGAG
CGGCTGGTCG ACCCGGACCC ACTGGTGCAC CTCGCGGCGG CGGACGCCTG GTGCGCGTGG
GAGGAGGCGA TGCTGCCGCA GACGCCGGGG TCGTTGGAGG ACGTGGTGGG GCGGGACCGG
CTGGCGTTCG CGCGGCTGGT GGCGCACTAC TGGGCGCACG GGAGCTGGTT GCGGGAGAAC
GAGGTCCTGG ACGGCTGCGA CCGGCTGGCC GGGGTGCCGG GGATCGTGGT GCAGGGCGAG
CTTGACCTGA TCAACCTGGT CGGGACGCCG TGGCTGCTGG ACCGGGCCTG GACGGCCGGC
GAGCTGGTGG TGGTGCGGGA GACCGCGCAC GGCGGGTCGG CGGCGCTGAG CGAGGCGTGG
AAGGCGGCGG CGGACAGCTT CCGGTGA
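
The GC content field can be recomputed from the gene sequence as listed. The sketch below assumes the sequence is pasted in the space-separated 10-base blocks shown above; `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces, newlines) between blocks is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence, as an illustration only.
    print(round(gc_content("GTGCACCCGG TGACCGAACC")))
```

Passing the full gene sequence gives the value to compare against the reported GC content.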
 
Protein sequence
MHPVTEPHTS GRLAVGDGHE LHWQIHGNPT GKPVVVLHGG PGSGSRARAT RLLDPAVYRV 
VLFDQRGCGR STPHAGEPEV DLSTNTTDHL VADLELLRAS LDVERWLVLG GSWGAVLGLV
YAQRYPERVT GLVLAGVATG RRAETDLLTR GLGEVFPRAW QEFSDFVGAP DGDLSAAYLE
RLVDPDPLVH LAAADAWCAW EEAMLPQTPG SLEDVVGRDR LAFARLVAHY WAHGSWLREN
EVLDGCDRLA GVPGIVVQGE LDLINLVGTP WLLDRAWTAG ELVVVRETAH GGSAALSEAW
KAAADSFR
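
The protein sequence follows from the gene sequence under translation table 11. Note that the gene begins with GTG, which encodes valine internally but is read as methionine when it serves as the initiation codon. The sketch below is a hand-rolled illustration, not the database's own pipeline: it assumes table 11 shares the standard codon assignments and uses the initiation codons listed for that table by NCBI; `translate` and the constant names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed identical for table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for table 11; any of them is read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codon read as methionine
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence.
    print(translate("GTGCACCCGG TGACCGAACC ACACACGAGC"))
```

The first 30 bases of the gene translate to MHPVTEPHTS, matching the start of the protein sequence listed above.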