Gene Emin_1192 details


Gene Information

Locus tag: Emin_1192
Symbol:
ID: 6263599
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1288663
End bp: 1289862
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 39%
IMG OID: 642611670
Product: peptidase M29 aminopeptidase II
Protein accession: YP_001876079
Protein GI: 187251597
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2309] Leucyl aminopeptidase (aminopeptidase T)
TIGRFAM ID:
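
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1288663
END_BP = 1289862
GENE_LENGTH_BP = 1200
PROTEIN_LENGTH_AA = 399


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1289862 - 1288663 + 1 gives the listed 1200 bp, and 1200 / 3 - 1 gives the listed 399 aa.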


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.318194
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTAATA AAAATGAATT GAAGAAATAC GCAGAGATCA TGGTATGGTC GCTAAAAAAA 
GCAAGGACGG CGCAATTTAA AAAGTATGAC TCAGTTCTTG TCCGTTATGA TATAGCATCC
GCCCCGCTGG CCGAGGAAAT TAACAGAATT TTACTTGCCG AAAGGCTTAA CCCTGTTATG
CAGGTACTCA GTTCCGAAAA CATAACAAAG GATTTCTACC AAATTTCTGA CGATAGCCAA
CTTAAATTCA TGCCAAAATG GCAGCTTGAG ATGCAAAAAG GCATAAACGG ACTTATTGCT
CTGCGCGCGC CGGCCAGTTT GGATCACTTA AAAGACGTTG ATCCTAAAAA AATGTCTTTA
ACAGCGTTGG CAAGAAAGCC CGCTAAAGAA ATTTTAGATA AACGCGAGGC GGAAGGTTCC
TTCGGGTGGA CGCTTTCCAC CTATCCTACC CAAGCGCTTG CCAAAAAAGC GGGGCTTAGT
TTAAAAGAAT ATACAAATCA AATAAAAAAA GCCTGTTACC TTACGGATGC CAACCCCTTA
AAAACATGGG AAAAAATTTA TAAACAAATG GAAGAGGTAA CTAAGTGGCT TGAGTCTTTA
AAAATAGACA CTATTAATAT GCAGTCTAAA AGCATGGATT TAAATGTTCT CTTGGGTGAA
CAGCGCAAAT TTTTATCAGC GCGCGGTTGC AATATGCCTT CTTTTGAAAT TTTTACCTCG
CCCGATTGGA GGGGAACGGA AGGAGTATAT TTCGCAGATA TGAAATCTTT TAGAAGCGGG
CAGATAATAG AAAATATAAA AGTGGAGTTT AAAAAAGGGC GCGTAATTAA GGCAAAAGCT
TCAAAAGGGG ACGATTACCT TAAAAAAATG ATAGCTATGG ACCCGGGCGC CGCTCAAATA
GGCGAATTTT CTTTAACGGA TAAACGTTTT TCTAAAATCG ACCGCTTTAT GGCGGATACT
TTGTTTGACG AAAACTTTGG CGGCAAAAAC GGCAACAGCC ACATAGCTTT AGGCGCCAGC
TTTGCCGACA GCTTTAGCGG TGACGTTAAA AAACTTACAA AAGCTAAAAA GAAAGCGCTT
GGGTTTAATG ACTCATCGCT TCATTGGGAT ATTATAAATA CGGAAGATAA AATTGTTAAA
GCAAAAGTAA AAAACGGTAA AACGGTTACT ATATACGAGA AAGGAATGTT TAAATATTAA
 
Protein sequence
MFNKNELKKY AEIMVWSLKK ARTAQFKKYD SVLVRYDIAS APLAEEINRI LLAERLNPVM 
QVLSSENITK DFYQISDDSQ LKFMPKWQLE MQKGINGLIA LRAPASLDHL KDVDPKKMSL
TALARKPAKE ILDKREAEGS FGWTLSTYPT QALAKKAGLS LKEYTNQIKK ACYLTDANPL
KTWEKIYKQM EEVTKWLESL KIDTINMQSK SMDLNVLLGE QRKFLSARGC NMPSFEIFTS
PDWRGTEGVY FADMKSFRSG QIIENIKVEF KKGRVIKAKA SKGDDYLKKM IAMDPGAAQI
GEFSLTDKRF SKIDRFMADT LFDENFGGKN GNSHIALGAS FADSFSGDVK KLTKAKKKAL
GFNDSSLHWD IINTEDKIVK AKVKNGKTVT IYEKGMFKY
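
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the tool that produced this record: it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard table (the two differ only in permitted start codons), and the helper names `clean`, `translate`, and `gc_fraction` are illustrative. Whitespace in the 10-base blocks is stripped before processing.

```python
from itertools import product

# Standard codon order (T, C, A, G); table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate("ATGTTTAATA AAAATGAATT GAAGAAATAC"))
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input can be compared against the GC content field.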