Gene Emin_0731 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0731 
Symbol 
ID6262972 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp806469 
End bp807527 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content40% 
IMG OID642611205 
Productpeptidase M24 
Protein accessionYP_001875623 
Protein GI187251141 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0000000317312 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTAAAAA CAAAATTTAA CCCCGAACTT ATTAAAAAAC TTCAAAAAGC TGTTGTGAAA 
AACAAATTAG ACGCTTATTT TGTAACCGAT TATAAGGACC AGCTTTATTT GACCGGTTTT
AAATTTTACC CGCAGGAAGC TATTTTACTT GTAACTCCTA AAGAAGTTTA TTGTTACACA
AGAGATTTAT ATATTATAGA ACTTGGGCAG AAAATACCCC TGTTAAAGGC TTCCGCCCCG
TTAGATTACG CTTTGGCGGC AGCCGAACAG GCAAAAAAAC TTAAACTTAA AAATGTGGGT
TTTGATGCGG TTAAAACATA TTATAATTAC GGCAAAACAT TTGAGAAATT CGGCTACAAA
CCCTCAGCCT TTACGCCGGG TGAACTACGC GAGGTTAAAG AAAAAAGCGA GCTTGACACA
ATGCGCAAAG CCAACCGCAT AGCTTATAAA ACCTATGAAT ATATTAAAAA ATATATTAAA
ACCGGAATGA GCGAGTTTGA AGTAGCGGCC GAAATTGAAC GTTATATGAA ATCGCAGGGA
GCTACAGCCT TAAGTTTTGA GTCAACAGTG TGCTTTGGCG TTAACGGTAC AAACACGCAC
CACACCCCCA CAAAAGACAA ATTAAAAAAT GAGCAGGCCA TATTACTTGA TTTCGGCTGT
ATTTATGATA ACTACTGCTC AGATATTTCA CGCAGCTGGT GGCACGGCAA AAAACCCACG
GCTGAGTATA AAAAGGCCTG GAAAGCGGTT GATGACGCCA GAAAAGCCGG GATAAAAGCG
GCTAAACCCG GTATTACCGG TAAAGAGCTT GACCTTGTTC CTAGAAATGT AATTGAAAAA
GCAGGTTTTG GCAAATATTT TATACACAGG ACAGGCCACG GCATAGGCAT GCAGGCGCAT
GAAGACCCTA ACGTGGAACC GCAAAATAAC AGGAAATTTG TTGCTAACAA CGTAATAACA
ATTGAGCCCG GTATTTATTA CACAGGTCAT TTTGGCATAC GTATAGAAGA TACTGTTGTT
GTCACACCTA AAGGCGGCGT AATTCTTACA AAGAAATAA
 
Protein sequence
MVKTKFNPEL IKKLQKAVVK NKLDAYFVTD YKDQLYLTGF KFYPQEAILL VTPKEVYCYT 
RDLYIIELGQ KIPLLKASAP LDYALAAAEQ AKKLKLKNVG FDAVKTYYNY GKTFEKFGYK
PSAFTPGELR EVKEKSELDT MRKANRIAYK TYEYIKKYIK TGMSEFEVAA EIERYMKSQG
ATALSFESTV CFGVNGTNTH HTPTKDKLKN EQAILLDFGC IYDNYCSDIS RSWWHGKKPT
AEYKKAWKAV DDARKAGIKA AKPGITGKEL DLVPRNVIEK AGFGKYFIHR TGHGIGMQAH
EDPNVEPQNN RKFVANNVIT IEPGIYYTGH FGIRIEDTVV VTPKGGVILT KK