Gene Emin_0529 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0529 
Symbol 
ID6262717 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp578900 
End bp580126 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content39% 
IMG OID642610999 
Productpeptidase T 
Protein accessionYP_001875421 
Protein GI187250939 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2195] Di- and tripeptidases 
TIGRFAM ID[TIGR01882] peptidase T 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00086333 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.062594 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTTTA AACAAAATAT TTTAGAGAAA TTTTTGAGAT ATGTAAAAAC AGAAACCACT 
TCCGATACGG AATCATCATC AAAACCTTCA ACAAAAACCC AGCTTGAATT CGCGGCTGTA
CTTGCCAAAG AAATGGAAAC CCTGGGTATT AAAGATATAA AAATATCTAA AACAGGGCAT
TTAACCGGTT CCATACCCGC AAATAATGAC GCCAAAGCGC CTACAATAGG GTTTATAGCG
CATATAGATA CATCCCCCGA TTTTAACGGT AAAAACGTTA ATCCGCAAAT ACATAAAAAT
TACGCGGGCG GAGCTATTGT TATAAATAAA GATAAAAATA TGTCAATTTC GCCTGAAATG
GACAAAATTC TTAATGACGT AACAGGCCAC GACATTATAA CAACCGATGG AAATAGCCTT
TTAGGCGCGG ATGATAAAGC GGGTATAGCT ATTATAATGA CAATGGCCCA ATATTTAAAG
AATAATCCAT CCTTTAAACA CGGACCCGTA AAAATAGCTT TTACACCTGA TGAGGAAATA
GGCACGGGCA TTTTGGATTT TGATGTCGCG GACTTTAAGG CTGACTTCGC TTACACCGTT
GACGGCAGCG TTATGGGTGA AATAGAAAAC GGCAACTTTA ACGCCGATAA GTTTAAAATT
GAAATAACCG GCGTTAACTG CCACCCCGGC ACGGCTAAAG ACGTTATGGT CAACCCCGTG
AGAGTAGCGG CTGATTTAAT AAACCGCTGG CCTGAAAGCA AACTGCCTGA AACCACGGAA
GGAGAGGAAG GCTTTATACT TTTTAACACA TTAAAAGGGA ATATCGAAAA AACCGAAATA
GGCGGTATTA TAAGGGAGCA TGATTTAAAA AAACTTACGG ATTTAGAAGA CTCTCTTAAA
AAAATTATTG AAGATACTAA AGCTAAATTT AAAGGAGCGC AGATTAAGTT AACAATAAGC
GAGCAATACA GAAATATGAA AGACGTACTT GCAAAAAACC CCGAAGCCAT GAATAAACTT
TTAAGCGCTT TAGAAGATAT GGGTATTAAA TATAAAATAA GCCAAATAAG GGGCGGCACC
GACGGGGCCA GGCTTTCTTT TATGGGTTTG CCGACGCCAA ATATTTTTGC CGGCTATTCA
CAGCCGCACG GACCGTATGA ATGGGCTTCT TTAGACGCTA TGGCTATAGC TTGCAAGTTT
ATATTAAAGA TAGTCGAAGT AAAATAG
 
Protein sequence
MDFKQNILEK FLRYVKTETT SDTESSSKPS TKTQLEFAAV LAKEMETLGI KDIKISKTGH 
LTGSIPANND AKAPTIGFIA HIDTSPDFNG KNVNPQIHKN YAGGAIVINK DKNMSISPEM
DKILNDVTGH DIITTDGNSL LGADDKAGIA IIMTMAQYLK NNPSFKHGPV KIAFTPDEEI
GTGILDFDVA DFKADFAYTV DGSVMGEIEN GNFNADKFKI EITGVNCHPG TAKDVMVNPV
RVAADLINRW PESKLPETTE GEEGFILFNT LKGNIEKTEI GGIIREHDLK KLTDLEDSLK
KIIEDTKAKF KGAQIKLTIS EQYRNMKDVL AKNPEAMNKL LSALEDMGIK YKISQIRGGT
DGARLSFMGL PTPNIFAGYS QPHGPYEWAS LDAMAIACKF ILKIVEVK