Gene Emin_0190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0190 
Symbol 
ID6263993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp201417 
End bp202721 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content42% 
IMG OID642610654 
Producthypothetical protein 
Protein accessionYP_001875091 
Protein GI187250609 
COG category[S] Function unknown 
COG ID[COG3014] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAGA CCTGTATATT TCTGCTTATA ATTCTTGTTA TCTGCGGGTG CAGGTCTTCT 
TTTAATTTCA GACAGAACGT AAACAAGGAG ATTAACGAGG GTAATTACTC TTCTGCCACA
GCTAAAATTG AAGACCAAAA AAATAAGGTT TACAGGGAAA AAGATTCTTT AATTTATTAT
TTGGATTTAG GCACCGTGCA GCATGACGCC AAAAAACATG AAGAGAGTGA TAAAAATTTT
GACCTGGCCC AGCAAAGAAT TGATGAGCTT TTTACAACAA GCGTAAGCCA GTCGGTAGGA
ACGCTGGTAA AAAACGAACT TACGGCCGCC TATGAAGGGG CCGATTATGA AAGAGCCATG
ACCTATTTTT ACAGGGCTAT GAATTTTTTA GCCATGAACA ATTTGTCAGG TGCTTTGGTT
GAAGCCAGAA AAGCAGTTTT TTATTTGGAC AATTTAAGAA AAAATAAGCA TAAAGGGTAT
AATGATGATC CTTTTGTACA GTATTTCGCC AGTTTGTTAT TTGAAAGCGA GGGTAATTTA
TCCTCGGCAA GGATAGCCAG GGCAAACGCT TTTAACGCGT ATGAGCGTTT TGCGTCTTTT
TATAATGTGC CGAAGCCGGA TTTTACTGTC CCTTCTAACG CGGATAAAAT GGGGGAAATT
ATATTTGTCC ATTATAACGG GCATATACCT ATAATACGCT CCCAGACTAT TCAGATAGCG
TGGGACAGGG CGATGTCAAT GACTGTAGGT ACGGACGACC TTGCCAATGC GGATTCCTCC
GTGCAAAACG CTGTCGTTGC GGGCATTATG GGCAACGCCG TCACCATAGC TTACCCTGTT
TTAACACCTG TGCCGTTTAG TACGGCGGGG TCTTCGGTAA GAGTAGGCTC TGTTAAACAA
GATACCGTGC TTGTGCATAA TTTATCGGCA CTTGCTAAAG AAGAATTAGA TGAAAGAATG
CCTTCAATAA TGGCTAAAAT GGTAGCGAGA GCGGTTATAA AACAGATTAT AGCCACCCAG
GCAAGGCATG CCGCCACAAA AGCTACGGAT AATGAAAACT GGGGTATGAT AGCTGGTATG
ATGGTAAGCG CTTTTAACGC CGCTACCGAG CGCGCGGACA CAAGAATGTG GTTTACCCTG
CCTGGGGAAA TAAGAATGAG CAGAGTTTTT GTCGAACCCG GTTACCATAA AATAATTTTT
ACCGCCTATG ACTCCATGGG TAACGCCATA GAGGTTAAAG ATTTTGATAA TATAGAAATT
AAAGCAGGAG AAAGAATTTA TTTGCATCAC AGAACAGGCA AATAA
 
Protein sequence
MKKTCIFLLI ILVICGCRSS FNFRQNVNKE INEGNYSSAT AKIEDQKNKV YREKDSLIYY 
LDLGTVQHDA KKHEESDKNF DLAQQRIDEL FTTSVSQSVG TLVKNELTAA YEGADYERAM
TYFYRAMNFL AMNNLSGALV EARKAVFYLD NLRKNKHKGY NDDPFVQYFA SLLFESEGNL
SSARIARANA FNAYERFASF YNVPKPDFTV PSNADKMGEI IFVHYNGHIP IIRSQTIQIA
WDRAMSMTVG TDDLANADSS VQNAVVAGIM GNAVTIAYPV LTPVPFSTAG SSVRVGSVKQ
DTVLVHNLSA LAKEELDERM PSIMAKMVAR AVIKQIIATQ ARHAATKATD NENWGMIAGM
MVSAFNAATE RADTRMWFTL PGEIRMSRVF VEPGYHKIIF TAYDSMGNAI EVKDFDNIEI
KAGERIYLHH RTGK