Gene Emin_1472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1472 
Symbol 
ID6263916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1569019 
End bp1569975 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content42% 
IMG OID642611957 
Productdihydrouridine synthase DuS 
Protein accessionYP_001876357 
Protein GI187251875 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGCTT TCGTTAAAAA AATAACAATA GGTTCTTTCG CGGCAAAAAA TAACCTTATG 
CTTGCCCCGA TGGCCGGTAT TACCGACACG CCTTTTAGAA TACACTGTTT AAATAACGGC
GCGGGCATAG TTTGCGCGGA AATGGTGTCG GCCAAGGCGG TTGAGTATGA TAATAAAAAA
AGCGTAAAAA TGTTAGCGGT TGATAAAAAG GAGCATCCAG TTTCCATGCA GATTTTCGGA
GGGGATGCGG AAAGCATTTC CATAGCGGCC AAAGCGGCCG AGGCCGCGGG CGCTGATATT
ATTGATATTA ACGCGGGGTG TCCCGTAAAA AAAATAAACA GAGCGGGCGC GGGCTGCGTT
TTAATTAAAG ATGAAAAATT GTTAGCTTCA ATAGTAAACG CCGCTGTTAA TTCCGTGAGT
ATTCCGGTAA CTTTAAAAAC AAGAATAGGT CTTACCGCTG GCGATTTTAA AGGTGATAAA
ATTGCCAAAC TGGCTGAAAA CGAAGGCGCG GCGGCTGTTA TTATGCATGC GCGTTACGCC
GGCAATGTGC ATGGCGGCCC GGCTGATTTA GAGGCTCTTG CCAAAGTCGT TTCGGCCGTT
AAAATACCCG TTATAGGTAA CGGAGGTATC GTTGATGTTA ATACAGCTGA TAAAATGTTT
GAAACCGGCG TGCGCGGCAT AATGGTGGGG CGCGGAGCTA TAGGCAATAT TAATATTTTT
AAAAGCATAA TTAACGGTTG TGACATAGAG TTAAATCCTA AAGAAAACGT TAAAATATTT
TTTAATCTGA TTAAACAAAA CGTTAATTTT TACGGTGAGA AAAACGGTAT TGCCAGGAGT
AGGAAAACCG TGGGTTTTTG GATAAAAGGG TTTCCGATGG CGGGGGAAAT AAGAGGCGAG
TTTGTAAAAT TAAATACATT AGCCGCAGTG CAAAAACTTT TTGGGGAATA TTTATGA
 
Protein sequence
MNAFVKKITI GSFAAKNNLM LAPMAGITDT PFRIHCLNNG AGIVCAEMVS AKAVEYDNKK 
SVKMLAVDKK EHPVSMQIFG GDAESISIAA KAAEAAGADI IDINAGCPVK KINRAGAGCV
LIKDEKLLAS IVNAAVNSVS IPVTLKTRIG LTAGDFKGDK IAKLAENEGA AAVIMHARYA
GNVHGGPADL EALAKVVSAV KIPVIGNGGI VDVNTADKMF ETGVRGIMVG RGAIGNINIF
KSIINGCDIE LNPKENVKIF FNLIKQNVNF YGEKNGIARS RKTVGFWIKG FPMAGEIRGE
FVKLNTLAAV QKLFGEYL