Gene Emin_0523 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0523 
Symbol 
ID6262650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp572537 
End bp574090 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content44% 
IMG OID642610993 
Productphosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001875415 
Protein GI187250933 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.038315 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.38418 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTCAGG AACGCAAAAT TAAAAGGGCT TTAATTTCAG TATCAGATAA AACAGGGCTT 
GAGGTTTTTG CTAAAGGCCT TCATAAATTA GGCGTTGAAC TTGTTTCAAC GAGCGGCACG
GCTAAATTTT TAAAAGCCGC GGGCCTTCCC GTTCGTGATT TAAGCGACCT TACCGGATTT
CCCGAAATAT TAGACGGACG TGTTAAAACC CTTCATCCCC GTGTTCACGG CGCTATTTTA
TATAAACGGG ACGATGATGC CCATTGCAAG GTAATTAAAG ATATGGGCAT TGAAGATATT
GATATGCTTG TTGTTAATCT CTATCCTTTT AGGGAAACCG CCGCTAAAGC CAAACACAGT
TTTGACGCCG AGGTTATTGA AAATATTGAT ATAGGCGGGC CTTCAATGTT AAGAAGCGCG
GCTAAAAATT TTGCCCACGT GGCGGTACTT TGCCGTCCCA AAGATTATGA AGTAGTTTTA
AGTGAGATGG CGGCTTCTCA GGGGGCGCTT TCTTACGCTA CAAGGCAGCG TCTTTGCGTG
GAAGCTTTTA CCCATACGGC TGAGTATGAC GCCGCCATAA GCGAAGAATT TAAAAAAGGC
TTAAATCATG AATTTCCCGA AAGTAAAATT GTTGTTTTAC ATAAAACACA GGATTTGAGA
TACGGCGAAA ATCCGCACCA GAAAGCGGTC CTTTATTCAC AAAAAAAAGA TTTTTCTTTC
GAACAATTGC ATGGCAAAGA ACTTTCTTAT AATAATATTT TAGACGCTTT TGGCACTTGG
GACGCTGTTT GCGATTTTGA TTTGCCCGCC TGCGTTATAT TTAAACACGT TACGCCTTGC
GGCATAGGAA CGGGGAAAGT TTTAACCGAA GCCTTTAATA ACGCCTGGGC GTGTGATCCT
AAATCCGCCT TTGGCGGTAT TATAGCTTTA AATAAACCTA TGCAGCGTGA TATAGCAGAG
GCTATAAGCA AAGTGTTTAT TGAGGCTGTC TGCGCGCCAG ATTACGACTT GGAATCTTTG
GAAATTTTAA AACAGAAAAA GAATATCCGC ATATTAAAAA GAAACTCACC TTTAAGCGCG
GCTTACCAGC TTAAATCGGT TGGCGACGAA GTGCTTTTAC AGCAGCCCGA CAGAACGCTT
CTTCTTGACA ACAAATGGGA CTGCGTTACA AAACGTAAAC CCACCGAGGA AGAGGATAAG
GCTTTAAAAT TCGCCTGGGC CTCGGTAAAA CACGTTAAAT CAAACGCTGT TATTTTAACT
TCCGAAAGCG CCTCCGTAGG TATCGGCGCG GGGCAGATGA GCAGGGTGGA CAGCGTTAAA
ATGGCCGGTA TGAAATTTGA AGAATATTTG CAGGAAAATA AGAAGCCTAA AGTTTTAGTA
ATAGGGTCGG ACGCGTTTTT CCCATTCCGG GACGGCGTTG ACGCGGCCGC CAAACTTGGA
GTAAGCGCAA TAGTTCAGCC CGGCGGATCA GTACGGGACG AGGAAGCCAT TGCCGCTGCC
GATGAGCACG GCATAGCCAT GATTTTTACT GGCTTAAGGC ACTTTAGACA TTAA
 
Protein sequence
MTQERKIKRA LISVSDKTGL EVFAKGLHKL GVELVSTSGT AKFLKAAGLP VRDLSDLTGF 
PEILDGRVKT LHPRVHGAIL YKRDDDAHCK VIKDMGIEDI DMLVVNLYPF RETAAKAKHS
FDAEVIENID IGGPSMLRSA AKNFAHVAVL CRPKDYEVVL SEMAASQGAL SYATRQRLCV
EAFTHTAEYD AAISEEFKKG LNHEFPESKI VVLHKTQDLR YGENPHQKAV LYSQKKDFSF
EQLHGKELSY NNILDAFGTW DAVCDFDLPA CVIFKHVTPC GIGTGKVLTE AFNNAWACDP
KSAFGGIIAL NKPMQRDIAE AISKVFIEAV CAPDYDLESL EILKQKKNIR ILKRNSPLSA
AYQLKSVGDE VLLQQPDRTL LLDNKWDCVT KRKPTEEEDK ALKFAWASVK HVKSNAVILT
SESASVGIGA GQMSRVDSVK MAGMKFEEYL QENKKPKVLV IGSDAFFPFR DGVDAAAKLG
VSAIVQPGGS VRDEEAIAAA DEHGIAMIFT GLRHFRH