Gene Elen_0728 details


Gene Information

Locus tag: Elen_0728
Symbol:
ID: 8415018
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 915671
End bp: 917245
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 69%
IMG OID: 645023699
Product: phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_003181096
Protein GI: 257790490
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
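
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(915671, 917245)
print(length)                     # 1575
print(protein_length_aa(length))  # 524
```

Both values agree with the record: 1575 bp and 524 aa.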


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 69
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAATC CTAAGGTCAA GCGCGTGCTC GTGTCCGTTA CGGACAAGAG CGGCGTTGCG 
GACTTCGCCC GCGCGCTCGT CGACGAGTTC GGGGCGGAAA TCATCTCGAC GGGCGGCACC
GCCCGCGCGC TCAAGGACGC CGGCGTGCCG GTTACGCCCA TCGACGACGT GACGCAGTTC
CCCGAGATGA TGGACGGTCG CGTGAAGACG CTGCACCCGC GCGTGCACGG CGGTCTGCTG
GCCAAGCGCG ACAACGAGGC CCACATGGCC CAAGCGGCCG AGCACGGCAT CGAGATGATC
GACATGGTGG TGGTGAACCT CTACGCCTTC GAGAAGACGG TGGAAAGCGG CGCCGATTTC
GGCACCTGCA TCGAGAACAT CGACATCGGC GGCCCGTCCA TGCTGCGCTC CGCGGCCAAG
AACTTCGAGA GCGTGGCCGT GGTCACGCGC CCGGCGAGCT ACGACGCTAT CCTGGCCGAG
ATGCGCGCCA ACGACGGCGC CACCCTGCGC GACACGCGCG CCAAGCTGGC GCTCGACGTG
TTCGAGACCA CGGCGGCTTA CGACGGCGCC ATCGCCGCGT GGATGGGCGC CCAGCTCAAG
GACGAGGGCG ACGTGAAGTT CCCCGCCGAC CGCACGCTGC ATCTCTCGAA GGTGCAAGAC
CTGCGCTACG GCGAGAACCC GCACCAGTCC GCCGCGTTCT ACCGTCGCGA CGACTACGCC
GACGCCCCGC ACAGCCTGGC CCATGCCAAG CAGCATCAGG GCAAGGAGCT GTCGTACAAC
AACTACCTCG ACCTCGACGC GGCTTGGACG GCCGTTCGCG AGTTCGACGA GCCGGCCTGC
GTCATCGTCA AGCACCTCAC GCCCTGCGGC GTGTGCCAGA ACGACGACCT CGTCGAGGCC
TACCAGCGCG CGCACGCGTG CGACCCGGTG AGCGCCTACG GCGGCGTCAT GGCGTTCAAC
CGCCCCGTCA CCTCCGACGT GGTGGTGGCC ATCTTCGACA ACAAGCAGTT CGTCGAGGCC
ATCATCGCTC CCGAGTTCGC GGGCGACGCG CTTGACATGT ACAGCGCGAA GAAGAACGCG
CGCCTGCTGT CCACGGGCGG CGTGAACCCG GCCGGCGGGG AAGTGGAGTA CCGCTCGGTC
GAGGGCGGCC TGCTGGCCCA GGATTCCGAC GCCGTGGCCG AGGATCCCGC GACGTTCACG
GTTCCCACGA AGCGCCAGCC CAGCGAGGAA GAGCTCGCCG AGCTGCTGTT CGCGTGGAAG
GTGTGCAAGT CCATCAAGTC CAACGCCATC GCCATCACGA AGGGCCACGC GACCATCGGC
GTGGGCGGCG GCCAGCCGAA CCGCGTGAAC TCCGCGCGCA TTGCCGTGGA GCAGGCGGGC
GAGGAGGCCA AGGGCGCCGT GGCCGCCTCC GACGCGTTCT TCCCGTTCCG CGACGGCCTC
GACGCGCTGG CCGAGGCCGG CGTGACGGCC ATCATCGAGC CGGGCGGCTC CATCCGTGAC
GAAGAGGTGA TCGCCGCCGC CGACGAGCAC GGCATCGCGC TCGTCTTCAC CGGCCACCGC
CACTTCAGGC ACTAG
 
Protein sequence
MSNPKVKRVL VSVTDKSGVA DFARALVDEF GAEIISTGGT ARALKDAGVP VTPIDDVTQF 
PEMMDGRVKT LHPRVHGGLL AKRDNEAHMA QAAEHGIEMI DMVVVNLYAF EKTVESGADF
GTCIENIDIG GPSMLRSAAK NFESVAVVTR PASYDAILAE MRANDGATLR DTRAKLALDV
FETTAAYDGA IAAWMGAQLK DEGDVKFPAD RTLHLSKVQD LRYGENPHQS AAFYRRDDYA
DAPHSLAHAK QHQGKELSYN NYLDLDAAWT AVREFDEPAC VIVKHLTPCG VCQNDDLVEA
YQRAHACDPV SAYGGVMAFN RPVTSDVVVA IFDNKQFVEA IIAPEFAGDA LDMYSAKKNA
RLLSTGGVNP AGGEVEYRSV EGGLLAQDSD AVAEDPATFT VPTKRQPSEE ELAELLFAWK
VCKSIKSNAI AITKGHATIG VGGGQPNRVN SARIAVEQAG EEAKGAVAAS DAFFPFRDGL
DALAEAGVTA IIEPGGSIRD EEVIAAADEH GIALVFTGHR HFRH
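
The sequences above are printed in blocks of 10 characters, so the spacing has to be removed before any analysis. The following is a minimal sketch, with hypothetical helper names, that joins the blocks, computes the GC fraction (for comparison with the reported GC content of 69%), and checks that a gene sequence is in frame and ends with one of the stop codons of translation table 11 (TAA, TAG, TGA).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def is_in_frame_with_stop(blocked: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    seq = clean_sequence(blocked)
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# The final 15 bases of the gene sequence above: CAC TTC AGG CAC TAG
tail = "CACTTCAGGC ACTAG"
print(is_in_frame_with_stop(tail))  # True
```

To check the whole gene, paste the full gene sequence (all lines) into a string and pass it to `gc_fraction` and `is_in_frame_with_stop`.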