Gene Elen_1079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1079 
Symbol 
ID8415369 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1304748 
End bp1306106 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content70% 
IMG OID645024042 
Producthistidinol dehydrogenase 
Protein accessionYP_003181439 
Protein GI257790833 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.156485 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000000192016 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGCCGCA TCATCTTGCA ACCTGGCGAG CAGTTCACCA ACGCCCACCT CAAGCGCACC 
GGCGCGTTCA ACGCCCAGGC CCTGACGGCG GCCACCGCCA TCATCGAGGG CGTGCGCGAG
CGCGGCGACG AGGCCCTGCG CGCCTACACC GAGCAGTTCG ACGGCGTGCG CGTGGAGGAG
TTCCGCGTGT CGCAGGCGGC TATCGCAGAA GCCATCGTGA ACGTCGACGA CAAGACGGCG
CGCGCCCTGC GTCAGGCCGC CGCACAGATC CGCGACTTCC ACGAGCGCCA GAAGCAGCAG
AGCTGGTTCA CCGTGCGCGA GGACGGCGCG CTCGTGGGCT CGAAGGTGGA GCCGCTGGAA
TCCGTGGGCA TCTACGTGCC GGGCGGGCGC GCGCTGTACC CGTCGTCGGT GCTGATGAAC
GCGCTGCCGG CCGCCGTGGC CGGCGTGAAG CGCATCGTAT GCGTGACGCC GCCGACGGCC
GACGGAACGC TGGATCCGGC CATTTTGGAG GCGTGCCGCA TCTCGGGCGT CACCGAAATC
TACGCGGTGG GCGGCGCGCA GGCCATCGCC GCGCTGGCGT ACGGCACCGA GTCCATCGCG
CCCGTGGCCA AGATCACCGG ACCCGGCAAC GCCTACGTGG CGGCGGCGAA GAAGGTGGTG
TCGGGCGATG TGGGCATCGA CATGATCGCC GGCCCGTCCG AGGTGTGCGT CGTGGCCGAC
TCCACGGCCG ATCCGGCGCT CGTGGCCATC GACCTCATGG CGCAGGCCGA GCACGACCCG
CTGGCCGCCT GCTACCTGGT TACGTTCGAT GCGGCCTACG CCGACGAGGT GGAGCGCATG
GTTGAGCGCC ACCTCAAGTC GTCCACACGC GCCGAGATCA CGGCGGCATC GCTGGCCGAC
CAGGGCCTCA TCGTCGTGTG CGACAGCATG CCGCAAGCCA TCGAGGCCGT GAACGCCATC
GCGCCCGAAC ACCTCGAGCT GCACGTCGAC CACGCCTTCG ACCTCTTGGG CGCCATCCGC
AACGCGGGCG CCATCTTCCT GGGCGCCTGG ACGCCCGAGG CCGTGGGCGA CTACGCCGCC
GGCCCGAACC ACACGCTGCC CACGGGCGGC ACGGCGCGCT ACGCCTCGCC GTTGTCGGTG
GACGAGTTCG TGAAGAAGTC GAGCGTCATC CAGTATTCGT CGCAGGCGCT CGCCCGCGAT
GCCGACATGG TAACCACCAT CGCGCGCCAC GAGGGCCTGT GGGCGCACGC CATGAGCGTG
GAGATGCGCA AGAACCTGCT CGACACGGGC AACGTGTATG GGATCGAGGG CGGCGACGGA
GCGGGCCGCG CGGCCGGAGA CGGGGGTGCC GATGCGTAG
 
Protein sequence
MRRIILQPGE QFTNAHLKRT GAFNAQALTA ATAIIEGVRE RGDEALRAYT EQFDGVRVEE 
FRVSQAAIAE AIVNVDDKTA RALRQAAAQI RDFHERQKQQ SWFTVREDGA LVGSKVEPLE
SVGIYVPGGR ALYPSSVLMN ALPAAVAGVK RIVCVTPPTA DGTLDPAILE ACRISGVTEI
YAVGGAQAIA ALAYGTESIA PVAKITGPGN AYVAAAKKVV SGDVGIDMIA GPSEVCVVAD
STADPALVAI DLMAQAEHDP LAACYLVTFD AAYADEVERM VERHLKSSTR AEITAASLAD
QGLIVVCDSM PQAIEAVNAI APEHLELHVD HAFDLLGAIR NAGAIFLGAW TPEAVGDYAA
GPNHTLPTGG TARYASPLSV DEFVKKSSVI QYSSQALARD ADMVTTIARH EGLWAHAMSV
EMRKNLLDTG NVYGIEGGDG AGRAAGDGGA DA