Gene Information |
Locus tag | Elen_1079 |
Symbol | |
ID | 8415369 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1304748 |
End bp | 1306106 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645024042 |
Product | histidinol dehydrogenase |
Protein accession | YP_003181439 |
Protein GI | 257790833 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0141] Histidinol dehydrogenase |
TIGRFAM ID | [TIGR00069] histidinol dehydrogenase |
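The coordinate and length fields above agree with one another if the start and end positions are read as 1-based and inclusive, and if the final codon of the CDS is a stop codon that contributes no residue. Both readings are assumptions about the record's conventions rather than something the record states. The helper names below are hypothetical; a minimal sketch of the arithmetic:

```python
# Values copied from the record above.
START_BP = 1304748
END_BP = 1306106


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1359, matching "Gene Length"
print(protein_length(length_bp))  # 452, matching "Protein Length"
```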
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.156485 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0000000000192016 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGCGCCGCA TCATCTTGCA ACCTGGCGAG CAGTTCACCA ACGCCCACCT CAAGCGCACC GGCGCGTTCA ACGCCCAGGC CCTGACGGCG GCCACCGCCA TCATCGAGGG CGTGCGCGAG CGCGGCGACG AGGCCCTGCG CGCCTACACC GAGCAGTTCG ACGGCGTGCG CGTGGAGGAG TTCCGCGTGT CGCAGGCGGC TATCGCAGAA GCCATCGTGA ACGTCGACGA CAAGACGGCG CGCGCCCTGC GTCAGGCCGC CGCACAGATC CGCGACTTCC ACGAGCGCCA GAAGCAGCAG AGCTGGTTCA CCGTGCGCGA GGACGGCGCG CTCGTGGGCT CGAAGGTGGA GCCGCTGGAA TCCGTGGGCA TCTACGTGCC GGGCGGGCGC GCGCTGTACC CGTCGTCGGT GCTGATGAAC GCGCTGCCGG CCGCCGTGGC CGGCGTGAAG CGCATCGTAT GCGTGACGCC GCCGACGGCC GACGGAACGC TGGATCCGGC CATTTTGGAG GCGTGCCGCA TCTCGGGCGT CACCGAAATC TACGCGGTGG GCGGCGCGCA GGCCATCGCC GCGCTGGCGT ACGGCACCGA GTCCATCGCG CCCGTGGCCA AGATCACCGG ACCCGGCAAC GCCTACGTGG CGGCGGCGAA GAAGGTGGTG TCGGGCGATG TGGGCATCGA CATGATCGCC GGCCCGTCCG AGGTGTGCGT CGTGGCCGAC TCCACGGCCG ATCCGGCGCT CGTGGCCATC GACCTCATGG CGCAGGCCGA GCACGACCCG CTGGCCGCCT GCTACCTGGT TACGTTCGAT GCGGCCTACG CCGACGAGGT GGAGCGCATG GTTGAGCGCC ACCTCAAGTC GTCCACACGC GCCGAGATCA CGGCGGCATC GCTGGCCGAC CAGGGCCTCA TCGTCGTGTG CGACAGCATG CCGCAAGCCA TCGAGGCCGT GAACGCCATC GCGCCCGAAC ACCTCGAGCT GCACGTCGAC CACGCCTTCG ACCTCTTGGG CGCCATCCGC AACGCGGGCG CCATCTTCCT GGGCGCCTGG ACGCCCGAGG CCGTGGGCGA CTACGCCGCC GGCCCGAACC ACACGCTGCC CACGGGCGGC ACGGCGCGCT ACGCCTCGCC GTTGTCGGTG GACGAGTTCG TGAAGAAGTC GAGCGTCATC CAGTATTCGT CGCAGGCGCT CGCCCGCGAT GCCGACATGG TAACCACCAT CGCGCGCCAC GAGGGCCTGT GGGCGCACGC CATGAGCGTG GAGATGCGCA AGAACCTGCT CGACACGGGC AACGTGTATG GGATCGAGGG CGGCGACGGA GCGGGCCGCG CGGCCGGAGA CGGGGGTGCC GATGCGTAG
Protein sequence | MRRIILQPGE QFTNAHLKRT GAFNAQALTA ATAIIEGVRE RGDEALRAYT EQFDGVRVEE FRVSQAAIAE AIVNVDDKTA RALRQAAAQI RDFHERQKQQ SWFTVREDGA LVGSKVEPLE SVGIYVPGGR ALYPSSVLMN ALPAAVAGVK RIVCVTPPTA DGTLDPAILE ACRISGVTEI YAVGGAQAIA ALAYGTESIA PVAKITGPGN AYVAAAKKVV SGDVGIDMIA GPSEVCVVAD STADPALVAI DLMAQAEHDP LAACYLVTFD AAYADEVERM VERHLKSSTR AEITAASLAD QGLIVVCDSM PQAIEAVNAI APEHLELHVD HAFDLLGAIR NAGAIFLGAW TPEAVGDYAA GPNHTLPTGG TARYASPLSV DEFVKKSSVI QYSSQALARD ADMVTTIARH EGLWAHAMSV EMRKNLLDTG NVYGIEGGDG AGRAAGDGGA DA
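Both sequences are displayed in space-separated blocks of 10 characters, so the whitespace has to be removed before any analysis. The sketch below shows one way to do that, then computes a GC fraction and translates a nucleotide string. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two differ in their permitted start codons). The function names are hypothetical, and only short excerpts of the gene sequence are used here.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces between 10-character blocks and uppercase."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Assumes the sequence starts in frame; alternative start codons
    (for example GTG or TTG read as Met) are not handled.
    """
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last nine bases of the gene sequence above.
head = clean("ATGCGCCGCA TCATCTTGCA ACCTGGCGAG")
tail = clean("GATGCGTAG")

print(translate(head))    # MRRIILQPGE, the first block of the protein sequence
print(translate(tail))    # DA, the last two residues before the stop codon
print(gc_fraction(head))  # GC fraction of this 30-base excerpt only
```

Applying `clean` to the full gene sequence and passing the result to `translate` should reproduce the protein sequence in the record, provided the record was produced with the same codon assignments.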