Gene Information |
Locus tag | Elen_3094 |
Symbol | |
ID | 8417430 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 3598485 |
End bp | 3599945 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645026074 |
Product | regulatory protein GntR HTH |
Protein accession | YP_003183425 |
Protein GI | 257792819 |
COG category | [K] Transcription |
COG ID | [COG2186] Transcriptional regulators |
TIGRFAM ID | |
|
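The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only and do not come from the source database.

```python
# Values copied from the Gene Information table.
start_bp = 3598485
end_bp = 3599945

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that yields no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1461
print(protein_length)  # 486
```

Under these assumptions both results agree with the listed Gene Length (1461 bp) and Protein Length (486 aa).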
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.0435211 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGAAAAGA AAGCCACCTT ATTCGAGTAC GTGTATCGTC AGCTTGTCAG CGACATCGAG CAGGGAGCGC TGCGCTATGG CGACGCGTTG CCGTCGCTGC ACGACTTGTG CGATCGATTT CGGGTGGGCA TCAGAACCAT CCGCGACGTG CAGCGTGCTT TGAAGGCCGA CGGGTACATT GTCGTGGAGG AGCGAAAGCG CGCTGTCGTG GCGTACCGTC CGGCCGATGA CGCGGATGAC GGTCGCATCC GTGCGCTGCT GGCGCGTCGC GAGGTGGTTG CCGATTGCTA CAGAACCCTC GAACTGGTCA TGCCCCCGTT GTTCCACTTA GCCGCTCGAT GCTGCTCTGA CGAGGATCTT TTCGCTTTGG CTCAGGATGC GAAGCGCGTT GATCGCTCTG GCGTATCGGA AGGGTGGCGC ATATCGCACG CATCGATTGT TTTGCACGGC CTGGTGGCAA AGGCAGGCAA CCCGTTGTTC ACGTCGTGCT TCGCAAGCTT GGAGCGCATC GGTCTGGTGC CGGTGGTTCC GGGTTTCGAA AGCCCTTTTG CCTCGCGTGC GGTCGACGTC GACGGGGGTT TGATGACGTG GATGTTCTCC TCTCTGCTTT TGCGCGATGC TGATGAGGTG CAGTATCGGT TCGGTTGCAT GTATCGCGGC GTCGCGCAGC GCGTCGGGGC GTATTTCGAC GCGCTCGAGA ACGCATATGC CCCCGCCGTG GACATCGGAT CGTTCGGGTA CGCGTGGAAT GCGAAGGCCG GGTTGGAATT CGTGCACGGA CAAATAGCTC GCAATCTTGT GGAACGCATC GTTCGGGGGG AATTCGTTGA CGGCCAGCTG CTTCCTTCCA TCGCCGAGCT GTCGGCGCAG TATGGAGTCT CCGCTTCCAC GGTGCAGAAA GCGTACGGCG CGCTCAACGT CATAGGCGTC GCGCGCACCG TCAACGGGTT GGGCACGCGC GTGCAGTTGG GCAATGCGAC GTTCAGCGAG CGCTGGCTGG AAGATCACTC GTTCAAGAGA GATGTGGGCA CGTACGTCCA TGCGGCGCAG ATGATGTGCG CCGTGCTGCC TGCTGCATTG AGCCGCGTGC AAGGTCATGC GGAGGAGGTG GCCTCTTCTG CGGAACGCGC CCTCGCGGTC GAGGAGGGGG ATTGGGCGGT GTCGAAGGCT CTCATAACGG GTTTGATCGA ATGCACGGAG CCTTGCGCAT TGCAGACGAT TCTGCGGGAG CTGAACGATT TGCTGCGCTG GGGAGCCTTC CTTACGCTGT TCGCGGCTTC GCGAGAAAGC GCGTCCGCTC TCTCGTCTCT CGAAAGCATC GCTCTCGGGC AGGCGCGCAG AGCCGACGAC GAGGGGTTCT CCCAATCGAT GACCGCGTAT TACCGCCTTA TGCTCCAGTC CGTGCTCGCG TTTCTCGAGA GGGCGGGTAT GATTGACGTC GACCGCACGA TCGTGCCCTA G
Protein sequence | MEKKATLFEY VYRQLVSDIE QGALRYGDAL PSLHDLCDRF RVGIRTIRDV QRALKADGYI VVEERKRAVV AYRPADDADD GRIRALLARR EVVADCYRTL ELVMPPLFHL AARCCSDEDL FALAQDAKRV DRSGVSEGWR ISHASIVLHG LVAKAGNPLF TSCFASLERI GLVPVVPGFE SPFASRAVDV DGGLMTWMFS SLLLRDADEV QYRFGCMYRG VAQRVGAYFD ALENAYAPAV DIGSFGYAWN AKAGLEFVHG QIARNLVERI VRGEFVDGQL LPSIAELSAQ YGVSASTVQK AYGALNVIGV ARTVNGLGTR VQLGNATFSE RWLEDHSFKR DVGTYVHAAQ MMCAVLPAAL SRVQGHAEEV ASSAERALAV EEGDWAVSKA LITGLIECTE PCALQTILRE LNDLLRWGAF LTLFAASRES ASALSSLESI ALGQARRADD EGFSQSMTAY YRLMLQSVLA FLERAGMIDV DRTIVP
| |
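The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, in which TTG is one of several alternative start codons and an initiating codon is read as methionine. The sketch below is a minimal, hand-written translator, not the database's own tooling. The codon table and start-codon set are written out from the standard layout of NCBI translation table 11, and the function name `translate_cds` and its `has_start` parameter are hypothetical. It translates only the first 30 and last 21 bases of the gene sequence listed above.

```python
# Codon table in the standard NCBI ordering (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Start codons of translation table 11 (written out by hand here).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna, has_start=True):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    If has_start is True and the first codon is a table 11 start codon,
    it is emitted as M regardless of its usual meaning.
    """
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and has_start and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 21 bases of the gene sequence in this record.
first_30 = "TTGGAAAAGA AAGCCACCTT ATTCGAGTAC"
last_21 = "GACCGCACGA TCGTGCCCTA G"

print(translate_cds(first_30))                  # MEKKATLFEY
print(translate_cds(last_21, has_start=False))  # DRTIVP
```

Both fragments match the corresponding ends of the listed protein sequence. The final codon TAG is a stop codon, which is why the 1461 bp gene yields 486 residues.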