Gene Information |
Locus tag | Elen_0001 |
Symbol | |
ID | 8417460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 67 |
End bp | 1182 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 645022973 |
Product | transcriptional regulator, TrmB |
Protein accession | YP_003180385 |
Protein GI | 257789779 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
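The length fields in the table above are consistent with the coordinates. A minimal sketch of the arithmetic follows, assuming (this is not stated in the record) that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information table.
start_bp = 67
end_bp = 1182

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1116, matching "Gene Length"
print(protein_length)  # 371, matching "Protein Length"
```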
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000656108 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.000000126577 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGGACAACC AGATTGAAAA AGCAGATCAT CAGGATGAAA CCGCTGATGA AAACGTGGAG CCTTTCGACT ACACGCATGT ACACGCCGTA GCGCGCATCG CGCTGTATGA CGATCTTCGC AGCGCCCCGC GCGTGACGGA GATACACCCG GCGCCGACGG CTGAATTCAT TGAAAGCCTT GCTTCGAAAA TTTATGAACA GGCTAAAAAC GCCGGAGGGA CCATTCCGTA CACCGTCATC CGCGAAGTAT CGGAAAACTT CATCCACGCG CGTTTCGCCG AGGCTACCGT GTCCATCCTG GACGAGGGCA ATACCATCCG TTTCGCCGAC CAGGGCCCGG GCATACCTTA TAAGGATCAA GCGCAGATCC CCGGGTTCAC ATCCGCCGTG GAGCCGATGA AACACTATAT TCGCGGCGTG GGGTCGGGCT TGCCCATCGT GAAAGAGTAC CTCGATTTCT CGCACGGCAC CATCACCATT GAGGACAACC TGGGAACAGG CGCCGTAGTG ACCATCAGCC TACGTGCGGG CGAGGCGACC GATATGCCGC CCGTCGACCA ATCGAGCGCG CTTCACCCTG CATCCGCGCA ACCATCGGCC ATGGAACCCG CTTACCCGAT GCACGAGGCG CCGCAGCAGC TGCAGCAGCA GATTCCTCCC CAGCAGCCGA TGCAGCAACC CGCCTACCCG GCGCAATACG GATACGCAAA CCCACCGTAT CCGCAGGAAG CCGCGCCCGC ACGACCGCCT TACGGTTACG AGCCCGAACC GCAGTACGCG CCCCCGCGCT ATGCGCAGAA TCCCTACGCG GCAGGCGCGC CCTACTATCC CCAGCACGGT GCCCCGGCTC ATCGCGCTCA GGGCATGGAC ATGCAAGCCC AGCATGCGAT GGCGCCGCTT ATCCCCCCGT TGTCGCAACG CGAGCGCGAC TTCCTGCCCA TCTTCCTGAG CGAAGGAGCC CTGGGAGTAA CGGACCTGTC GCGTTTGACC GGCGTGCCGC AATCAAGCAC GTACGTGGCG CTGTCGAAAT TAGAGGAAGC CGGGCTTATC GAGAAAACGG TCGGACAGAA GCGCATTCTG ACCGATCTGG GATACCACGT GGCAAATTCC CTATAA
|
Protein sequence | MDNQIEKADH QDETADENVE PFDYTHVHAV ARIALYDDLR SAPRVTEIHP APTAEFIESL ASKIYEQAKN AGGTIPYTVI REVSENFIHA RFAEATVSIL DEGNTIRFAD QGPGIPYKDQ AQIPGFTSAV EPMKHYIRGV GSGLPIVKEY LDFSHGTITI EDNLGTGAVV TISLRAGEAT DMPPVDQSSA LHPASAQPSA MEPAYPMHEA PQQLQQQIPP QQPMQQPAYP AQYGYANPPY PQEAAPARPP YGYEPEPQYA PPRYAQNPYA AGAPYYPQHG APAHRAQGMD MQAQHAMAPL IPPLSQRERD FLPIFLSEGA LGVTDLSRLT GVPQSSTYVA LSKLEEAGLI EKTVGQKRIL TDLGYHVANS L
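The relationship between the two sequences above can be checked with a short translation routine. The sketch below is illustrative only: the function name `translate` and the reduced start-codon set are assumptions made here, not part of the record. It uses the standard amino acid assignments, which translation table 11 shares, and treats an alternative start codon such as TTG as methionine when it is the first codon.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. Translation table 11
# uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a reduced set of start codons, enough for this record.
# Table 11 permits further alternative starts that are omitted here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a DNA string (spaces allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiator codon is read as methionine.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("TTGGACAACC AGATTGAAAA AGCAGATCAT"))  # MDNQIEKADH
```

The printed prefix agrees with the first block of the protein sequence. Passing the full gene sequence should reproduce the full protein, since the gene ends in the stop codon TAA.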