Gene Information |
Locus tag | Elen_1940 |
Symbol | |
ID | 8416247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2274557 |
End bp | 2275993 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645024913 |
Product | histidine kinase |
Protein accession | YP_003182293 |
Protein GI | 257791687 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
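The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the gene length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2274557
END_BP = 2275993


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


gene_len = span_length(START_BP, END_BP)
print(gene_len)                  # compare with "Gene Length"
print(protein_length(gene_len))  # compare with "Protein Length"
```

The two printed values can be compared against the Gene Length and Protein Length rows of the record.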
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.208818 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0947013 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCCGCG CCCCTCGCAC ATCTGCTCGA GACCGTCGCA CAAGGCGACC GGCGGGCTTC GCGCGCTTCT TCACCAAGCA GCTGCTGCTG TTCGTGGCGC TGGCGCTGCT CATCGTGGTC ATCGACTTCT TCCTGTACGC CGTCATCGCC TACCGGGAAT CGAACTCGAA CTTCAACGAC GGCACGCCCG CGTCCACGAC GCGCGCAGTC GACCAAGCTC TCGAGCAGGA TGCCGACGGC TCATGGACGC TTGGGGAGAA CGGCCTGGAG GCGCTCGGCC AGCAGGACGC CTGGGCGCTC GTCATCGGGA CGGACGGCGC GGTGGCCTGG TCGCAGGACA AGCCGGAGGA CGTGCCCGAC CGGTTCAGCG TGAACGACGT GGCGATGGCA GCGCATTATG CGGCCGTCGC CGACTATCCG GCGTTTTTCT GGGATCGGGA CGACGGCCTG CTGGTCGTGG GTTTCCCGAA GAACGAGTTC TGGACGATGA CGCTCACCTA CCCCGCGTCG ACCGTGCGCA ACTTCCCGCT GTACGTGCTG CTGATATTCG CAGTCGACCT CGGGATCTTG TACACCATCT ACGCGGTGTC GCGTCGCCGG ACGCAAAACG CCGTGGCTCC CATCGCCGAA GCGCTCGACG CGCTGTCTGA CGGGCGCGCG GCCGAGCTGC ACCTCAAAGG GGATCTGCGC GACATCGGCG ACCAGATCAC CGAGACGAGC GCCGTCATCG AGCAGAAGGA CGCCGCGCGC GCAAGCTGGA TTCGCGGCAT CTCCCACGAC ATCCGCACGC CGCTGTCGAT GATTCTCGGC TACGCCGACG CGCTCGTGCA GGACGAGGGC GCAGCCGAAG AGGCGCGCGC AAGCGCGCGG GTCATCAGAG CGCAGGGGCT CAAGATCAAG GACCTCGTCA CCGATCTCAA CACCGCCTCG CAGCTGGACT ACGACATGCA GCCGATGCGC CTGGAACGCG TGCATGCGGC GCGCCTGCTG CGCACCGTGG CGGCCGCGCA TGCGAACAGC GGGCTGGACG AAGCGCATCC TATCGAGCTC GACATCGCAG AGGACGCGCT GAACGCGGTG GTGCTGGGGG ACGAGAGGCT GCTCACGCGC GCGGTGGAGA ACGCCATCTC CAACGCGCGG CTGCACAACG AGCAGGGATG CACGATCAGC GTCGAACTGG CGCTGCGAGA CAACGCGTAC TGCACGATCC GCGTGAGCGA CGACGGTGCT GGAATCGCGG CTGCCGACCT CGCCGCGCTC GAAGCGCGCC TCGCGCGCTC GCGCACGGCG CGCAGCGCGG CTGGGTCGTT CAACCGCGAT CACGGCCTGG GGCTCGTTCT GGTCGACCGC ATCGCCCGCG CGCACGAGGG ATCGCTCTCC CTCGACGGCG CGCCGGGCGA AGGCTTCTCC GTGACCCTCG CCCTCCCGCT GGCGTAG
|
Protein sequence | MGRAPRTSAR DRRTRRPAGF ARFFTKQLLL FVALALLIVV IDFFLYAVIA YRESNSNFND GTPASTTRAV DQALEQDADG SWTLGENGLE ALGQQDAWAL VIGTDGAVAW SQDKPEDVPD RFSVNDVAMA AHYAAVADYP AFFWDRDDGL LVVGFPKNEF WTMTLTYPAS TVRNFPLYVL LIFAVDLGIL YTIYAVSRRR TQNAVAPIAE ALDALSDGRA AELHLKGDLR DIGDQITETS AVIEQKDAAR ASWIRGISHD IRTPLSMILG YADALVQDEG AAEEARASAR VIRAQGLKIK DLVTDLNTAS QLDYDMQPMR LERVHAARLL RTVAAAHANS GLDEAHPIEL DIAEDALNAV VLGDERLLTR AVENAISNAR LHNEQGCTIS VELALRDNAY CTIRVSDDGA GIAAADLAAL EARLARSRTA RSAAGSFNRD HGLGLVLVDR IARAHEGSLS LDGAPGEGFS VTLALPLA
|
| |
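The sequence fields can be cross-checked against each other with a short sketch like the one below. It assumes the sequences are stored in space-separated blocks as shown above, and it uses the standard codon assignments, which translation table 11 shares (table 11 differs only in the set of permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 bases of the gene are used for illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGGGCCGCG CCCCTCGCAC ATCTGCTCGA"
print(translate(prefix))    # MGRAPRTSAR, the first block of the protein sequence
print(gc_fraction(prefix))  # GC fraction of this 30-base prefix only
```

Running the same functions on the full gene sequence would allow a comparison with the Protein sequence and GC content fields; the prefix is used here only to keep the example short.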