Gene Information |
Locus tag | Elen_1010 |
Symbol | |
ID | 8415300 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1222448 |
End bp | 1224166 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645023974 |
Product | histidine kinase |
Protein accession | YP_003181371 |
Protein GI | 257790765 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
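The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in Protein Length; the function names `gene_length` and `protein_length` are illustrative, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the length is a whole number of codons and, if has_stop is True,
    that the final codon is a stop codon that contributes no residue.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


# Values taken from the record for Elen_1010.
length_bp = gene_length(1222448, 1224166)
print(length_bp)                  # 1719
print(protein_length(length_bp))  # 572
```

Under these assumptions the computed values agree with the Gene Length (1719 bp) and Protein Length (572 aa) fields.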
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.0817847 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGCCT CAGGGGGAAT CGCCATGTCG TTCAAACGAG GTATCCGCTT CAAGTTCGCG GTGTTCATCG GCGCCTTCAT CGTGGCCCTC ATGGCCGTCG ACGCGCTGTG GAACGTCCAG CTGCAACAGC AGCAGGCCGA GAACGAGGCG CGCGAGAAGG CCGAGGTGCT GGCCGACGAG ATGCACGCGA TGTGGGACTT CATCGACATC AACCAGAACA CCATCAACCG CACCGAGGAC GGCGCCTTCC GCACGAAGGC CCTCGTGTGC GTGGTCACCG CCAAGTCGGT GAGCACGCTG TTCACCATGA ACACCGACTA CAAGATCCAG TACACGAGCC CCACCCCGCG CCAGGCGGCG AACGCGCCCG ACGAGTTCGA GCAGCGGGCG TTCGAGGCCT TCGGCGCCGA TGCGGCCCTC GAGGCGTACT ACGACGTGGG CTACGACGCC GAGGGGCGGC GCGTGTTCCG CTACGCCGAG CCGCTGTACG TGACCGAGAC GTGCCTCGAA TGCCACGGCG AGCCCGTCGG CGAGCTCGAC CAGTTCGGCT ACGAGAAGGA GGGCATGCAG GTGGGCGACA TCGGCGGCGC CGTGTCCATC ACCGAGCCCA TGGACATCTA CGCCGACGGC ATGCGCACGA GCGTACTGCA GCAGGTGTTC ATGGTGCTGC TCGTGCTCGT GCTGGCCTGC GTGGGCATCT ACTTCGCCGT GAGCAAGCTC GTGCTGCACC CGCTCGACGC GCTCGGGCGC GCCGCGAAGC AGATCGGCGC GGGCGACTTC TCCTACCAGC TGGAGGCGCG CACCGTGGGC GGCCCCGACG AGCTCACCGA GTTCGCCGAC GACTTCGACA AGATGGCCCG CCAGCTGGAA CGGCTCTACA CCGACCTCGA AAGCGAGGTG CGCAGCCAAA CCGACAAGCT CTCGGCGCTC AACGACCTGC TGCTGTACCA GAAGGTCGAG CTCAAGAAGG CGCTCGACCG CCTCAGCGAG GAAACCGCCT ACAAAAACGA GTTCTTCGCC ATCATGAGCC ACGAGCTGCG CACGCCGCTC ACGTCCATCC TCGCGTTCGC GCGCATCCTG CGCGGGGTCG ACTCGCTCGA CGCCAAGACG CGCAGCGCCG TGGAGGAGAT CGAGGCGAAC GCCACGCTGC TGCTCAACAT GGTGAACAAC ATCCTGACCA TCTCGAAGGC CGAGGCGCAC AAGAACGAGC TGGTGGTGGA GCCGGTCGAC TTCGTGGACC TGCTGGGGTT CATCAGGAAG TCGCTCGAGC CCGTGGCGAA GAACAAGGGC ATCGCCCTGA CCGCGAAGAC CGACGCCGAC GTGCCCGTGT CGATGGCCGA CTGGGAGAAG CTGCGGCGCA TCGTCGAGAA CCTCGTGGAC AACGCCATCA AGTACACCCA CGTCGGCGGT CGCGTGGACG TGCGCGCGAC GTTCGACGGC GCCTGCATCG TCGTGTCCGT CGCCGACGAC GGCATGGGCA TCGACGAAGC CGACCAGGAG GGCATCTTCG AGCGCTACCG CCAGGCCGGC CAGTCGCCCA ACCGCCGCTA CCGCGGCACA GGCCTCGGCC TGGCCGTGGT GAAGGAGCTG GCCGAGCTGC ACGGGGGCAG CGTGTCGGTG GCGTCGGCCC GCAAGCTCGG CAGCACGTTC ACCGTGCGCA TCCCCTACGT TGCCGTGGAT ACGGAGGAAT ACGATGAAGA AGATCCTGCT GATCGATGA
|
Protein sequence | MRASGGIAMS FKRGIRFKFA VFIGAFIVAL MAVDALWNVQ LQQQQAENEA REKAEVLADE MHAMWDFIDI NQNTINRTED GAFRTKALVC VVTAKSVSTL FTMNTDYKIQ YTSPTPRQAA NAPDEFEQRA FEAFGADAAL EAYYDVGYDA EGRRVFRYAE PLYVTETCLE CHGEPVGELD QFGYEKEGMQ VGDIGGAVSI TEPMDIYADG MRTSVLQQVF MVLLVLVLAC VGIYFAVSKL VLHPLDALGR AAKQIGAGDF SYQLEARTVG GPDELTEFAD DFDKMARQLE RLYTDLESEV RSQTDKLSAL NDLLLYQKVE LKKALDRLSE ETAYKNEFFA IMSHELRTPL TSILAFARIL RGVDSLDAKT RSAVEEIEAN ATLLLNMVNN ILTISKAEAH KNELVVEPVD FVDLLGFIRK SLEPVAKNKG IALTAKTDAD VPVSMADWEK LRRIVENLVD NAIKYTHVGG RVDVRATFDG ACIVVSVADD GMGIDEADQE GIFERYRQAG QSPNRRYRGT GLGLAVVKEL AELHGGSVSV ASARKLGSTF TVRIPYVAVD TEEYDEEDPA DR