Gene Elen_1010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1010 
Symbol 
ID8415300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1222448 
End bp1224166 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content66% 
IMG OID645023974 
Producthistidine kinase 
Protein accessionYP_003181371 
Protein GI257790765 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0817847 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGCCT CAGGGGGAAT CGCCATGTCG TTCAAACGAG GTATCCGCTT CAAGTTCGCG 
GTGTTCATCG GCGCCTTCAT CGTGGCCCTC ATGGCCGTCG ACGCGCTGTG GAACGTCCAG
CTGCAACAGC AGCAGGCCGA GAACGAGGCG CGCGAGAAGG CCGAGGTGCT GGCCGACGAG
ATGCACGCGA TGTGGGACTT CATCGACATC AACCAGAACA CCATCAACCG CACCGAGGAC
GGCGCCTTCC GCACGAAGGC CCTCGTGTGC GTGGTCACCG CCAAGTCGGT GAGCACGCTG
TTCACCATGA ACACCGACTA CAAGATCCAG TACACGAGCC CCACCCCGCG CCAGGCGGCG
AACGCGCCCG ACGAGTTCGA GCAGCGGGCG TTCGAGGCCT TCGGCGCCGA TGCGGCCCTC
GAGGCGTACT ACGACGTGGG CTACGACGCC GAGGGGCGGC GCGTGTTCCG CTACGCCGAG
CCGCTGTACG TGACCGAGAC GTGCCTCGAA TGCCACGGCG AGCCCGTCGG CGAGCTCGAC
CAGTTCGGCT ACGAGAAGGA GGGCATGCAG GTGGGCGACA TCGGCGGCGC CGTGTCCATC
ACCGAGCCCA TGGACATCTA CGCCGACGGC ATGCGCACGA GCGTACTGCA GCAGGTGTTC
ATGGTGCTGC TCGTGCTCGT GCTGGCCTGC GTGGGCATCT ACTTCGCCGT GAGCAAGCTC
GTGCTGCACC CGCTCGACGC GCTCGGGCGC GCCGCGAAGC AGATCGGCGC GGGCGACTTC
TCCTACCAGC TGGAGGCGCG CACCGTGGGC GGCCCCGACG AGCTCACCGA GTTCGCCGAC
GACTTCGACA AGATGGCCCG CCAGCTGGAA CGGCTCTACA CCGACCTCGA AAGCGAGGTG
CGCAGCCAAA CCGACAAGCT CTCGGCGCTC AACGACCTGC TGCTGTACCA GAAGGTCGAG
CTCAAGAAGG CGCTCGACCG CCTCAGCGAG GAAACCGCCT ACAAAAACGA GTTCTTCGCC
ATCATGAGCC ACGAGCTGCG CACGCCGCTC ACGTCCATCC TCGCGTTCGC GCGCATCCTG
CGCGGGGTCG ACTCGCTCGA CGCCAAGACG CGCAGCGCCG TGGAGGAGAT CGAGGCGAAC
GCCACGCTGC TGCTCAACAT GGTGAACAAC ATCCTGACCA TCTCGAAGGC CGAGGCGCAC
AAGAACGAGC TGGTGGTGGA GCCGGTCGAC TTCGTGGACC TGCTGGGGTT CATCAGGAAG
TCGCTCGAGC CCGTGGCGAA GAACAAGGGC ATCGCCCTGA CCGCGAAGAC CGACGCCGAC
GTGCCCGTGT CGATGGCCGA CTGGGAGAAG CTGCGGCGCA TCGTCGAGAA CCTCGTGGAC
AACGCCATCA AGTACACCCA CGTCGGCGGT CGCGTGGACG TGCGCGCGAC GTTCGACGGC
GCCTGCATCG TCGTGTCCGT CGCCGACGAC GGCATGGGCA TCGACGAAGC CGACCAGGAG
GGCATCTTCG AGCGCTACCG CCAGGCCGGC CAGTCGCCCA ACCGCCGCTA CCGCGGCACA
GGCCTCGGCC TGGCCGTGGT GAAGGAGCTG GCCGAGCTGC ACGGGGGCAG CGTGTCGGTG
GCGTCGGCCC GCAAGCTCGG CAGCACGTTC ACCGTGCGCA TCCCCTACGT TGCCGTGGAT
ACGGAGGAAT ACGATGAAGA AGATCCTGCT GATCGATGA
 
Protein sequence
MRASGGIAMS FKRGIRFKFA VFIGAFIVAL MAVDALWNVQ LQQQQAENEA REKAEVLADE 
MHAMWDFIDI NQNTINRTED GAFRTKALVC VVTAKSVSTL FTMNTDYKIQ YTSPTPRQAA
NAPDEFEQRA FEAFGADAAL EAYYDVGYDA EGRRVFRYAE PLYVTETCLE CHGEPVGELD
QFGYEKEGMQ VGDIGGAVSI TEPMDIYADG MRTSVLQQVF MVLLVLVLAC VGIYFAVSKL
VLHPLDALGR AAKQIGAGDF SYQLEARTVG GPDELTEFAD DFDKMARQLE RLYTDLESEV
RSQTDKLSAL NDLLLYQKVE LKKALDRLSE ETAYKNEFFA IMSHELRTPL TSILAFARIL
RGVDSLDAKT RSAVEEIEAN ATLLLNMVNN ILTISKAEAH KNELVVEPVD FVDLLGFIRK
SLEPVAKNKG IALTAKTDAD VPVSMADWEK LRRIVENLVD NAIKYTHVGG RVDVRATFDG
ACIVVSVADD GMGIDEADQE GIFERYRQAG QSPNRRYRGT GLGLAVVKEL AELHGGSVSV
ASARKLGSTF TVRIPYVAVD TEEYDEEDPA DR