Gene Elen_1246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1246 
Symbol 
ID8415538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1497533 
End bp1499821 
Gene Length2289 bp 
Protein Length762 aa 
Translation table11 
GC content63% 
IMG OID645024210 
Producthistidine kinase 
Protein accessionYP_003181605 
Protein GI257790999 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0124717 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAACG CCCGCTACGA CTGGAAGAGG GGCGTCCTCC CTATGGAACT GATTACCCGC 
TCGCGCGAGC GGCGGCGCAT CGCCATCATC GTGGTCGCGC TCGTCCTTGC GCTTCTGATC
GCGCTCGCGT ACACGATCGT CTCGTCGCGT CAGGCGACGG CGGCCGCGGT GGAAACCATG
AGCGAAGTGT ACTTGCAAGA GCTCAGCAAC CAGGTCATCT CCCATTTCAA CACCGGCATC
GACGGCAAGT TCTGCCAGCT GGAAACCGTG GGCAGCTCGC TTGAGCTGTA CGACCCGGAG
AATCTCGACG AGGTGAGGGA TTTCCTCGCC TGCATGGAGG CGGACGATGA CGAGTACGCC
TACCTTGCGC TGCGTGGCAG CGACGGGCTC TACTACACGT CGTGGGGCTC GGACGCCGCC
TCGGATGAGG AGCTTGCCGC CCGCAGCGAG CTGGGGCTGT CGTATCGGGA CGCGCACGGC
CACGACGTGA TGCTATACAA CGGAACCATC GCGCTCGTGG ATTCCTTCGA GCCCGTTACC
TGCGGAGGCG TCACGTTCAC CGCCGTCGTC GCCGGGTTCG GCGTCGACAC CGTTTCCGGG
AAGCTCAACC TCGATCTGGT CAATGGCAGC GCGCGGTCGA GCGTCATCGG CTTCGACGGC
ACTTGCATCG CAGGATGCGA TGCCGAGGGG CTGTGCAACG GCGAGAACCT GTTCGACGCG
TTGGAATCGA GCGCTCGCCT CGACAAGGGG TACAGCATGG ACCAGGTGCG AGAAGCCGTG
GAGAACGGGG AGACGTTCCT GCTTCCCTTC TGGTACGGGG GTCATCACGA GTACCTGTAC
TTTCGGCCGA TGAGCAATCA AGACTGGTAT CTGTGCACCG CCATGCCCTA CGGCGTGGTG
GACGAGGACG TGGCGGGACT CAGCTGGGTG CTCATGCAGA ACGCCGTTCT CATGGCGACC
ATGATCATCG CGGTCATAGC CATCTTCTTC TCGATCTACT ATCAGCTGGT CAAGCGCAAC
ACGCGTCTGC TGTCGGACGA GAAGAACCGT GCCGAGCGCG CTTCCGAGGA AGCGCGGCGC
GCCAGCTTGG CGAAAAGCGA GTTCCTCTCG CGCATGTCGC ACGAGATTCG TACGCCGATG
AACGGCATCA TGGGCATGAC CGCCATCGCG CTCGAGAATG CGCACGACGA GGAGAAGGCG
CGAGCCTGCC TCGAGAAGAT CGACGTGACG TCCGAGCATC TCATGGCGCT CATCAACGAC
ATCCTCGACA TGAGCAAGAT CGAGAGCGGC AAGATCGACA TCAAGCGCGA GACGTTCGAT
TTCGGGACGT TCGTAGGGTC GCTCGACGAC GTGTTCGGAA CGCAGGCTCT CGAACAGGGG
ATTCGCTACA AGACGGAAGA AGTGGGCGCG CTGCCCTCGC TGCTCGTCGG CGACGGGCTT
CGGCTGAACC AGATCGTTTA CAATCTCGTT GGCAACGCTT TCAAGTTCAC TCCTCGCGAC
GGCAGCGTCA CGCTGCGCAT AGAGGAGCTT CCCGCGCCGC CGGAAGAGGA TGCTGCGCAC
GACGACGCGA TCTGGCTGCG CTTCTCGGTG ACCGATACGG GTTGCGGCAT CAAGCCGGAG
AACCGCGAGC GGATATTCTC GTCTTTCGAG CAGGGCGACG AGTCTTCATG CAGACGCGGA
GGCACGGGTC TGGGCCTTGC CATCACGAAG CGGTTCGCGG AGATGATGGG CGGCCGTATA
TCGTTGTCGA GCGAGGTGGG GAAGGGGTCG ACCTTCACGG TGGACGTGCC GTTCGGGCGA
GCCTCGGGCG AAGGGGCCGC CACGTGCGAC GATGCGTTCG CCGCGCGGTC CCGAACGCAC
GGCGACGGAG TTTCGTACGA TTTCTCCGGC AGGCGCGTCA TCGTCGCCGA AGACAACGAG
CTCAACCGCG AGATAGCCAC CGAGGTGCTG GCCATGGCGG GTGCCGAGGT CCTGGCGGCA
TCCACGGGCG CCGAGGCCGT GCGCGCGTTC GAGCGCTCGC ACCCGGGCTC CGTCGACCTG
ATCTTCATGG ACATCCAGAT GCCCGAGATG GACGGCTACG AGGCGACCCG CGTCATTCGC
TCGCTCGATC GCGACGACGC GCGTTCGGTG CCCATCGTCG CGATGACGGC CAACGCGTTC
GTCGAGGACG AGGAGCGCAG CCGCATGAGC GGAATGGATG GCCATCTGAG CAAACCCCTT
GATATCCATC TCGTATATGC CACAATGGAC AGGTTTTTGA GAGGGCGCTC GCGGGGAGGG
GGCGCGTAG
 
Protein sequence
MKNARYDWKR GVLPMELITR SRERRRIAII VVALVLALLI ALAYTIVSSR QATAAAVETM 
SEVYLQELSN QVISHFNTGI DGKFCQLETV GSSLELYDPE NLDEVRDFLA CMEADDDEYA
YLALRGSDGL YYTSWGSDAA SDEELAARSE LGLSYRDAHG HDVMLYNGTI ALVDSFEPVT
CGGVTFTAVV AGFGVDTVSG KLNLDLVNGS ARSSVIGFDG TCIAGCDAEG LCNGENLFDA
LESSARLDKG YSMDQVREAV ENGETFLLPF WYGGHHEYLY FRPMSNQDWY LCTAMPYGVV
DEDVAGLSWV LMQNAVLMAT MIIAVIAIFF SIYYQLVKRN TRLLSDEKNR AERASEEARR
ASLAKSEFLS RMSHEIRTPM NGIMGMTAIA LENAHDEEKA RACLEKIDVT SEHLMALIND
ILDMSKIESG KIDIKRETFD FGTFVGSLDD VFGTQALEQG IRYKTEEVGA LPSLLVGDGL
RLNQIVYNLV GNAFKFTPRD GSVTLRIEEL PAPPEEDAAH DDAIWLRFSV TDTGCGIKPE
NRERIFSSFE QGDESSCRRG GTGLGLAITK RFAEMMGGRI SLSSEVGKGS TFTVDVPFGR
ASGEGAATCD DAFAARSRTH GDGVSYDFSG RRVIVAEDNE LNREIATEVL AMAGAEVLAA
STGAEAVRAF ERSHPGSVDL IFMDIQMPEM DGYEATRVIR SLDRDDARSV PIVAMTANAF
VEDEERSRMS GMDGHLSKPL DIHLVYATMD RFLRGRSRGG GA