Gene Elen_3093 details

Gene Information

Locus tag: Elen_3093
Symbol:
ID: 8417429
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 3596661
End bp: 3598325
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 59%
IMG OID: 645026073
Product: histidine kinase
Protein accession: YP_003183424
Protein GI: 257792818
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase; [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain
TIGRFAM ID:
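
The coordinates and lengths in the record above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length; neither convention is stated in the record. The function names are hypothetical.

```python
# Values copied from the Gene Information record.
START_BP = 3596661
END_BP = 3598325
GENE_LENGTH_BP = 1665
PROTEIN_LENGTH_AA = 554


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (assumption)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in the record.
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3598325 - 3596661 + 1 gives 1665 bp, and 1665 / 3 - 1 gives 554 aa, matching the listed gene and protein lengths.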


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.2154
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCGAG CGCCGAATGC GCGCTGGACG CGCATCTTCG GGTTGGCGGG TTTGCTGACC 
CTGCTTCGCC GCGTGCGGCG CCGGGCTGAT GCGAAGGAGC TGGCCATCGA GAGCCGCGAG
CAGCTGTTCA GCATGCTGGT GCGCAATGCC GACGATATCT ACGTTATGTT CTCTCCCGAT
GCGTACGAGG TCGAATACGT CAGTCCCAAT GTGGAGAAGC TTTTGGGCGT GTCGGTCGAA
GCGGTGAAGA ACAACATACG AGCGCTGTCG GATTCGGCGG CCGATCCATC GTCCGACCCT
CGAATCGATG ACATTGCGTC GTTGGAAGAG GGCGAGTGCC TTCAAGTGTT TCGCGAGCGC
ATTCAAGCGC GCACAGGAGA GCGTCGCTGG TACCAGGAAA CGTTGTACCG GGAATCCATC
AAAGGGGTGG ATAAGTACGT GCTGGTGCTG TCCGATCGCA CGAACGAGCA AAAAGGAAGT
TTGATGCTCG AGCAGGCGCT CGATATCGCA CGATCGTCGA ACGAGGCGAA GAGCCAGTTC
CTTGCGAACA TGTCGCACGA CATACGCACG CCCATCAACG CCATCGTGGG CATGACGAAG
ATCGCACGGG AAAGCGGCGA GGCCAGCGAG AAGATAGCCG GTTGCCTCGA TGCCATCACC
GCGTCGTCTC GTCATTTGCT GGAGCTTATA AACGACGTTC TGGACATGTC GCGCATCGAA
AGCGGGCAGA TGGAGCTTCA GGAGCGTCGG TTCGACATCG ATGACGTCGT GGGCGGGGTA
GAGGCCATCA TCCGTCCTCA GGCTCAGGCG AAGAGTCAAG AGCTGCGCAT CGATCGCTCG
AAGATGAAGC ATAAGGCGTT CTCGGGAGAC GAGCTGCGCA TAAGCCAGAT CCTTTTGAAC
CTTGCGTCCA ATGCGGTGAA GTACACGCAG GAGGGCGGCT CCATAGCCCT CGTCGTGCAG
GAGCTGGCGA AAACGCGGCC CAGCTATGCG AGCGTTGCTT TCACGATAAC CGACAACGGC
ATGGGCATGT CGCCGGAATT CGTCGAGCGT ATTTTCGATC CGTTCGAGCG CAGCGAAGAG
GTGTCTATCG CGCGGATACA GGGAACCGGG TTGGGGATGT CCATAACGAA GGCGCTGATC
GACGCCATGG GCGGCATTGT CGAGGTCGAC AGCGAAAAAG GACGAGGCAG CTCGTTTCGC
GTGACATTGG AACTTCGCGT CGTCTCGTCG GCGGATGCTT GTCCGCCGCT TGAAGTCGTG
CCGGAGAGCA CCTCCTATCG ATTCGAGGGC AAGAGGTTTC TCCTGGCCGA GGACAACGAG
CTGAACGCTG AGATATTGAT AGAGCTTCTC GGCTGCCGCG GCGCGAAAGT GGAGTGGGCC
GAGAACGGCG AAAAGGCGAT CGATGCATTC TCGAAGCACC CGGCAGGATA TTACGATGCG
GTGTTCATGG ACGTGATGAT GCCGGTGATG AACGGCTACG AGGCAGCGCG CGCTCTGCGC
GCTTGCAGCA GCGCCCGCTC GGAAGAAGTG AAGATCGTCG CGCTGACGGC CAACGCATTT
GCCGAAGATG TGAAGTCCGC GCTCGATGCG GGCATGGATG CCCATGTGGC GAAACCGGTC
GACATAGACG GGCTTGCATG CGTGCTCGGA AAGGTTTGCG GCTGA
 
Protein sequence
MKRAPNARWT RIFGLAGLLT LLRRVRRRAD AKELAIESRE QLFSMLVRNA DDIYVMFSPD 
AYEVEYVSPN VEKLLGVSVE AVKNNIRALS DSAADPSSDP RIDDIASLEE GECLQVFRER
IQARTGERRW YQETLYRESI KGVDKYVLVL SDRTNEQKGS LMLEQALDIA RSSNEAKSQF
LANMSHDIRT PINAIVGMTK IARESGEASE KIAGCLDAIT ASSRHLLELI NDVLDMSRIE
SGQMELQERR FDIDDVVGGV EAIIRPQAQA KSQELRIDRS KMKHKAFSGD ELRISQILLN
LASNAVKYTQ EGGSIALVVQ ELAKTRPSYA SVAFTITDNG MGMSPEFVER IFDPFERSEE
VSIARIQGTG LGMSITKALI DAMGGIVEVD SEKGRGSSFR VTLELRVVSS ADACPPLEVV
PESTSYRFEG KRFLLAEDNE LNAEILIELL GCRGAKVEWA ENGEKAIDAF SKHPAGYYDA
VFMDVMMPVM NGYEAARALR ACSSARSEEV KIVALTANAF AEDVKSALDA GMDAHVAKPV
DIDGLACVLG KVCG