Gene Elen_1940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1940 
Symbol 
ID8416247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2274557 
End bp2275993 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content68% 
IMG OID645024913 
Producthistidine kinase 
Protein accessionYP_003182293 
Protein GI257791687 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.208818 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.0947013 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCGCG CCCCTCGCAC ATCTGCTCGA GACCGTCGCA CAAGGCGACC GGCGGGCTTC 
GCGCGCTTCT TCACCAAGCA GCTGCTGCTG TTCGTGGCGC TGGCGCTGCT CATCGTGGTC
ATCGACTTCT TCCTGTACGC CGTCATCGCC TACCGGGAAT CGAACTCGAA CTTCAACGAC
GGCACGCCCG CGTCCACGAC GCGCGCAGTC GACCAAGCTC TCGAGCAGGA TGCCGACGGC
TCATGGACGC TTGGGGAGAA CGGCCTGGAG GCGCTCGGCC AGCAGGACGC CTGGGCGCTC
GTCATCGGGA CGGACGGCGC GGTGGCCTGG TCGCAGGACA AGCCGGAGGA CGTGCCCGAC
CGGTTCAGCG TGAACGACGT GGCGATGGCA GCGCATTATG CGGCCGTCGC CGACTATCCG
GCGTTTTTCT GGGATCGGGA CGACGGCCTG CTGGTCGTGG GTTTCCCGAA GAACGAGTTC
TGGACGATGA CGCTCACCTA CCCCGCGTCG ACCGTGCGCA ACTTCCCGCT GTACGTGCTG
CTGATATTCG CAGTCGACCT CGGGATCTTG TACACCATCT ACGCGGTGTC GCGTCGCCGG
ACGCAAAACG CCGTGGCTCC CATCGCCGAA GCGCTCGACG CGCTGTCTGA CGGGCGCGCG
GCCGAGCTGC ACCTCAAAGG GGATCTGCGC GACATCGGCG ACCAGATCAC CGAGACGAGC
GCCGTCATCG AGCAGAAGGA CGCCGCGCGC GCAAGCTGGA TTCGCGGCAT CTCCCACGAC
ATCCGCACGC CGCTGTCGAT GATTCTCGGC TACGCCGACG CGCTCGTGCA GGACGAGGGC
GCAGCCGAAG AGGCGCGCGC AAGCGCGCGG GTCATCAGAG CGCAGGGGCT CAAGATCAAG
GACCTCGTCA CCGATCTCAA CACCGCCTCG CAGCTGGACT ACGACATGCA GCCGATGCGC
CTGGAACGCG TGCATGCGGC GCGCCTGCTG CGCACCGTGG CGGCCGCGCA TGCGAACAGC
GGGCTGGACG AAGCGCATCC TATCGAGCTC GACATCGCAG AGGACGCGCT GAACGCGGTG
GTGCTGGGGG ACGAGAGGCT GCTCACGCGC GCGGTGGAGA ACGCCATCTC CAACGCGCGG
CTGCACAACG AGCAGGGATG CACGATCAGC GTCGAACTGG CGCTGCGAGA CAACGCGTAC
TGCACGATCC GCGTGAGCGA CGACGGTGCT GGAATCGCGG CTGCCGACCT CGCCGCGCTC
GAAGCGCGCC TCGCGCGCTC GCGCACGGCG CGCAGCGCGG CTGGGTCGTT CAACCGCGAT
CACGGCCTGG GGCTCGTTCT GGTCGACCGC ATCGCCCGCG CGCACGAGGG ATCGCTCTCC
CTCGACGGCG CGCCGGGCGA AGGCTTCTCC GTGACCCTCG CCCTCCCGCT GGCGTAG
 
Protein sequence
MGRAPRTSAR DRRTRRPAGF ARFFTKQLLL FVALALLIVV IDFFLYAVIA YRESNSNFND 
GTPASTTRAV DQALEQDADG SWTLGENGLE ALGQQDAWAL VIGTDGAVAW SQDKPEDVPD
RFSVNDVAMA AHYAAVADYP AFFWDRDDGL LVVGFPKNEF WTMTLTYPAS TVRNFPLYVL
LIFAVDLGIL YTIYAVSRRR TQNAVAPIAE ALDALSDGRA AELHLKGDLR DIGDQITETS
AVIEQKDAAR ASWIRGISHD IRTPLSMILG YADALVQDEG AAEEARASAR VIRAQGLKIK
DLVTDLNTAS QLDYDMQPMR LERVHAARLL RTVAAAHANS GLDEAHPIEL DIAEDALNAV
VLGDERLLTR AVENAISNAR LHNEQGCTIS VELALRDNAY CTIRVSDDGA GIAAADLAAL
EARLARSRTA RSAAGSFNRD HGLGLVLVDR IARAHEGSLS LDGAPGEGFS VTLALPLA