Gene Elen_3044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_3044 
Symbol 
ID8417379 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp3539991 
End bp3541307 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content60% 
IMG OID645026024 
Productsignal transduction histidine kinase, LytS 
Protein accessionYP_003183376 
Protein GI257792770 
COG category[T] Signal transduction mechanisms 
COG ID[COG3275] Putative regulator of cell autolysis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGAAG AGACCGGGGA GTCGAAATCG GCGCTGCCGC GCTTCTTCAC GCTCGAGATG 
TTCATGTTCA CGGTGACGGC GCTTTCGGGT CTTGTGTTGC TGTGGTCCAT CGCGGTGCCG
TACCGCAACG TGGCAATCAT GGTATGCGCA GGCGTGGTGT TCACGCTGTC CATAGTGGTG
GTCATCCGTC TCATCATGGA TCCCGATTCG GTGCGCGCTC GTCAGTCTGA CTCCATGTTG
AAGCTGGCTA GCCAAACGCT GACCTGCATG AACGACGGCA TGGATTACAA GGCCGCGCAG
AAGATCTGCG GGTTGCTGCT GCCGTCCACC GCGGCTATCG CCGTGGCTAT CACCGACAAA
AAGCAGATTT TGGGATACGC AGGCTTCGAG GAAGCTCAGA ACCTGCCGGG CAGCATCATC
CGCACCCACG CGACCCATGC CACGCTCGCC GACGGCAAGC TGCGCATCCT GTTCACGCCC
GAAGATATCG GCTTCCCCAG CGAGTCGTCG AATATCAAGG CAGCCATCAT CGTGCCGCTT
GCCATAGGCC GCAACGTCGA GGGCACGCTC AAGTTCTACT ATCGCCGAGC GAAGCATATC
AGCGAGACGC AGAAGTCCAT CGCCGAAGGG TTCGGCAAAC TGCTGTCCAC GCAGATGGCG
GCATCGGCGC TGGAAGAGCA GACGCAGTTG GCCACGCGCA TGGAGCTGAA GATGCTCCAA
AGCCAGATCA ACCCGCACTT CTTGTTCAAT ACCATCAACA CCATCGCCTC ACTCATTCGC
ACCGACCCCG AAACGGCGCG CAAGCTGCTG CGCGAATTCG CCGTGTTCTA TCGCCGAACG
CTTGAAGACT CCGCCGATCT GATCGTGTTC GCGCGTGAGA TGGAGCAAAC GAAGCGGTAC
TTCACGTTCG AAGTGGCGCG TTTCGGTGCC GACCGCGTGG AGATGGAGAT GCGCATCGAT
CCTCGTGTGG AAGACATGCT GGTGCCGCCT TTTTTGCTGC AACCGCTCGT GGAGAACGCT
GTACGCCACG CCATGCCGAG CGAGGGGAAG TTGACTATCG AGGTGACGGG CGAGGTCACG
GGCAACGACG TGATTGTGCG CGTGTGCGAC AACGGCGTGG GTATGACCGA AGAGGCGCGC
TGCAACATTC TTCATCCCGA ATCGTCGCTC GGCCTCGGCA TCGCGGTGAA GAACGTGCAC
GATCGAATCT GCGGCTACTT CGGTCCCGGT ACGCATATGG AAGTGGAAAG CGAGCTCGGC
AAGGGAACCT GCGTGATCCT CGTATTGAAG GAAGGGGCTC TGCGCGAGTA CCAGTAG
 
Protein sequence
MQEETGESKS ALPRFFTLEM FMFTVTALSG LVLLWSIAVP YRNVAIMVCA GVVFTLSIVV 
VIRLIMDPDS VRARQSDSML KLASQTLTCM NDGMDYKAAQ KICGLLLPST AAIAVAITDK
KQILGYAGFE EAQNLPGSII RTHATHATLA DGKLRILFTP EDIGFPSESS NIKAAIIVPL
AIGRNVEGTL KFYYRRAKHI SETQKSIAEG FGKLLSTQMA ASALEEQTQL ATRMELKMLQ
SQINPHFLFN TINTIASLIR TDPETARKLL REFAVFYRRT LEDSADLIVF AREMEQTKRY
FTFEVARFGA DRVEMEMRID PRVEDMLVPP FLLQPLVENA VRHAMPSEGK LTIEVTGEVT
GNDVIVRVCD NGVGMTEEAR CNILHPESSL GLGIAVKNVH DRICGYFGPG THMEVESELG
KGTCVILVLK EGALREYQ