Gene Elen_3055 details

Gene Information

Locus tag: Elen_3055
Symbol:
ID: 8417390
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 3552145
End bp: 3553833
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 64%
IMG OID: 645026035
Product: histidine kinase
Protein accession: YP_003183387
Protein GI: 257792781
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
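
The coordinate and length fields above are consistent with one another. The sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(3552145, 3553833)
print(length)                     # 1689
print(protein_length_aa(length))  # 562
```

Both values agree with the Gene Length and Protein Length fields listed in the record.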


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGAAGC ATGCTTTCAG TCTGAAGACC ATCTATCTTG CCGCTCTGAC GGCGGTGTTC 
GCCGTGTCGT TTTTCGCGTT CGTCGCGTTC GATTTGTACT CGCAGCAGCG GCAGACCGAG
CAGGCCATGC TGGAGGAAGC GCGCACGTTC GCGCGCGAGA TGGACGCGGT GTGGCAGTTC
ATGGACAACT CCCAAAGCAT CATCAACAAT TCGTCCAGCG GCGGCTACGA GTTCAAGGGC
CTGCACTGCT CGGTGGTGGG CAAGAGCGTC GGCCGGCTGT TCTCGGCGGG CAGCGACTAC
CACATCCGGT ACACGAACTT CGACCCGCGC AGCGAGCAGG ATATTCCCGA CGAGTTCGAG
ACGAAGGCGC TCGAGGCGTT CAACGCCGAT CGTTCGGTGA CCGAGTACTA CGGCGTGGCC
CCGTTCGACG GCGAGGATCG GTTCCGCTAC CTGCAAGCGC TCGAAGTGGA CGACAGCTGC
CTCGAATGCC ACGGCGAGCC GGTCGGCGAG CTCGACATCA CCGACCACGC GAAGGAAGGT
TGGACGCTCG AGTCGGTCGG CGGGGCAATC AGCATCGTGA TCCCCCTCGA TCAGCAGCAG
GCGGCCATGC GCGGCAACGT CATCCGAGAC ATGGCGTACT TCCTGTTGAT CACCGTGTTC
ATCGGCTTGG TCATCTTGGT GGTGACCACT GTGTTCGTGC TGCGGCCGTT GGGCGGCATG
CACGCGGCGT TCGGCGAGCT GAAACAGGGA CGCTTGGGCG CGTCCGTCAG CCAGCGCTTC
GCCGCGAAGG AAGTGAGGAG CCTCATCGCC GGCTTCAACG ACATGGCGGG CGAGCTGCGG
GGCATGTACG AGCATCTGGA ATCGCAGGTG CAGGAGCGTA CGGTGGACCT GCGCGAGGCG
AACGCCCTGC TGGAACGCCA GCGCGACAAG CTGGAGCAGC TTAACGCCGA CCTGGCGCAG
GAGACGCAGT TCAAGTCCGA CCTGCTAAGC ATGGTGAACC ACGAGCTGCG CACGCCGCTG
ACGTCCATCA TCACGTTCGC GCAGATATCG CGCGAGGCGT GCGACCCGGC CAACGAGCAC
GACCGTCGCT CGTGGGAGGA GATCGAGAAG AACAGTCGCA TCCTGCTCAA CATGATCAAC
AACATGCTGG ACATCGCGCG TTCGGATGCG GGCGGCATGC GCGCCACCTG CGAGCCGATG
GATTTGGGCG ACGTGGCGGC ATCGGTGAAG GGCACCATGG CTCCGCTGGC GCGCAAGTAC
GAGGTGTCGT TCAGCACGAA GGTGGCGTCG GACGTGCCCT TGGTCAACGG CGACTACGAG
AAGACGACGC GCATGCTGGA GAACCTGGCC AGCAACGCCA TCAAGTTCAC GCCCGACGGC
GGCTCCATCG AGCTGCGCGT GGCGTACGAC GCCGAGGCGC GCGTGGTGAC GTTGTCGATG
GTGGACGACG GCATCGGCAT CGCGCCCGAG GACCAGGCGC GCATCTTCGA GCGGTTCGTG
CAGGTGGACA GCACGTCCAC GCGTAAGTAC AACGGCAGCG GCCTCGGTTT GGCACTGGTG
CGCGAATACG GCGACATGCA AGGGTTCGCC GTGTCGGTGG AAAGCGAGCT CGGTCGCGGC
AGCAGGTTCG TCATCACGAT TCCCGCGAGC GCGATCGTGG GCGAGATAGA GGGGGAGGAC
GATGTATAA
 
Protein sequence
MGKHAFSLKT IYLAALTAVF AVSFFAFVAF DLYSQQRQTE QAMLEEARTF AREMDAVWQF 
MDNSQSIINN SSSGGYEFKG LHCSVVGKSV GRLFSAGSDY HIRYTNFDPR SEQDIPDEFE
TKALEAFNAD RSVTEYYGVA PFDGEDRFRY LQALEVDDSC LECHGEPVGE LDITDHAKEG
WTLESVGGAI SIVIPLDQQQ AAMRGNVIRD MAYFLLITVF IGLVILVVTT VFVLRPLGGM
HAAFGELKQG RLGASVSQRF AAKEVRSLIA GFNDMAGELR GMYEHLESQV QERTVDLREA
NALLERQRDK LEQLNADLAQ ETQFKSDLLS MVNHELRTPL TSIITFAQIS REACDPANEH
DRRSWEEIEK NSRILLNMIN NMLDIARSDA GGMRATCEPM DLGDVAASVK GTMAPLARKY
EVSFSTKVAS DVPLVNGDYE KTTRMLENLA SNAIKFTPDG GSIELRVAYD AEARVVTLSM
VDDGIGIAPE DQARIFERFV QVDSTSTRKY NGSGLGLALV REYGDMQGFA VSVESELGRG
SRFVITIPAS AIVGEIEGED DV
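
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method: it assumes the sequence is read in frame from the first base, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; these assignments
# apply to both the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGGGAAGC ATGCTTTCAG TCTGAAGACC"))  # MGKHAFSLKT
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_percent` can be compared against the GC content field in the same way.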