Gene Elen_1744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1744 
Symbol 
ID8416043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2051599 
End bp2052798 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content64% 
IMG OID645024710 
Producthistidine kinase 
Protein accessionYP_003182098 
Protein GI257791492 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.376505 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0197179 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGAGCGT CCAATTTCAG CGAGCGGGAG CTGGCGGGCG CGTGCGAGAA GTACGCGCGT 
GCGGTGAACG ACATCAAAGA TCGGTACGGG TTCGATTTCG TGTCCATCGG GTTGACGGCG
TTCATCGGCG CGCCGCTCAA GTGGATATAC AGCGCAGGCG CTACCGGCGA GCGCCATCGC
CGCATCGTGC TGGCGCCGGG GCACGGCATC GGCGGCATCA CCATCAAGGC GGGCAAGCCC
ATGATGTTCA CGAACATCGA CGAGGAGATC GACCCGCGCG AGTACTCGTC GTACCCTATC
GTGTTCGCCG AGGATCTGCA TAGCTTCTGC GCGTTGCCGC TCACGCGCGA CGGGCGCGTG
GTCGCCGTGC TGCTGTGCGC GTTCCGCACG GTGAGCGATC GGCACGAAGC GGCGTACCGC
CAGGTCATCG ACGATCTGGA CGGCACCTTG TGCGATTTGG ACGTCGTTTC CGACGACTTC
ATGGATTTCG AAAGAATCGC GGTGGAGAAA CGCGCCGACG ATCAGAAGAA CCCTATCTTC
ATCCGTTCGG AGCTTGCGCG CGTCATCGCC GCTCAGGAGG ACGAGCGCAA GCGCATCTCG
CGCGAGCTGC ACGACGGCAT AGCGCAGGAG CTGCTGACGC TGTCGTTCGT GTTCAAGCGC
CTTGTCGCGT ATGTTGACGA AGAGGGCTAC GAGCTGCTGG CGGAAGCGAA CAACGATCTT
GCCAACGTGC TTGACGAGCT GCACAACCTG TCGGTGAAGC TGCGACCCTC GGCTCTCGAC
CATCTTGGTT TCGTTGCGGC TCTGCGCTCG CAGGCTGCCG TGTTCGAGCG CACGTACGGC
AACGAGATCG TGTTCGAGGG CAGCCTGTCG TGCGATCGCT TCGATCAGGC TCTCGAGACG
CAGGCGTACC GCATCTGCCA GGAGGCCATC CTCAACGCCT GCAAGTACTC GGGTTCCGAG
AAGGTGATCG TCCGGCTCGA GGATTCGGCC GGATGGCTGC ATGTGAGCGT GATCGACCAC
GGATGCGGCT TCGACACCGA GCAGCCGGAG ATCAAGGGGA GCGGCTGCGG TCTCGTAGGC
ATGCAGGAGC GCGCGAGCGT CATCGGCGCC CGGCTCGCGA TGGAATCCGA CGAGCATGGC
ACGAAGATGA CGCTGGTTGC GCCGATGCAC GTGGCGGAAG GCAAGGAGGC GGGCGCATGA
 
Protein sequence
MGASNFSERE LAGACEKYAR AVNDIKDRYG FDFVSIGLTA FIGAPLKWIY SAGATGERHR 
RIVLAPGHGI GGITIKAGKP MMFTNIDEEI DPREYSSYPI VFAEDLHSFC ALPLTRDGRV
VAVLLCAFRT VSDRHEAAYR QVIDDLDGTL CDLDVVSDDF MDFERIAVEK RADDQKNPIF
IRSELARVIA AQEDERKRIS RELHDGIAQE LLTLSFVFKR LVAYVDEEGY ELLAEANNDL
ANVLDELHNL SVKLRPSALD HLGFVAALRS QAAVFERTYG NEIVFEGSLS CDRFDQALET
QAYRICQEAI LNACKYSGSE KVIVRLEDSA GWLHVSVIDH GCGFDTEQPE IKGSGCGLVG
MQERASVIGA RLAMESDEHG TKMTLVAPMH VAEGKEAGA