Gene Elen_3010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_3010 
Symbol 
ID8417344 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp3494757 
End bp3496370 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content66% 
IMG OID645025989 
Producthistidine kinase 
Protein accessionYP_003183342 
Protein GI257792736 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.335446 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGCATG GGGTACCGAT GGCGCGCGTA AGGGGCGCCG AACGCCTCGA CGCCTTGCGG 
GCGCGTTGGC ACCACGCGTC GCTCAAAACG TCGTTCATGG TGTACATGCT GGGGTTCTTG
CTGCTGGCGC TGGTGATGTC CACGGTGACG GCCGGGATGT TCAGCGGGTT GCAGCACCGG
GTGATCGCAG ACGCGTACGA AATCTCGGGG CTCTACCTCT ACGATGCGCA CAACCGGACG
CTCGTTCCCG CGCGCGCCGT GGATATCGAC GAGAACGGCA GCAGCGTGTT CGTTCAAACC
GTGCGCGGCG AGATGACCGA GATGCCGCTG ACAGACTTGT CGTCGCTCGT GGAGATCACT
GACGCGAGCG ATTACCAGTA CTCGCCGGGG ACGGCGTATC TGTATGGTTC CGCCAACGAC
GACGCACTCA GCGTCGAAAC GCCCGAAGCG CCCGACCAAA CCGAGCTCAC GCCCGCCGAG
CTGCCCGCCT ATGACGCGAA AGCGCGCGAT CGGTTCGATG CCTGGCTGGC TGCCAACCCT
GATAGCCCCT ACACGTCGTT CTTCGACGGA GACAGCCAGG ACGGCAACAC CGTCGGACTG
CTCACTTCCG CAGTGGGTTA CTACGTGAAC ACGCCCCCAT CGGAGGAGGC GCAGGCGCTG
TCCTCGGCAC TCGAGCTGCT AACGTTCCTC ATGTTCCCAC TGTGGTTCGG CCTATGCATC
TTCGCAGCGG CGCGGCGCTT CTTCGGCAAG CGCCTCCAAC CTGGATTCGA TGTGCTCGAC
CGAGCAGCGT CCAACATCGC CGAGCAGAAC CTCGAGTTCT CCGTGTCGTA CGATCGAGAT
GACGAGCTGG GCCATCTAGC TTCGTCGTTC GAAGCCATGC GCGCCTCGTT GGCCGAATCG
CAGCGCGCGC TGTGGCGCAC GGCCGAGGAG CGCAAGCGCC TGAACGCCGC GTTCGCGCAC
GACCTGCGCA CGCCGCTCAC CATCCTCAAG GGCAAGATCG AGCTGCTGGA CGCGCACGTG
CAGTCCGGCA ACGTGCCCGC CGATCGCCTG ACGGCCTCCG TCGCCTCGCT GGCCGCGCAG
GTGGAACGCC TTGAGCGCTA CGTCGCCGCT ATGAGCGGGC TGCAAAAGCT GGAAGACCGC
GCCGTCGTTG CGCGCCCGAC CAGGTTCGAC AGCGTGGCGG GCATGGTGGA GGACGCCGGA
ACGGGGCTTG CTGCGCACGC CGACCGGACC TTCGCGCTGT CGGTGAGCGC ACGTTGCGAC
CGCGAGCGGC CAGAGCTATG CGTGGATCAG GCCATCGTGG GCGAGGTGGC CGAGAACCTG
CTGAACAACG CCATGCGCTA TGCGTCGTCC CAAGTGGACG CTCGCCTCGA CGTGCGTGAC
GGCGCGCTGG TTCTCATGGT GGAAGACGAC GGGCCGGGCT TTTCGAACGC CGCGCTCGAA
CGCGGTTGCG CGCCCTTCTT CAGCGAGGTG CCCTCGGCAG AGCATTTCGG CCTGGGGCTG
AACATCGCGT CGCTCATGTG CGAGAAGCAC GGCGGCGGCG TAACGCTGGA GAACCGCGAG
GGGGGAGGCG CGCGCGTGGT CGCGCGATTC TCCCTGGATT TCTGCGCCGA GTAG
 
Protein sequence
MGHGVPMARV RGAERLDALR ARWHHASLKT SFMVYMLGFL LLALVMSTVT AGMFSGLQHR 
VIADAYEISG LYLYDAHNRT LVPARAVDID ENGSSVFVQT VRGEMTEMPL TDLSSLVEIT
DASDYQYSPG TAYLYGSAND DALSVETPEA PDQTELTPAE LPAYDAKARD RFDAWLAANP
DSPYTSFFDG DSQDGNTVGL LTSAVGYYVN TPPSEEAQAL SSALELLTFL MFPLWFGLCI
FAAARRFFGK RLQPGFDVLD RAASNIAEQN LEFSVSYDRD DELGHLASSF EAMRASLAES
QRALWRTAEE RKRLNAAFAH DLRTPLTILK GKIELLDAHV QSGNVPADRL TASVASLAAQ
VERLERYVAA MSGLQKLEDR AVVARPTRFD SVAGMVEDAG TGLAAHADRT FALSVSARCD
RERPELCVDQ AIVGEVAENL LNNAMRYASS QVDARLDVRD GALVLMVEDD GPGFSNAALE
RGCAPFFSEV PSAEHFGLGL NIASLMCEKH GGGVTLENRE GGGARVVARF SLDFCAE