Gene Elen_0001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0001 
Symbol 
ID8417460 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp67 
End bp1182 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content60% 
IMG OID645022973 
Producttranscriptional regulator, TrmB 
Protein accessionYP_003180385 
Protein GI257789779 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000656108 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000126577 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGGACAACC AGATTGAAAA AGCAGATCAT CAGGATGAAA CCGCTGATGA AAACGTGGAG 
CCTTTCGACT ACACGCATGT ACACGCCGTA GCGCGCATCG CGCTGTATGA CGATCTTCGC
AGCGCCCCGC GCGTGACGGA GATACACCCG GCGCCGACGG CTGAATTCAT TGAAAGCCTT
GCTTCGAAAA TTTATGAACA GGCTAAAAAC GCCGGAGGGA CCATTCCGTA CACCGTCATC
CGCGAAGTAT CGGAAAACTT CATCCACGCG CGTTTCGCCG AGGCTACCGT GTCCATCCTG
GACGAGGGCA ATACCATCCG TTTCGCCGAC CAGGGCCCGG GCATACCTTA TAAGGATCAA
GCGCAGATCC CCGGGTTCAC ATCCGCCGTG GAGCCGATGA AACACTATAT TCGCGGCGTG
GGGTCGGGCT TGCCCATCGT GAAAGAGTAC CTCGATTTCT CGCACGGCAC CATCACCATT
GAGGACAACC TGGGAACAGG CGCCGTAGTG ACCATCAGCC TACGTGCGGG CGAGGCGACC
GATATGCCGC CCGTCGACCA ATCGAGCGCG CTTCACCCTG CATCCGCGCA ACCATCGGCC
ATGGAACCCG CTTACCCGAT GCACGAGGCG CCGCAGCAGC TGCAGCAGCA GATTCCTCCC
CAGCAGCCGA TGCAGCAACC CGCCTACCCG GCGCAATACG GATACGCAAA CCCACCGTAT
CCGCAGGAAG CCGCGCCCGC ACGACCGCCT TACGGTTACG AGCCCGAACC GCAGTACGCG
CCCCCGCGCT ATGCGCAGAA TCCCTACGCG GCAGGCGCGC CCTACTATCC CCAGCACGGT
GCCCCGGCTC ATCGCGCTCA GGGCATGGAC ATGCAAGCCC AGCATGCGAT GGCGCCGCTT
ATCCCCCCGT TGTCGCAACG CGAGCGCGAC TTCCTGCCCA TCTTCCTGAG CGAAGGAGCC
CTGGGAGTAA CGGACCTGTC GCGTTTGACC GGCGTGCCGC AATCAAGCAC GTACGTGGCG
CTGTCGAAAT TAGAGGAAGC CGGGCTTATC GAGAAAACGG TCGGACAGAA GCGCATTCTG
ACCGATCTGG GATACCACGT GGCAAATTCC CTATAA
 
Protein sequence
MDNQIEKADH QDETADENVE PFDYTHVHAV ARIALYDDLR SAPRVTEIHP APTAEFIESL 
ASKIYEQAKN AGGTIPYTVI REVSENFIHA RFAEATVSIL DEGNTIRFAD QGPGIPYKDQ
AQIPGFTSAV EPMKHYIRGV GSGLPIVKEY LDFSHGTITI EDNLGTGAVV TISLRAGEAT
DMPPVDQSSA LHPASAQPSA MEPAYPMHEA PQQLQQQIPP QQPMQQPAYP AQYGYANPPY
PQEAAPARPP YGYEPEPQYA PPRYAQNPYA AGAPYYPQHG APAHRAQGMD MQAQHAMAPL
IPPLSQRERD FLPIFLSEGA LGVTDLSRLT GVPQSSTYVA LSKLEEAGLI EKTVGQKRIL
TDLGYHVANS L