Gene Elen_1172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1172 
Symbol 
ID8415463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1407447 
End bp1408565 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content66% 
IMG OID645024135 
Producttranscriptional regulator, XRE family 
Protein accessionYP_003181531 
Protein GI257790925 
COG category[K] Transcription 
COG ID[COG1396] Predicted transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.38155 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGACA TTAACTTGGG AGCCGCCATC GCGCGCGAGC GCCGTGCGGC GCAGGTGACG 
CAGGGCGAGC TGGCCGCGCA CCTGGGCGTG ACGAAGGCGG CCGTGTCGAA GTGGGAGCTG
GAGCAAAGCA TGCCCGACCT GGCTCTGCTG CCGCGCATCG CCGCCTACTT CGACCTCACG
CTGGACGAGC TGTTCGACTA TCGGCCGCAG CTGGTGGGCG ACAATCTGCA AGGCGCCTAC
CTGAGGCTGC TCGCGCAGTT CGACGAAGAT CCCGAAGCCG CTTTCGCGAA CGCCGAGGAC
CTCGTCCGTT CGCACTACTC CTGCTGGCCC GCGCTGCAGC AGATGGGCAT GCTCTACGCG
CAGCGCGCAA CCCTCGACCC CGACCGCGCC GAGCCCTTGG CCGCGCGCGC AGCCGAGCTG
TTCGAGCGCG TCGAGCGGCA TGCCGACGAC GTGGAGCTGG TGCGCGCCGC GCGGATGATG
CGCGCTTCCG TCATGAGCGT GCAGGGTGAC TTGGACGGAT GCATCGCCCT GTTCGAAAGC
CTCAAGCCCG ACAGGACGAC GGCGAACATC GATCTCATGC TGGCCAGCAT GTACCAGCAG
AGAGGCGACC TTGACGCCGG CTTGAAGCTG TTCCAAGAAT CGATGGGCTG GTGCGTGATG
AACGCCATAA GCTGCGTCTC GGCGCAGATC CCGCTGTACG CCGACGACGC CGAGCACCTG
GAGGCGCTCC TGCGGGCCGG CGAAGGCGTG CTTTCCGGGT TCGATTTGCA GAACCAGAGC
CCGATGACGG TGCTCACGTT CTGCACGAAC GCATCTTCCG CGTGTTTGCA GGCGGGCGAC
GAGGATCGAG CCGCGAGCTA TCTCGAGCGC TTTACGTCTT TGCTGGAGGA GCTTGACGCG
CGCATGCTGG TGTACGGCCG GAACCAGAGC GCGCTGTACG ATCGGGCGCC CGAGCTTTGG
AGCGTCGATC CCGGCCAGGA GCATATCGCG GAAACCCGGT TTGGCGCGAT CGACTTCAGG
CGGCAGTGCG CGCAGATGGT GGCAGCGCAG CCCGTCTGGG CCGAACGTGC CGGCGATTCG
CGTTTCAAGC CGCTGTTCGA TCGACTGGAG GCCCTATGA
 
Protein sequence
MDDINLGAAI ARERRAAQVT QGELAAHLGV TKAAVSKWEL EQSMPDLALL PRIAAYFDLT 
LDELFDYRPQ LVGDNLQGAY LRLLAQFDED PEAAFANAED LVRSHYSCWP ALQQMGMLYA
QRATLDPDRA EPLAARAAEL FERVERHADD VELVRAARMM RASVMSVQGD LDGCIALFES
LKPDRTTANI DLMLASMYQQ RGDLDAGLKL FQESMGWCVM NAISCVSAQI PLYADDAEHL
EALLRAGEGV LSGFDLQNQS PMTVLTFCTN ASSACLQAGD EDRAASYLER FTSLLEELDA
RMLVYGRNQS ALYDRAPELW SVDPGQEHIA ETRFGAIDFR RQCAQMVAAQ PVWAERAGDS
RFKPLFDRLE AL