Gene Elen_3094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_3094 
Symbol 
ID8417430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp3598485 
End bp3599945 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content61% 
IMG OID645026074 
Productregulatory protein GntR HTH 
Protein accessionYP_003183425 
Protein GI257792819 
COG category[K] Transcription 
COG ID[COG2186] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.0435211 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGAAAAGA AAGCCACCTT ATTCGAGTAC GTGTATCGTC AGCTTGTCAG CGACATCGAG 
CAGGGAGCGC TGCGCTATGG CGACGCGTTG CCGTCGCTGC ACGACTTGTG CGATCGATTT
CGGGTGGGCA TCAGAACCAT CCGCGACGTG CAGCGTGCTT TGAAGGCCGA CGGGTACATT
GTCGTGGAGG AGCGAAAGCG CGCTGTCGTG GCGTACCGTC CGGCCGATGA CGCGGATGAC
GGTCGCATCC GTGCGCTGCT GGCGCGTCGC GAGGTGGTTG CCGATTGCTA CAGAACCCTC
GAACTGGTCA TGCCCCCGTT GTTCCACTTA GCCGCTCGAT GCTGCTCTGA CGAGGATCTT
TTCGCTTTGG CTCAGGATGC GAAGCGCGTT GATCGCTCTG GCGTATCGGA AGGGTGGCGC
ATATCGCACG CATCGATTGT TTTGCACGGC CTGGTGGCAA AGGCAGGCAA CCCGTTGTTC
ACGTCGTGCT TCGCAAGCTT GGAGCGCATC GGTCTGGTGC CGGTGGTTCC GGGTTTCGAA
AGCCCTTTTG CCTCGCGTGC GGTCGACGTC GACGGGGGTT TGATGACGTG GATGTTCTCC
TCTCTGCTTT TGCGCGATGC TGATGAGGTG CAGTATCGGT TCGGTTGCAT GTATCGCGGC
GTCGCGCAGC GCGTCGGGGC GTATTTCGAC GCGCTCGAGA ACGCATATGC CCCCGCCGTG
GACATCGGAT CGTTCGGGTA CGCGTGGAAT GCGAAGGCCG GGTTGGAATT CGTGCACGGA
CAAATAGCTC GCAATCTTGT GGAACGCATC GTTCGGGGGG AATTCGTTGA CGGCCAGCTG
CTTCCTTCCA TCGCCGAGCT GTCGGCGCAG TATGGAGTCT CCGCTTCCAC GGTGCAGAAA
GCGTACGGCG CGCTCAACGT CATAGGCGTC GCGCGCACCG TCAACGGGTT GGGCACGCGC
GTGCAGTTGG GCAATGCGAC GTTCAGCGAG CGCTGGCTGG AAGATCACTC GTTCAAGAGA
GATGTGGGCA CGTACGTCCA TGCGGCGCAG ATGATGTGCG CCGTGCTGCC TGCTGCATTG
AGCCGCGTGC AAGGTCATGC GGAGGAGGTG GCCTCTTCTG CGGAACGCGC CCTCGCGGTC
GAGGAGGGGG ATTGGGCGGT GTCGAAGGCT CTCATAACGG GTTTGATCGA ATGCACGGAG
CCTTGCGCAT TGCAGACGAT TCTGCGGGAG CTGAACGATT TGCTGCGCTG GGGAGCCTTC
CTTACGCTGT TCGCGGCTTC GCGAGAAAGC GCGTCCGCTC TCTCGTCTCT CGAAAGCATC
GCTCTCGGGC AGGCGCGCAG AGCCGACGAC GAGGGGTTCT CCCAATCGAT GACCGCGTAT
TACCGCCTTA TGCTCCAGTC CGTGCTCGCG TTTCTCGAGA GGGCGGGTAT GATTGACGTC
GACCGCACGA TCGTGCCCTA G
 
Protein sequence
MEKKATLFEY VYRQLVSDIE QGALRYGDAL PSLHDLCDRF RVGIRTIRDV QRALKADGYI 
VVEERKRAVV AYRPADDADD GRIRALLARR EVVADCYRTL ELVMPPLFHL AARCCSDEDL
FALAQDAKRV DRSGVSEGWR ISHASIVLHG LVAKAGNPLF TSCFASLERI GLVPVVPGFE
SPFASRAVDV DGGLMTWMFS SLLLRDADEV QYRFGCMYRG VAQRVGAYFD ALENAYAPAV
DIGSFGYAWN AKAGLEFVHG QIARNLVERI VRGEFVDGQL LPSIAELSAQ YGVSASTVQK
AYGALNVIGV ARTVNGLGTR VQLGNATFSE RWLEDHSFKR DVGTYVHAAQ MMCAVLPAAL
SRVQGHAEEV ASSAERALAV EEGDWAVSKA LITGLIECTE PCALQTILRE LNDLLRWGAF
LTLFAASRES ASALSSLESI ALGQARRADD EGFSQSMTAY YRLMLQSVLA FLERAGMIDV
DRTIVP