Gene Elen_0304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0304 
Symbol 
ID8414588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp399426 
End bp400436 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content70% 
IMG OID645023281 
Producttranscriptional regulator, AraC family 
Protein accessionYP_003180684 
Protein GI257790078 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATTCGCG CAGACGGAAA TCGGGACGGC ACGGCTGCCG CATGCGCCGA CTACGACATC 
ATGCGCGAGG TGTTCCTCCC CTGCCTCGCC GACATGCAGA TGGAGGAAAC GTGCGCGGCG
GACGTGGGCG CGCCCGAACG CGAGGGGCGC ATGTTCCGCA TGCCGCGCGA CGTGGCCTCG
GGATACTTCT GGCTGTACGC CGAGCGCGAT TTCGCCATCT CGGCGACCGA CATGCTGTTC
CCCCGCGACT TCCGCGAGCA ATGCCGGCAT CCGCGCTTCG TCTCGGTGCG CTACTACCTG
TCGGGAAGCT GCGTCGAGAG CGTCACGAAC CGCACGGTGG AGGCACCTTA CCTGGAGGGG
CACGTTCTGG ACACGCCGCA TTGGGACTGC CTGTGCCGGG CCGGCACGCC CATCCGCAAC
GTCGAGATCA TGCTCGCCCC GCCGTTTTAC GAGCAGTATT TGCGCGAGGT GTACCGGGAT
GAGGCCTTCA GCGCCGAGGA GGCGTTCGCC AGCATCGACG GGCTTTCCGA CTTCCCCGAG
ATGGTGGTGC TGCTCAAGCA GGTGGAAGCC TACCGCGGAC GCGGCGCGTC GGCGCGGCTG
TTCTACCGCA GCAAGGTTGA GGAAGCGGTG GCGCTCGTCG TGGACAAGTC GCGCGCGATG
GCCGGCGAGC GCGCCAGCGA ACTGGCCAGC GAGGACATGC ACGCCATCGA GCGCGTGCGG
CGGCGCCTGG AGCAGCAGCT GGCCGCGCCC GTGGACGCCG ACGAGCTGGC CCGCATCGCC
TGCATGGGCC AGACCAAGCT GCGGCGCACC TTCAAGCAGG CGTGCGGCTG CACCATCGTG
GAGTACCGCC AGCGCTTGCG CTGCGCGAAG GCCGCCGAGC TGCTGGCCGC CGGCGACGCG
CCCGTGGCGC AGGTTGCCGC AGCCGTCGGC TACCGCCCCG AGCGCCTGGC CGAGCTGTTC
GCCCGCACCC ACCACACCAC CCCCAGCGCC TACCGTGCCG CCATGCGCTA G
 
Protein sequence
MIRADGNRDG TAAACADYDI MREVFLPCLA DMQMEETCAA DVGAPEREGR MFRMPRDVAS 
GYFWLYAERD FAISATDMLF PRDFREQCRH PRFVSVRYYL SGSCVESVTN RTVEAPYLEG
HVLDTPHWDC LCRAGTPIRN VEIMLAPPFY EQYLREVYRD EAFSAEEAFA SIDGLSDFPE
MVVLLKQVEA YRGRGASARL FYRSKVEEAV ALVVDKSRAM AGERASELAS EDMHAIERVR
RRLEQQLAAP VDADELARIA CMGQTKLRRT FKQACGCTIV EYRQRLRCAK AAELLAAGDA
PVAQVAAAVG YRPERLAELF ARTHHTTPSA YRAAMR