Gene Elen_3072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_3072 
Symbol 
ID8417407 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp3571484 
End bp3573394 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content66% 
IMG OID645026052 
Productchaperone protein DnaK 
Protein accessionYP_003183404 
Protein GI257792798 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR02350] chaperone protein DnaK 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0257567 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGA TTCTTGGTAT CGACTTGGGC ACGACGAACT CCGCCATGGC CGTCATGGAG 
GGCTCCGAGC CCGAAATCCT CGTGAACGCC GAGGGCGACC GCACCACCCC GTCGGTCGAG
GGCTTCCGCA AGGACGGCGA GCGCGTCGTG GGCAAGGCCG CGAAGAACCA GGCCGTCACG
AACCCCGAGA ACACCGTGTC GTCCGTGAAG CGCTTCATCG GCCGTTCCTA CGACGAGACG
CCGGAAGAGC GCAAGACCGT CAGCTACAAG CTGCAGAAGG GCAAGGACGG CCGCGCGGTG
GTCGACATCG ACGGCAAGGA CTACACGCCG GAGGAAATCT CCGCCATGGT ACTGCAGAAG
CTGAAGAACG ACGCCGAGAA GCAGCTGGGC TCCCCGGTGA CGCAGGCCGT CATCACGGTG
CCCGCGTACT TCAACGACGC GCAGCGCCAG GCCACGAAGG ACGCCGGCAA GATCGCGGGC
CTCGAAGTGC TCCGCATCAT CAACGAGCCC ACGGCCGCCG CGCTGGCCTA CGGCCTCGAC
AAGACCAACA AGGATGAGAA GATCCTCGTC TTCGACCTGG GCGGCGGTAC GTTCGACGTG
TCCATCCTGG AGCTGGGCGA CGGCGTGTTC GAGGTTGCGT CCACCGCGGG CGACAACCAC
CTGGGCGGCG ACGACTGGGA CCAGCGCATC ATCGACTGGA TGGCTGACAA GTTCCAGGCC
GAGAACGGCA TCGACCTGCG CCAGGACAAG ATGGCTCTGC AGCGTTTGAA GGAAGCCGCC
GAGAAGGCGA AGATGGAGCT GTCCTCCACC ACGCAGGCCA ACATCAACCT GCCGTTCATC
ACGGCCGACG CTTCCGGCCC GAAGCACCTC GACTACACGC TGACGCGCGC CGAGTTCGAG
CGCATCACGA AGGATCTGCT CGACCGCGTG AAGAAGCCCG TTGAGCAGGC GCTCAAGGAT
GCCGGCCTCA AGACGGGCGA CATCGACGAG GTCATCCTCG TGGGCGGCTC CACCCGTATG
CCCGCCGTGC AGGACCTCGT GAAGAAGCTC ACCGGCAAGG ATCCGAACAT GTCCGTGAAC
CCGGACGAGG TCGTGGCCAT GGGTGCGGCG GTCCAGGGCG GCGTGCTGGC CGGCGACGTC
GAGGGCATCC TGCTGCTCGA CGTGACCCCG CTGTCGCTGG GCGTGGAGAC GATGGGCGGC
GTCATGACGA AGATGATCGA GCGCAACACC ACCATCCCCA CCCGCAAGAC CGAGATCTAC
TCCACCGCGT CCGACAACCA GACGTCGGTC GAGGTGCACG TGCTGCAGGG CGAGCGCCAG
ATGGCCTCCG ACAACAAGAC GCTGGGCAAG TTCCAGCTCA CCGGCATCCC GGCTGCGCGC
CGTGGTGTGC CGCAGATCGA GGTCACTTTC GACATCGACG CCAACGGCAT CGTGAACGTG
TCGGCGAAGG ACCTGGGCAC CGGCAAGCAG CAGCAGATCA CCATCTCCGG CTCCACCGCG
CTGAACGACG ACGAGGTCGA GCGCATGGTG AAGGACGCCG AGGCCCATGC CGAGGAAGAC
GCCAAGCGCA AGGAAGAGAT CGAGGTTCGC AACAACGCCG ACGCGTTGGT GAACGCCACC
GAGCAGACGC TCCAGGAAGT GGGCGACAAG GCTCCGGCCG ACGTGAAGTC CGCCGCTGAG
GAGGCCATCG CCGAGGCGAA GCAGGCGCTC GAGGGCTCCG ACATGGACGC CATCAAGGCC
GCGACCGAGA AGATGCAGCA GGCGGGCTAC AAGCTGGCCG AGGTCGTGTA CTCCACGCAG
GGCCCGGACG CCGCTTCGCA GGCCGCTGCC GCCGAGTCCA CCCCGGCCGA TGACACCATC
GAAGCCGACT ACGAGGTCGT CGAGGACGAC GACAAGAAGG AAGGGAAGTA A
 
Protein sequence
MSKILGIDLG TTNSAMAVME GSEPEILVNA EGDRTTPSVE GFRKDGERVV GKAAKNQAVT 
NPENTVSSVK RFIGRSYDET PEERKTVSYK LQKGKDGRAV VDIDGKDYTP EEISAMVLQK
LKNDAEKQLG SPVTQAVITV PAYFNDAQRQ ATKDAGKIAG LEVLRIINEP TAAALAYGLD
KTNKDEKILV FDLGGGTFDV SILELGDGVF EVASTAGDNH LGGDDWDQRI IDWMADKFQA
ENGIDLRQDK MALQRLKEAA EKAKMELSST TQANINLPFI TADASGPKHL DYTLTRAEFE
RITKDLLDRV KKPVEQALKD AGLKTGDIDE VILVGGSTRM PAVQDLVKKL TGKDPNMSVN
PDEVVAMGAA VQGGVLAGDV EGILLLDVTP LSLGVETMGG VMTKMIERNT TIPTRKTEIY
STASDNQTSV EVHVLQGERQ MASDNKTLGK FQLTGIPAAR RGVPQIEVTF DIDANGIVNV
SAKDLGTGKQ QQITISGSTA LNDDEVERMV KDAEAHAEED AKRKEEIEVR NNADALVNAT
EQTLQEVGDK APADVKSAAE EAIAEAKQAL EGSDMDAIKA ATEKMQQAGY KLAEVVYSTQ
GPDAASQAAA AESTPADDTI EADYEVVEDD DKKEGK