Gene Information |
Locus tag | Elen_3072 |
Symbol | |
ID | 8417407 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 3571484 |
End bp | 3573394 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645026052 |
Product | chaperone protein DnaK |
Protein accession | YP_003183404 |
Protein GI | 257792798 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0443] Molecular chaperone |
TIGRFAM ID | [TIGR02350] chaperone protein DnaK |
|
|
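The coordinate and length fields above agree with each other if the coordinates are read as 1-based and inclusive. That convention is an assumption, since the record does not state it. A minimal Python sketch of the check, with hypothetical helper names (`gene_length`, `protein_length`) that are not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the Start bp / End bp rows of the record.
length = gene_length(3571484, 3573394)
print(length, protein_length(length))
```

With the values in this record the sketch yields 1911 bp and 636 aa, matching the Gene Length and Protein Length rows.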
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0257567 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAAGA TTCTTGGTAT CGACTTGGGC ACGACGAACT CCGCCATGGC CGTCATGGAG GGCTCCGAGC CCGAAATCCT CGTGAACGCC GAGGGCGACC GCACCACCCC GTCGGTCGAG GGCTTCCGCA AGGACGGCGA GCGCGTCGTG GGCAAGGCCG CGAAGAACCA GGCCGTCACG AACCCCGAGA ACACCGTGTC GTCCGTGAAG CGCTTCATCG GCCGTTCCTA CGACGAGACG CCGGAAGAGC GCAAGACCGT CAGCTACAAG CTGCAGAAGG GCAAGGACGG CCGCGCGGTG GTCGACATCG ACGGCAAGGA CTACACGCCG GAGGAAATCT CCGCCATGGT ACTGCAGAAG CTGAAGAACG ACGCCGAGAA GCAGCTGGGC TCCCCGGTGA CGCAGGCCGT CATCACGGTG CCCGCGTACT TCAACGACGC GCAGCGCCAG GCCACGAAGG ACGCCGGCAA GATCGCGGGC CTCGAAGTGC TCCGCATCAT CAACGAGCCC ACGGCCGCCG CGCTGGCCTA CGGCCTCGAC AAGACCAACA AGGATGAGAA GATCCTCGTC TTCGACCTGG GCGGCGGTAC GTTCGACGTG TCCATCCTGG AGCTGGGCGA CGGCGTGTTC GAGGTTGCGT CCACCGCGGG CGACAACCAC CTGGGCGGCG ACGACTGGGA CCAGCGCATC ATCGACTGGA TGGCTGACAA GTTCCAGGCC GAGAACGGCA TCGACCTGCG CCAGGACAAG ATGGCTCTGC AGCGTTTGAA GGAAGCCGCC GAGAAGGCGA AGATGGAGCT GTCCTCCACC ACGCAGGCCA ACATCAACCT GCCGTTCATC ACGGCCGACG CTTCCGGCCC GAAGCACCTC GACTACACGC TGACGCGCGC CGAGTTCGAG CGCATCACGA AGGATCTGCT CGACCGCGTG AAGAAGCCCG TTGAGCAGGC GCTCAAGGAT GCCGGCCTCA AGACGGGCGA CATCGACGAG GTCATCCTCG TGGGCGGCTC CACCCGTATG CCCGCCGTGC AGGACCTCGT GAAGAAGCTC ACCGGCAAGG ATCCGAACAT GTCCGTGAAC CCGGACGAGG TCGTGGCCAT GGGTGCGGCG GTCCAGGGCG GCGTGCTGGC CGGCGACGTC GAGGGCATCC TGCTGCTCGA CGTGACCCCG CTGTCGCTGG GCGTGGAGAC GATGGGCGGC GTCATGACGA AGATGATCGA GCGCAACACC ACCATCCCCA CCCGCAAGAC CGAGATCTAC TCCACCGCGT CCGACAACCA GACGTCGGTC GAGGTGCACG TGCTGCAGGG CGAGCGCCAG ATGGCCTCCG ACAACAAGAC GCTGGGCAAG TTCCAGCTCA CCGGCATCCC GGCTGCGCGC CGTGGTGTGC CGCAGATCGA GGTCACTTTC GACATCGACG CCAACGGCAT CGTGAACGTG TCGGCGAAGG ACCTGGGCAC CGGCAAGCAG CAGCAGATCA CCATCTCCGG CTCCACCGCG CTGAACGACG ACGAGGTCGA GCGCATGGTG AAGGACGCCG AGGCCCATGC CGAGGAAGAC GCCAAGCGCA AGGAAGAGAT CGAGGTTCGC AACAACGCCG ACGCGTTGGT GAACGCCACC GAGCAGACGC TCCAGGAAGT GGGCGACAAG GCTCCGGCCG ACGTGAAGTC CGCCGCTGAG GAGGCCATCG CCGAGGCGAA GCAGGCGCTC GAGGGCTCCG ACATGGACGC CATCAAGGCC GCGACCGAGA AGATGCAGCA GGCGGGCTAC AAGCTGGCCG AGGTCGTGTA CTCCACGCAG 
GGCCCGGACG CCGCTTCGCA GGCCGCTGCC GCCGAGTCCA CCCCGGCCGA TGACACCATC GAAGCCGACT ACGAGGTCGT CGAGGACGAC GACAAGAAGG AAGGGAAGTA A
|
Protein sequence | MSKILGIDLG TTNSAMAVME GSEPEILVNA EGDRTTPSVE GFRKDGERVV GKAAKNQAVT NPENTVSSVK RFIGRSYDET PEERKTVSYK LQKGKDGRAV VDIDGKDYTP EEISAMVLQK LKNDAEKQLG SPVTQAVITV PAYFNDAQRQ ATKDAGKIAG LEVLRIINEP TAAALAYGLD KTNKDEKILV FDLGGGTFDV SILELGDGVF EVASTAGDNH LGGDDWDQRI IDWMADKFQA ENGIDLRQDK MALQRLKEAA EKAKMELSST TQANINLPFI TADASGPKHL DYTLTRAEFE RITKDLLDRV KKPVEQALKD AGLKTGDIDE VILVGGSTRM PAVQDLVKKL TGKDPNMSVN PDEVVAMGAA VQGGVLAGDV EGILLLDVTP LSLGVETMGG VMTKMIERNT TIPTRKTEIY STASDNQTSV EVHVLQGERQ MASDNKTLGK FQLTGIPAAR RGVPQIEVTF DIDANGIVNV SAKDLGTGKQ QQITISGSTA LNDDEVERMV KDAEAHAEED AKRKEEIEVR NNADALVNAT EQTLQEVGDK APADVKSAAE EAIAEAKQAL EGSDMDAIKA ATEKMQQAGY KLAEVVYSTQ GPDAASQAAA AESTPADDTI EADYEVVEDD DKKEGK
|
| |
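The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following Python sketch shows one way to do that and to run two simple sanity checks (GC fraction and CDS framing). The function names are hypothetical. The start-codon test is a simplification, because translation table 11 also permits alternative start codons such as GTG and TTG.

```python
# Stop codons used by translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, ATG start, terminal stop codon.

    Simplification: only ATG is accepted as a start codon here.
    """
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# Only the first two blocks of the gene sequence are used here, for brevity.
fragment = clean_sequence("ATGAGCAAGA TTCTTGGTAT")
print(fragment, gc_fraction(fragment))
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_fraction` and `looks_like_complete_cds` would allow a comparison with the GC content and Gene Length rows of the record.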