Gene Emin_0100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0100 
Symbol 
ID6263633 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp104337 
End bp106196 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content46% 
IMG OID642610562 
Productchaperone protein DnaK 
Protein accessionYP_001875003 
Protein GI187250521 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR02350] chaperone protein DnaK 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.843776 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.187258 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAGAA TCATAGGTAT AGACTTAGGA ACATCAAATA CGGCTGCTGC GGCCATGGAA 
GGCGGCAGGG CCACAATCAT TCCTTCAGCA GAAGGCAGCT CTATCGGAGG CAAAGCGTTT
CCTTCATACG TGGCGTTTAC CAAAGACGGA CAGAGATTGG TAGGCGAACC CGCCAGAAGG
CAGGCTATCG CCAATCCTGA AGGCACGGTT ACAGCTTTCA AAAGAAGAAT GGGCGAAGAT
TACAAATTTA CTCTAAGAGG CCAGGAATTT ACACCACAGC AGTTATCGGC TTTCGTATTA
CAGAAAGTTA AAAAAGACGC CGAGGCGTTC TTAGGAGAAC CTGTTGAAAA AGCGGTTATC
ACCGTACCCG CCTATTTTAA CGACAACCAA AGACAGGCCA CCAAAGACGC GGGCAGAATC
GCGGGTTTAG AAGTTGTAAG ACTTGTTAAC GAACCTACCG CGGCCGCCCT TGCCTACGGT
ATTGATAAAG CGGGCAAAGA ACAAAAAATA ATGGTATTTG ACTTAGGCGG CGGTACGCTT
GACGTTACAA TAATGGAAAT GGGTAAAGAA GGAACATTTG ACGTTTTATC CACCTCCGGC
GACACAAAAC TCGGCGGTAC TGATATGGAC AACGCCATCA TTGAATGGAT GGTAAGCGAA
TTTAAAAAAT CAACCGGCAT TGACTTATCA GCCGACAAAC AGGCCGCGCA ACGCTTAAAA
GACGCCGCGG AAAAAGCAAA AATCGAACTT TCCACTACAA TGGAAACCGA CATTAACCTT
CCGTTTATTA GCGCTGGAGC CGACGGCCCG AAACATTTGG AGCTTAAACT TTCCAGAGCT
AAACTTGAAA GCTTAGTTGA TTCCATTGTA AAACGCTGCG GCGCTTCCAT TGACCAGGCT
TTAAACGATT CTTCGCTTAA ATCAACCGAA ATAGACAAGA TTATTTTAGT AGGCGGCCCC
ACAAGAATGC CTATAGTCCA AAAATATGTT GAAGACCATG CCGGCAAAAA AATTGAACGC
GGCATTGACC CTATGGAATG CGTTGCCACA GGCGCCGCCG TACAAGCGGG TATTTTAACG
GGCGACGTTA AAGACGTTCT TTTATTAGAC GTTACCCCGT TATCCTTAGG TCTTGAAACC
TTAGGAGGAG TAACAACAAG GCTTATTGAA AGAAACACAA CCATACCTGT CAGAAAAACT
CAGGTCTTCA GCACCGCTTC GGACAATCAG CCCGCGGTTA CAATTAACGT TCTTCAGGGC
GAACGCCCCA TGGCAAAGGA CAATGTGCCT TTAGGCAAGT TTGATTTAGA CGGCATTCCA
CCAGCGCCGA GAGGCGTACC GCAGATCGAG GTTACCTTTG ACATTGACGC TAACGGTATT
TTAAACGTTT CCGCCAAAGA TTTGGGCACA AACAAACAAC AGCATATTAC AATTACTTCC
AAAACAAAAT TAAGCGACGA TGAAGTACAA AAATTTGTTA AAGAAGCAGA GAAATTTGCT
GATGAAGATA AGAAAACCAA AGAAAGAGTT GACGCTAAAA ACGAGGCTGA TTCAGTGCTC
TTCCAAACGG AAAAAGCGCT TAAAGAACAC GGCGATAAAG TTCCCCAGGA AGACAGACTT
AACATTGACC GCGCTTTAGG AGACCTTAAG GAAGCGTTAA AAGGCGACGA TGTTGAAAGA
ATTAAAAAAG CCAAAGACGA CGCGCTTGCT GCAAGCCAAA AACTTGGGGA AATAATATAT
AAAGAATCCC AGGCTAAAGC ACAAGGCGCG GCAGGCCCTC AACCGGGCGC GCAAGCCCAA
GGCCAGCCCA ACGACGGCGG CAAAGAAGAT GTTGTTGAAG CTGAAGTTGT TGATAAATAA
 
Protein sequence
MARIIGIDLG TSNTAAAAME GGRATIIPSA EGSSIGGKAF PSYVAFTKDG QRLVGEPARR 
QAIANPEGTV TAFKRRMGED YKFTLRGQEF TPQQLSAFVL QKVKKDAEAF LGEPVEKAVI
TVPAYFNDNQ RQATKDAGRI AGLEVVRLVN EPTAAALAYG IDKAGKEQKI MVFDLGGGTL
DVTIMEMGKE GTFDVLSTSG DTKLGGTDMD NAIIEWMVSE FKKSTGIDLS ADKQAAQRLK
DAAEKAKIEL STTMETDINL PFISAGADGP KHLELKLSRA KLESLVDSIV KRCGASIDQA
LNDSSLKSTE IDKIILVGGP TRMPIVQKYV EDHAGKKIER GIDPMECVAT GAAVQAGILT
GDVKDVLLLD VTPLSLGLET LGGVTTRLIE RNTTIPVRKT QVFSTASDNQ PAVTINVLQG
ERPMAKDNVP LGKFDLDGIP PAPRGVPQIE VTFDIDANGI LNVSAKDLGT NKQQHITITS
KTKLSDDEVQ KFVKEAEKFA DEDKKTKERV DAKNEADSVL FQTEKALKEH GDKVPQEDRL
NIDRALGDLK EALKGDDVER IKKAKDDALA ASQKLGEIIY KESQAKAQGA AGPQPGAQAQ
GQPNDGGKED VVEAEVVDK