Gene Dtox_3372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3372 
SymbolgroEL 
ID8430366 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp3579080 
End bp3580720 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content47% 
IMG OID645035606 
Productchaperonin GroEL 
Protein accessionYP_003192725 
Protein GI258516503 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAAC AAATTATCTT CAACGAAGAT GCCCGCAAAG CATTGGAAAA AGGAGTCAAT 
CAATTAGCCG AAGCTGTACG TGTAACCCTT GGCCCGAAAG GCCGGAATGT GGTTCTTGAT
AAAAAATTCG GTGCGCCGAC AATTACCAAC GACGGTGTCA CCATTGCCAG AGAGATTGAA
CTGCCGGATG TATTCGAAAA CATGGGCGCT CAGCTGGTAA AAGAAGTTGC TACCAAAACC
AACGATGTAG CAGGTGACGG TACCACAACC GCTACGGTAC TGGCTCAAGC TATGGTTCGC
GAAGGCTTAA GAAACGTTAC TGCCGGTGCC AACCCGATGA TTATCAAGCG TGGTATTGAG
AAGGCTGTGG AAAAAGCAGT AGATGCTATT AAAAACAGCT CCAAGCCGAT TGAAAGCAAA
GGTGCTATTG CCCAGGTTGC TTCAATTTCT GCTAATGATG AAACTATCGG TAATTTAATT
GCCGACGCCA TGGAAAAAGT AGGAAAAGAC GGTGTTATCA CTGTTGAGGA ATCCAAGGGT
ATCGGTACCA CTTTAGATGT AGTGGAAGGT ATGAATTTTG ACCGCGGCTA TATTTCTCCG
TATATGATTA CCGATACTGA TAAAATGGAA GCAGATTTGG AGGAGCCCTA CATACTGTTG
ACAGACAAGA AGATTTCCTC CATTCAGGAA ATTCTGCCCA TTTTGGAAAA AGTGGTTCAG
TCCGGCAAAG CGCTTTTGAT CATTGCAGAA GATTTGGAAG GCGAAGCTCT GGCTACTCTG
GTTCTCAATA AACTGCGCGG AACCTTCACT TGTGTAGCAG TGAAAGCTCC TGGTTTCGGT
GATCGCCGCA AAGCCATGAT GCAGGATATA GCTATTCTAA CCGGTGCTCA GGTGATTACT
GAAGAACTCG GCTTAAAGCT GGATAAAGCT ACTATTGATA TGCTCGGCAG AGCTTCCAGA
GTCAGAGTTA AGAAAGAAGA AACCATCATT GTCGGCGGTT CCGGCAGTGT GGATGAAATC
AAACAGCGTG TTAACCAAAT CAAGGCACAG ATCGAAGAAA GCACTTCCGA CTTTGACCGC
GAGAAGCTCC AGGAGCGTTT GGCAAAGCTG GCCGGCGGCG TAGCCGTAAT CCAAGTTGGT
GCTGCCACTG AAGTTGAAAT GAAAGAGAAG AAGCTGCGCA TTGAGGATGC TCTTAATGCT
ACCAGGGCTG CCGTGGAAGA AGGTATCGTG TCCGGTGGCG GTGTTGCTTA TGTAAGCATT
ATTCCTGATC TTGTAGATAT GGAAGCAGCT AATTTAGACG AGAAATCCGG TATTGATATT
GTTCGCCGCG CTCTGGAAGA TCCCTTGCGC CAGATTGCCA ACAATGCAGG TCTTGAAGGC
TCAGTTGTGG TGGAAAAAGT TAAGGTTTCC GAAAACGGTG TAGGTTTCAA CGCCTTGACA
GGTGAATATG TCAATATGAT CGATGCCGGT ATTGTGGACC CGGCTAAAGT TACCCGCTCT
GCCCTGCAGA ACGCTGCCAG CATTGCTGCT ATGATTCTGA CCACTGAAAC CCTGATAGCT
GAGAAACCTG AAGAGGGTAA GGATGCTATG GCCGGCATGG GCGGCATGGG CGGTATGGGT
GGCATGGGCG GCATGATGTA A
 
Protein sequence
MAKQIIFNED ARKALEKGVN QLAEAVRVTL GPKGRNVVLD KKFGAPTITN DGVTIAREIE 
LPDVFENMGA QLVKEVATKT NDVAGDGTTT ATVLAQAMVR EGLRNVTAGA NPMIIKRGIE
KAVEKAVDAI KNSSKPIESK GAIAQVASIS ANDETIGNLI ADAMEKVGKD GVITVEESKG
IGTTLDVVEG MNFDRGYISP YMITDTDKME ADLEEPYILL TDKKISSIQE ILPILEKVVQ
SGKALLIIAE DLEGEALATL VLNKLRGTFT CVAVKAPGFG DRRKAMMQDI AILTGAQVIT
EELGLKLDKA TIDMLGRASR VRVKKEETII VGGSGSVDEI KQRVNQIKAQ IEESTSDFDR
EKLQERLAKL AGGVAVIQVG AATEVEMKEK KLRIEDALNA TRAAVEEGIV SGGGVAYVSI
IPDLVDMEAA NLDEKSGIDI VRRALEDPLR QIANNAGLEG SVVVEKVKVS ENGVGFNALT
GEYVNMIDAG IVDPAKVTRS ALQNAASIAA MILTTETLIA EKPEEGKDAM AGMGGMGGMG
GMGGMM