Gene Daud_2007 details

Gene Information

Locus tag: Daud_2007
Symbol: groEL
ID: 6026392
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Desulforudis audaxviator MP104C
Kingdom: Bacteria
Replicon accession: NC_010424
Strand:
Start bp: 2113684
End bp: 2115327
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 64%
IMG OID: 641594829
Product: chaperonin GroEL
Protein accession: YP_001718130
Protein GI: 169832148
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
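
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names span_length and expected_protein_length are hypothetical.

```python
# Values copied from the gene record.
start_bp = 2113684
end_bp = 2115327
gene_length_bp = 1644
protein_length_aa = 547


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Do the coordinates agree with the stated gene length?
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
# Does the gene length agree with the stated protein length?
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa

print(coords_ok, protein_ok)
```

Under these assumptions both checks pass for the values listed in this record.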


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGCTA AGGACATAGT CTACCGTGAA GACGCGCGGA CCGCGATGGA ACGGGGCGTT 
AACGCCCTGG CCGACGCGGT GCGGGTGACG CTCGGGCCGA AAGGCCGCAA CGTGGTCCTG
GAAAAGAAAT TCGGTTCGCC GATGATCGTC AACGACGGCG TGACCATCGC CCGGGAGATC
GAACTGGAGA ATCCGTTCGA AAACATGGGG GCCCAGTTGG TCAAAGAAGT GGCCACCAAG
ACCAACGACA TCGCCGGTGA CGGGACCACC ACCGCCGCGG TGCTCGCCCA GGCCATCGTG
CGGGCCGGGC TGAAGAACGT GACGGCCGGT GCGAACCCGA TGATCCTGAA GCGGGGCATT
GAGAAGGCCG TGGAACGGAC CGTGGAGGAG ATCAAGAGCC GGGCCAAGCC GGTGGAGAGC
AAGGAGGCCA TCACCCAGGT GGCGTCCATT TCGGCGAACG ACACCACCAT CGGCAACCTG
ATCGCCGACG CGATGGAGAA GGTCGGCAAG GACGGCGTGA TCACCGTCGA GGAGTCCAAG
GGTATGGGCA CCTCACTGGA AGTCGTGGAC GGCATGAACT TCGACCGGGG TTACATTTCC
CCGTACATGA TCACCGACCC CGACAAGATG GAAGCCACCC TTGCCGATCC GTACATTCTG
ATCACCGACA AGAAAATCTC AGCCGTGGCC GATATTTTGC CCATCCTGGA GAAGGTGCTC
CAGGCCGGGA AGGCGCTCTT GATTATCGCC GAGGACGTGG AAGGCGAAGC GCTGGCCACC
CTGGTGGTCA ACAAGCTGCG GGGCACCTTA AACGTCGTGG CCGTCAAGGC GCCGGGGTTC
GGCGACCGCC GCAAGGCGAT GCTCGAGGAC ATCGCCATCC TCACCGGCGG CCGGGTGGTC
AGCGAAGAGG TGGGCTTGAA GCTCGACAAG GCCGGCCTGG ACCTCCTGGG CAAGGCCCGC
CAGGTCCGGG TGAAGAAGGA CGAGACCATC GTCGTGGACG GCCAGGGCGA CGCGGACGCG
ATCACCAAGC GGCTGGCCCA GATCAAAAAG CAGATCGAGG ACACCACCTC AGACTTCGAC
CGGGAGAAGC TCCAGGAGCG GCTGGCGAAA CTGGCCGGCG GCGTGGCGGT CATCAACGTG
GGCGCGGCGA CCGAGACCGA GATGAAGGAG AAAAAGCTCC GCATCGAGGA CGCCCTGAAC
GCGACCCGCG CGGCCGTGGA GGAGGGCATC GTCCCCGGGG GCGGCACCGT GTACGTGAAC
GTGATTCCCG TCCTGAACGG GCTGGAGCCC GAGCTTCCCG ACGAGCGGAC CGGTGTCGAC
ATCATCAAGC GCGCCCTTGA GGCCCCGCTG CGGCAGATCG CCAACAACGC CGGGGTAGAG
GGTTCCATCG TGGTCGAAAA AGTGAAGGAA AGCCCGGCGG GCGTTGGTTT CGACGCCCTC
AGCGAGCAAT ACACCGATAT GATCGGGGCC GGTATCGTGG ACCCGGCGAA GGTGACCCGC
ATCGCCCTGC AGAACGCCGC CAGCATCGCG GCGATGATCC TGACCACCGA GACCCTGGTG
GCCGAGAAGG TGGACAAGGA CAAGAAGGGC GGCATGGGCG GCATGGGCGG TATGGGCGGT
ATGGGCGGCA TGGACATGAT GTAG
 
Protein sequence
MAAKDIVYRE DARTAMERGV NALADAVRVT LGPKGRNVVL EKKFGSPMIV NDGVTIAREI 
ELENPFENMG AQLVKEVATK TNDIAGDGTT TAAVLAQAIV RAGLKNVTAG ANPMILKRGI
EKAVERTVEE IKSRAKPVES KEAITQVASI SANDTTIGNL IADAMEKVGK DGVITVEESK
GMGTSLEVVD GMNFDRGYIS PYMITDPDKM EATLADPYIL ITDKKISAVA DILPILEKVL
QAGKALLIIA EDVEGEALAT LVVNKLRGTL NVVAVKAPGF GDRRKAMLED IAILTGGRVV
SEEVGLKLDK AGLDLLGKAR QVRVKKDETI VVDGQGDADA ITKRLAQIKK QIEDTTSDFD
REKLQERLAK LAGGVAVINV GAATETEMKE KKLRIEDALN ATRAAVEEGI VPGGGTVYVN
VIPVLNGLEP ELPDERTGVD IIKRALEAPL RQIANNAGVE GSIVVEKVKE SPAGVGFDAL
SEQYTDMIGA GIVDPAKVTR IALQNAASIA AMILTTETLV AEKVDKDKKG GMGGMGGMGG
MGGMDMM