Gene Saro_0035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0035 
SymbolgroEL 
ID3916038 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp35710 
End bp37353 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content65% 
IMG OID640442760 
Productchaperonin GroEL 
Protein accessionYP_495318 
Protein GI87198061 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0226788 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGCCA AGGACGTAAA GTTCGGCCGC GACGCTCGTG AACGCATTCT GCGCGGCGTC 
GACATCCTCG CTGATGCCGT GAAGGTCACG CTGGGCCCCA AGGGCCGCAA CGTCGTGATC
GACAAGAGCT TCGGCGCCCC CCGCATCACC AAGGACGGTG TTTCGGTCGC CAAGGAAATC
GAGCTCAAGG ACAAGTTCGA GAACATGGGC GCCCAGATGC TGCGCGAAGT GGCCTCGAAG
GCCAACGACG CGGCCGGTGA CGGCACCACC ACCGCGACCG TTCTTGCACA GGCGATCGTT
CGCGAAGGCA TGACCGCGGT TGCCGCCGGC ATGAACCCGA TGGACCTCAA GCGCGGCATC
GACATCGCCG TCGGCAAGGT CGTCGAGAAC CTCAAGGCCC GTTCGACCCC GGTCGCCGGT
TCGTCGGAAA TCGCCCAGGT CGGCATCATC TCGGCCAACG GCGACACCGA AGTCGGCCAG
AAGATCGCCG AGGCGATGGA GAAGGTCGGC AAGGAAGGCG TGATCACCGT TGAAGAGGCC
AAGGGCCTCG AATTCGAACT CGATGTCGTC GAAGGCATGC AGTTCGACCG CGGCTACCTC
TCGCCCTACT TCATCACCAA CCCGGAAAAG ATGACGGTCG AACTCGAGAA CCCGTACATC
CTGATCCACG AGAAGAAGCT GTCGTCGCTC CAGGCGATGC TGCCGATCCT CGAAGCCGTG
GTGCAGTCGG GCCGTCCGCT CCTCATCATC GCCGAGGACA TCGAAGGCGA AGCGCTGGCC
ACCCTCGTGG TCAACAAGCT GCGCGGTGGC CTCAAGATTG CCGCCGTCAA GGCTCCGGGC
TTCGGTGACC GCCGCAAGGC CATGCTGGGC GACATCGCCA CGCTGACCGC CGGCGAAATG
ATCTCCGAAG ACCTCGGCAT CAAGCTGGAG AGCGTCACGC TCGCCATGCT CGGCCAGGCC
AAGAAGGTCA CCATCGACAA GGACAACACC ACGATCGTCG ACGGCGCCGG TTCGGCCGAA
GAGATCAAGG CCCGCGTCGA GCAGATCCGT GCGCAGATCG AAGTCACCAC TTCGGACTAC
GACCGCGAGA AGCTGCAGGA ACGCCTTGCC AAGCTTGCTG GCGGCGTTGC CGTGATCAAG
GTCGGCGGCG CGACCGAAGT CGAGGTCAAG GAGCGCAAGG ACCGCGTCGA CGACGCTCTC
CACGCCACCC GCGCCGCAGT CGAGGAAGGT ATCGTCCCGG GCGGCGGTAC GGCTCTGCTC
TATGCCACCA AGGCTCTCGA AGGCCTCAAG GGCGCCAACG ACGACCAGAC CAAGGGCATC
GACATCGTGC GCCGCGCGAT CCAGGCCCCG ATCCGTCAGA TCGCCGCGAA CGCGGGTCAT
GACGGTGCGG TCGTCTCGGG CAACCTGCTG CGCGAGAACG ACGAGAACCA GGGCTTCAAC
GCCGCGACCG ACACCTACGA GAACCTGAAG GCCGCCGGCG TCATCGACCC GACCAAGGTC
GTGCGCACCG CGCTTCAGGA CGCTGCCTCG GTCTCGGGCC TGCTGATCAC CACGGAAGCG
GCGATCAGCG AGAAGCCGGA CGACAAGCCC GCGATGCCGC CGATGGGCGG TGGAATGGGC
GGCATGGGCG GCATGGACTT CTAA
 
Protein sequence
MAAKDVKFGR DARERILRGV DILADAVKVT LGPKGRNVVI DKSFGAPRIT KDGVSVAKEI 
ELKDKFENMG AQMLREVASK ANDAAGDGTT TATVLAQAIV REGMTAVAAG MNPMDLKRGI
DIAVGKVVEN LKARSTPVAG SSEIAQVGII SANGDTEVGQ KIAEAMEKVG KEGVITVEEA
KGLEFELDVV EGMQFDRGYL SPYFITNPEK MTVELENPYI LIHEKKLSSL QAMLPILEAV
VQSGRPLLII AEDIEGEALA TLVVNKLRGG LKIAAVKAPG FGDRRKAMLG DIATLTAGEM
ISEDLGIKLE SVTLAMLGQA KKVTIDKDNT TIVDGAGSAE EIKARVEQIR AQIEVTTSDY
DREKLQERLA KLAGGVAVIK VGGATEVEVK ERKDRVDDAL HATRAAVEEG IVPGGGTALL
YATKALEGLK GANDDQTKGI DIVRRAIQAP IRQIAANAGH DGAVVSGNLL RENDENQGFN
AATDTYENLK AAGVIDPTKV VRTALQDAAS VSGLLITTEA AISEKPDDKP AMPPMGGGMG
GMGGMDF