Gene Nmul_A2344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2344 
SymbolgroEL 
ID3784748 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2671276 
End bp2672934 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content58% 
IMG OID637812435 
Productchaperonin GroEL 
Protein accessionYP_413027 
Protein GI82703461 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000584882 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGCGA AAGAGGTTAA ATTCAGCGAT TCAGCTCGCC ACAAGATGGT GAATGGAGTC 
AACATCCTGG CCGATGCGGT AAAAGTCACT CTCGGACCCA AGGGGCGTAA TGTCGTCCTG
GAGCGTTCCT ACGGCTCGCC CACCATCACC AAGGATGGAG TTTCAGTAGC GAAGGAAATC
GAACTGAAAG ACAAGTTCGA GAACATGGGT GCGCAGATGG TAAAGGAAGT CGCCAGCAAG
ACTTCCGATG TTGCCGGTGA CGGCACGACC ACGGCTACCG TTCTGGCGCA ATCCATTGTC
AAGGAAGGCA TGAAGTACGT CGCCGCCGGG ATGAATCCCA TGGATCTCAA GCGGGGAATC
GACAAGGCGG TCGTCGCCAC GGTTGAGGAA CTGAAGAAAC TTTCCAAACC CTGCACCACC
GGCAAGGAAA TCGCGCAGGT CGGCAGCATT TCCGCCAATT CCGATCCCGA AATCGGCAAG
ATCATTGCTG ATGCAATGGA AAAAGTTGGC AAGGAAGGTG TCATCACCGT GGAGGACGGT
TCCGGGCTGC AAAACGAACT GGAAGTAGTC GAAGGCATGC AGTTCGATCG TGGCTACCTG
TCGCCCTACT TCATTAACAA TGCGGACAGG CAGATCGCAC TGCTGGAAAG CCCGTTCATC
CTCTTGCACG ACAAGAAAAT CTCCAACATC CGTGACTTGC TGCCAGTACT GGAACAGGTG
GCGAAGGCCG GCAAGCCGTT GCTGATCATC GCTGAAGATG TCGATGGTGA AGCGCTCGCC
ACGCTGGTGG TCAACAACAT CCGCGGCATT CTCAAGACCT GCGCAGTGAA AGCGCCCGGC
TTTGGTGACC GCCGCAAAGC CATGCTCGAG GATATCGCCA TTCTCACCGG CGGTACGGTG
ATTGCCGAAG AAGTGGGCCT CTCGCTGGAA AAGGCGACCC TGGCGGAACT GGGGCAAGCC
AAGCGGGTGG AAGTCGGCAA GGAGGAAACT ACCATCATCG ACGGCGCCGG GGATACCCAG
AACATCGAAG GCCGTGTGAA GCAGATCCGC GCCCAGATCG AAGAAGCGAC CAGCGACTAC
GATAAGGAGA AACTGCAGGA GCGCGTGGCG AAGCTGGCCG GCGGCGTGGC GTTGATCAAA
GTGGGCGCTG CGACAGAAGT GGAGATGAAG GAGAAAAAAG CGCGGGTGGA AGATGCGTTG
CATGCTACCC GGGCTGCGGT GGAAGAAGGT ATCGTTCCCG GCGGCGGTGT CGCGCTCCTG
CGCACCCGTA GCGCGGTGTC GAACCTGAAA GGCGACAACC ATGACCAGGA TGCCGGAATC
AAGATCGTTC TGCGTGCCCT GGAAGAGCCC TTGCGTCAGA TCGTTGCCAA CTGCGGTGAT
GAGCCTTCCG TGGTCATCAA CAAGGTTCTG GAAGGAACGG AGAACTTCGG CTATAACGCG
GCCAGCAGTG AATATGGTGA CATGGTTCAA ATGGGTGTGC TCGACCCCAC CAAAGTGACG
CGTTATGCCT TGCAGCATGC TGCTTCTATC GCAGGTCTGA TGCTCACCAC GGATGCGCTG
GTAGCGGAAG TGCCCAAGGA AGAGGGTGCC GGCGGCGGTA TGGGCGGAGG CATGGGCGGT
ATGGGTGGCA TGGGTGGCAT GGGCGGCATG GACATGTAA
 
Protein sequence
MAAKEVKFSD SARHKMVNGV NILADAVKVT LGPKGRNVVL ERSYGSPTIT KDGVSVAKEI 
ELKDKFENMG AQMVKEVASK TSDVAGDGTT TATVLAQSIV KEGMKYVAAG MNPMDLKRGI
DKAVVATVEE LKKLSKPCTT GKEIAQVGSI SANSDPEIGK IIADAMEKVG KEGVITVEDG
SGLQNELEVV EGMQFDRGYL SPYFINNADR QIALLESPFI LLHDKKISNI RDLLPVLEQV
AKAGKPLLII AEDVDGEALA TLVVNNIRGI LKTCAVKAPG FGDRRKAMLE DIAILTGGTV
IAEEVGLSLE KATLAELGQA KRVEVGKEET TIIDGAGDTQ NIEGRVKQIR AQIEEATSDY
DKEKLQERVA KLAGGVALIK VGAATEVEMK EKKARVEDAL HATRAAVEEG IVPGGGVALL
RTRSAVSNLK GDNHDQDAGI KIVLRALEEP LRQIVANCGD EPSVVINKVL EGTENFGYNA
ASSEYGDMVQ MGVLDPTKVT RYALQHAASI AGLMLTTDAL VAEVPKEEGA GGGMGGGMGG
MGGMGGMGGM DM