Gene Noc_2921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2921 
SymbolgroEL 
ID3705349 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3304016 
End bp3305680 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content55% 
IMG OID637739398 
Productchaperonin GroEL 
Protein accessionYP_344897 
Protein GI77166372 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000375677 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGCTA AAGATGTAAG ATTTAGCGAA GATGCACGCC ATCGCATGAT GCATGGCGTC 
AATGTTTTGG CCGATGCGGT ACGGGTCACC TTGGGCCCCA GGGGGCGGAA TGTGGTGCTG
GAGAAGAGCT TTGGCGCACC CACCATTACC AAGGACGGCG TTAGCGTTGC CAAAGAAATT
GAACTTAAGG ATAGGTTCGA GAACATGGGC GCCCAGATGG TCAAGGAAGT GGCTTCCCAG
ACTTCCGATG TGGCAGGAGA TGGGACCACC ACAGCAACTG TGCTGGCTCA GAGCATACTG
CGCGAGGGTA TGAAGGCGGT GGCCGCTGGC ATGAACCCCA TGGATCTCAA ACGGGGCGTT
GACAAGGCAG TGGTGGCTGC GGTTGAAGAA CTGAAAAAAC TCTCCAAGCC CTGCGAGGAT
AGCAAGGCTA TCGGCCAGGT GGGCACTATC TCCGCCAATG CGGAAGAGTC CGTGGGTAAA
ATCATTGCCG AAGCCATGGA TAAGGTAGGT AAGGAAGGCG TGATTACGGT GGAGGAAGGT
TCTGGCTTGG ATAATGAGCT GGAAGTTGTG GAAGGAATGC AGTTTGATCG GGGTTATCTC
TCGCCCTATT TCATCACCGA TCAGCAGTCC ATGGCGGCGG ATCTGGATGA TCCTTATATC
CTTATTCACG ATAAAAAAAT CTCCAATATT CGCGACTTGC TGCCGGTGCT GGAAAGCGTG
GCCAAAGCGG GTAAGCCGTT ACTGGTGATT TCTGAGGATG TGGAAGGCGA AGCGCTGGCG
ACCCTGGTGG TCAATACCAT CCGCGGTATC GTTAAAGTGT GCGCGGTCAA GGCACCAGGC
TTTGGTGATC GCCGTAAGGC CATGCTGGAA GATATCGCCG TGCTGACTGG CGGCACGGTC
ATTTCCGAGG AGGTTGGTCT TTCCCTGGAT AAAGTGACCC TGGATGATTT GGGTCGGGCC
AAGAAAATCA CCGTTAATAA GGAAAACACC ACCATCGTAG ATGGTGCGGG TAGTGCTGAC
GATATTAAAG CCCGGGTCGA GCAAGTCCGG ATACAGATTG AGGAAGCCAC TTCCGATTAC
GATAAAGAGA AGCTCCAGGA GCGGGTTGCT AAATTGGCAG GTGGCGTGGC CGTCATCAAG
GTGGGCGCCG CGACCGAAAT GGAGATGAAA GAGAAAAAGG CCCGTGTGGA AGACGCTCTG
CACGCCACCC GTGCGGCGGT AGAAGAAGGC GTGGTCCCTG GTGGTGGGGT AGCGCTGATT
CGGGCTCTAT TAGGTATTAA GGATCTTAAA GGTGCGAATC ATGATCAAGA TGTGGGCATT
AATATTGCCC GTCGCGCCAT GGAAGAACCT CTGCGCCAGA TCGTGAACAA TTCTGGGGAA
GAGGCTTCTG TCATCGTTAA CCAGATTAAG GGAGGCGAGG GTAACTACGG TTACAATGCC
GCCACCGGAG AGTTTGGCGA CATGATTGCC ATGGGAATCC TGGACCCCAC CAAGGTCAGC
CGGACGGCCC TGCAAAATGC TGCCAGCGTG GCGGGTCTGA TGATCACCAC CGAGGCGATG
ATTGCCGAGG CGCCGAAGGA CGAGGAGGCA TCTCCTGGCG GTGCGCCTGG CATGGGCGGC
GGCATGGGTG GTATGGGCGG CATGGGTGAT ATGGGCATGA TGTAA
 
Protein sequence
MAAKDVRFSE DARHRMMHGV NVLADAVRVT LGPRGRNVVL EKSFGAPTIT KDGVSVAKEI 
ELKDRFENMG AQMVKEVASQ TSDVAGDGTT TATVLAQSIL REGMKAVAAG MNPMDLKRGV
DKAVVAAVEE LKKLSKPCED SKAIGQVGTI SANAEESVGK IIAEAMDKVG KEGVITVEEG
SGLDNELEVV EGMQFDRGYL SPYFITDQQS MAADLDDPYI LIHDKKISNI RDLLPVLESV
AKAGKPLLVI SEDVEGEALA TLVVNTIRGI VKVCAVKAPG FGDRRKAMLE DIAVLTGGTV
ISEEVGLSLD KVTLDDLGRA KKITVNKENT TIVDGAGSAD DIKARVEQVR IQIEEATSDY
DKEKLQERVA KLAGGVAVIK VGAATEMEMK EKKARVEDAL HATRAAVEEG VVPGGGVALI
RALLGIKDLK GANHDQDVGI NIARRAMEEP LRQIVNNSGE EASVIVNQIK GGEGNYGYNA
ATGEFGDMIA MGILDPTKVS RTALQNAASV AGLMITTEAM IAEAPKDEEA SPGGAPGMGG
GMGGMGGMGD MGMM