Gene Arth_2886 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2886 
SymbolgroEL 
ID4444443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3250565 
End bp3252175 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content66% 
IMG OID639690709 
Productchaperonin GroEL 
Protein accessionYP_832365 
Protein GI116671432 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00686344 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAGC AGCTTGCGTT TAACGACGCT GCCCGCCGGT CGCTTGAAGC CGGCATCGAT 
AAGCTCGCCA ACACTGTCAA GGTGACGCTT GGCCCGCGCG GCCGCAACGT CGTGCTGGAC
AAGAAGTGGG GCGCTCCCAC CATTACGAAC GACGGCGTGA CCATCGCCCG CGAAGTCGAA
CTGGATGACC CGTTCGAGAA CCTCGGCGCG CAGCTGGCCA AGGAAGTCGC CACCAAGACC
AACGATGTTG CCGGCGACGG CACCACCACC GCCACCGTGC TGGCACAGGC ACTGGTCAAG
GAAGGCCTGC GCAACGTGGC GGCTGGCGCC GCCCCGGGCC AGATCAAGCG CGGCATCGAG
GTTTCCGTCG AAGCCGTCGC AGCACGCCTG CTGGAGAACG CCCGCCCTGT CGAAGGCTCC
CAGGTTGCGA ACGTTGCAGC CATCTCCGCC CAGAGCGACG AGATCGGCGA GCTCCTGGCC
GAGGCTTTCG GCAAGGTCGG CAAGGATGGT GTGATCACCA TCGAGGAGTC CTCCACCACG
CAGACCGAGC TCGTCCTCAC CGAGGGCATG CAGTTCGACA AGGGCTACCT TTCCCCGTAC
TTCGTCACCG ACGCGGAACG CCAGGAGGCA GTCCTCGAAG ACGCCCTTAT CCTGATCAAC
CAGGGCAAGA TCTCCTCGGT GCAGGAATTC CTGCCCCTCC TGGAGAAAGC GCTGCAGAGC
TCCAAGCCGC TGTTCATCAT TGCCGAGGAC GTCGAGGGCG AGGCCCTGTC CACGCTCATC
GTCAACCGCA TCCGCGGCAC CCTGAACGTC GTTGCCGTCA AGGCTCCGGG CTTCGGTGAC
CGCCGCAAGG CCATGCTGCA GGACATCGCC ACCCTCACCG GTGCGCAGGT TGTCTCCCCG
GAACTGGGCC TCAGCCTTGA TTCCGTTGGC CTCGAGGTGC TGGGCACGGC CCGCCGCATC
ACGGTGACCA AGGACAACAC CACCATTGTT GACGGCGCCG GCACGGCCGA GGACGTAGCG
GCACGCGTTG CCCAGCTGCG CGCCGAGCTG ACCCGCACCG ACTCCGACTG GGACAAGGAA
AAGCTCCAGG AGCGCCTGGC CAAGCTGGCC GGCGGCATCG GTGTGATCAA GGTTGGCGCA
GCCACCGAGG TGGAGCTGAA GGAAAAGAAG CACCGCATCG AGGACGCAGT CTCCTCCACC
CGCGCTGCCC TCGAAGAAGG CATCGTTGCC GGTGGCGGTT CGGCCCTCAT CCACGCCCTG
AAGGCGCTGG ACGAGGACCC TGCAGTCACC GCACTCGAAG GCGATGCAGC CTCGGCTGTG
GGCATCGTTC GCCGGGCACT CGTCCAGCCG CTGCGCTGGA TCGCCCAGAA CGCCGGTTTC
GACGGCTACG TCGTTGCCGC CAAGGTTGCC GAGTCGGCTG TCAACCAGGG CTTCAACGCC
AAGAGCGGCG ATTACGAAGA CCTGATCGCT GCAGGCGTCA TCGACCCCGT GAAGGTGACG
CGCGCAGCCC TCCGCAACGC CGCTTCCATC GCAGCGCTGG TTCTCACCAC AGAGACCCTC
GTTGTCGAGA AGCCGGCCGA CGAGGACGAG CACGCTGGCC ACAAGCACTA G
 
Protein sequence
MAKQLAFNDA ARRSLEAGID KLANTVKVTL GPRGRNVVLD KKWGAPTITN DGVTIAREVE 
LDDPFENLGA QLAKEVATKT NDVAGDGTTT ATVLAQALVK EGLRNVAAGA APGQIKRGIE
VSVEAVAARL LENARPVEGS QVANVAAISA QSDEIGELLA EAFGKVGKDG VITIEESSTT
QTELVLTEGM QFDKGYLSPY FVTDAERQEA VLEDALILIN QGKISSVQEF LPLLEKALQS
SKPLFIIAED VEGEALSTLI VNRIRGTLNV VAVKAPGFGD RRKAMLQDIA TLTGAQVVSP
ELGLSLDSVG LEVLGTARRI TVTKDNTTIV DGAGTAEDVA ARVAQLRAEL TRTDSDWDKE
KLQERLAKLA GGIGVIKVGA ATEVELKEKK HRIEDAVSST RAALEEGIVA GGGSALIHAL
KALDEDPAVT ALEGDAASAV GIVRRALVQP LRWIAQNAGF DGYVVAAKVA ESAVNQGFNA
KSGDYEDLIA AGVIDPVKVT RAALRNAASI AALVLTTETL VVEKPADEDE HAGHKH