Gene RoseRS_1077 details

Gene Information

Locus tag: RoseRS_1077
Symbol:
ID: 5208023
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 1338730
End bp: 1340370
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 62%
IMG OID: 640594691
Product: chaperonin GroEL
Protein accession: YP_001275436
Protein GI: 148655231
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
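
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the protein length excludes the stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1338730
end_bp = 1340370
gene_length_bp = 1641
protein_length_aa = 546

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the final codon is the stop codon,
# which is assumed not to be counted in the protein length.
codons = span // 3

assert span == gene_length_bp
assert span % 3 == 0
assert codons - 1 == protein_length_aa
```

Under these assumptions the three fields agree with each other: 1641 bp is 547 codons, one of which is the stop codon.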


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAAAC AGGTGATATT CAACGAGCAG GCGCGCGCAG CGCTCAAGCA CGGCGTTGAT 
ACCCTGGCGC TCGCTGTGAA GACAACGCTT GGTCCTCGCG GGCGCAACGT TGCGATGGGC
AAGAAATGGG GTGCACCCTC CGTCACCCAT GACGGCGTCA CCGTAGCGAA GGAGGTCGAA
CTGAAGGACC CCTTCCAGAA TATGGGCGCC CAACTCCTCA AAGAAGCCGC CAGCAAAACG
AACGATGTCG CCGGTGACGG CACAACAACG GCCACAGTGC TGGCGCAGGC GATGATCGAC
GAAGGATTGA AACTGGTCGC CGCAGGCGCC AACCCCATGA TCTTCAAACG TGGTCTGGAT
AAAGGGCGCG AGGCGCTGGT TGCACGCATC AAAGAGCAAT CGATCACCCT CAAGAGCCGT
GACGAGATTC GCCAGGTAGC GACCATCTCC GCCCAAGACC CGGAGATCGG CGAGTTGCTG
GCGACCATCA TGGATAAGAT CGGGCATGAT GGGGTCGTCA CCATCGAAGA GGGCAAAGGC
ACAACCCTGG AGTACGAACT GGTCGAGGGC ATGCAGTTCG ACCGCGGGTA CATTTCGCCC
TACTTCGTGA CCGATTCGAG CCGCATGGAG GCGGTCATCG ACGAGCCGTA CATCCTGATC
ACCGACAAGA AGATCAGCGC CGTCAATGAT CTGCTCCCGA TTCTGGAGGC GGTGCTGGCG
ACCGGCAAGA AGGACCTGGT GATCATTGCT GAAGATGTCG ATGGCGAAGC GCTGGCGACC
CTGGTGGTCA ACAAGATGCG CGGCACCCTC AACGCGCTGG CGGTGAAGGC CCCCGGTTTT
GGCGACCGCC GCAAAGCGAT GCTCCAGGAC ATCGCCATCC TGACCGGCGG CACGGTCATC
AGTGAGGAGG TCGGGCGCAA ACTCGACAGC GCCAAAGTGC AAGACCTCGG TCGCGCTCGC
CGGGTGAAGT CGGACAAAGA CAACACGGTG ATTGTCGAAG GGTTCGGCGA CAAGCAGGCG
ATCCAGGCGC GCATCCGGCA GCTGAAGCAG CAGATCGAAA CCACGACATC GGACTACGAC
CGTGAGAAAC TGCAGGAGCG CGTCGCCAAA CTGTCAGGCG GCGTGGCGGT GATCAAGGTC
GGCGCTCCGA CCGAACCGGC GCTCAAGGAG CGCAAGGCGC GCGTTGAGGA TGCGCTGAAC
GCGACCCGCG CCGCAGTCGA GGAAGGCATC GTACCGGGCG GCGGCATCGC GCTGTTGAAC
GCCATCCCGG CGCTCGATAA TGTACAGACG CAGTTTGAGG AAGAGCGCAT GGCGCTGAAC
ATTCTGCGCC GCGCGCTGGA AGAGCCGCTG CGCCAGCTGG CGATCAACGC CGGTGAGGAC
GGCTCGGTGG TGGTGAATCA GGTGCGCACG CTCCAGCGTG AACACAACAA TCCGAACTAC
GGGTTCGATG TGATGACCGG GAAATACGTT GATCTCATGC AGGCTGGCAT CATCGACCCG
GCAAAGGTGG TGCGCACCGC GCTCGAGAAT GCGGTCAGCG TTGCAGGTAT CGTCCTGACG
ACCGATGCGC TGATCACCGA TGCGCCGGAG CCGAAGAAGA ACGGTGCGCG CACGCCATCG
ATGCCGGAGG AGGAGTTCTG A
 
Protein sequence
MAKQVIFNEQ ARAALKHGVD TLALAVKTTL GPRGRNVAMG KKWGAPSVTH DGVTVAKEVE 
LKDPFQNMGA QLLKEAASKT NDVAGDGTTT ATVLAQAMID EGLKLVAAGA NPMIFKRGLD
KGREALVARI KEQSITLKSR DEIRQVATIS AQDPEIGELL ATIMDKIGHD GVVTIEEGKG
TTLEYELVEG MQFDRGYISP YFVTDSSRME AVIDEPYILI TDKKISAVND LLPILEAVLA
TGKKDLVIIA EDVDGEALAT LVVNKMRGTL NALAVKAPGF GDRRKAMLQD IAILTGGTVI
SEEVGRKLDS AKVQDLGRAR RVKSDKDNTV IVEGFGDKQA IQARIRQLKQ QIETTTSDYD
REKLQERVAK LSGGVAVIKV GAPTEPALKE RKARVEDALN ATRAAVEEGI VPGGGIALLN
AIPALDNVQT QFEEERMALN ILRRALEEPL RQLAINAGED GSVVVNQVRT LQREHNNPNY
GFDVMTGKYV DLMQAGIIDP AKVVRTALEN AVSVAGIVLT TDALITDAPE PKKNGARTPS
MPEEEF
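
The gene and protein sequences above can be compared by translating the nucleotide blocks codon by codon. The sketch below is illustrative only: the `translate` helper and the variable names are hypothetical, and only the first and last lines of the gene sequence are used. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as starts), and this gene begins with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO
    )
}


def translate(blocks: str) -> str:
    """Translate space-separated nucleotide blocks, stopping at the first stop codon."""
    seq = "".join(blocks.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGGCGAAAC AGGTGATATT CAACGAGCAG GCGCGCGCAG CGCTCAAGCA CGGCGTTGAT"
last_line = "ATGCCGGAGG AGGAGTTCTG A"

print(translate(first_line))
print(translate(last_line))
```

The two printed strings can be compared against the start and end of the protein sequence with the spaces between its 10-residue blocks removed. Translating the full gene sequence works the same way, because each 60-nucleotide line is a whole number of codons.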