Gene Rcas_2779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2779 
Symbol 
ID5540265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3595724 
End bp3597364 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content60% 
IMG OID640894905 
Productchaperonin GroEL 
Protein accessionYP_001432868 
Protein GI156742739 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAAC AGATTATCTT CAACGAGCAG GCGCGCGCAG CGCTCAAGCA TGGTGTCGAT 
ACCATGGCGC TCGCTGTCAA GACCACCCTC GGACCGCGCG GGCGCAATGT GGCGATGGGG
AAGAAATGGG GTTCTCCGGC GGTGACCCAC GATGGCGTAA CCGTTGCCAA GGAAGTCGAG
CTGAAGGACC CCTTCGAGAA TATGGGCGCC CAATTGCTCA AGGAAGCCGC CAGCAAAACG
AACGATGTCG CCGGAGACGG CACGACGACC GCAACCGTCC TGGCGCAGGC GATGATCGAT
GAAGGGCTGA AACTGGTGGC AGCCGGCGCT AATCCTATGA TCCTCAAACG CGGTCTCGAC
AAGGGACGCG AGGCTCTGGT GGCGCGCATT AAAGAGCAAG CCATCTCGTT GAAGAGCCGT
GACGAAATCC GCCAGGTGGC GACGATCTCT GCGCAGGACC CAGAGATCGG CGAGTTGCTG
GCGACAATCA TGGACAAGAT CGGGCACGAC GGCGTTGTGA CGATTGAAGA GGGCAAAGGC
ACAACGCTGG AGTACGAACT GGTCGAGGGT ATGCAGTTCG ACCGCGGCTA CATCTCACCC
TACTTCGTCA CCGACTCGAG CCGCATGGAG GCAGTGATCG ACGAGCCGTA CATCCTGATC
ACCGACAAGA AAATCAGCGC TGTTAATGAT CTGCTTCCCG TCCTTGAAGC CGTACTGGCG
ACCGGCAAAA AAGACCTGGT GATCATCGCC GAGGATGTCG ATGGCGAAGC ACTGGCGACA
CTGGTGGTCA ACAAGATGCG CGGTACGCTG AACCCGCTGG CAGTGAAAGC CCCCGGCTTT
GGTGATCGCC GCAAAGCCAT GCTCCAGGAC ATCGCCATCC TGACCGGCGG CACGCTCATC
AGCGAAGAAG TCGGGCGCAA ACTCGATAGC GCCAAAGTGC AGGACCTTGG CCGCGCCCGC
CGGGTGAAGT CGGACAAGGA CAATACCGTC ATTGTTGAAG GGTTTGGCGA CAAGCAGGCG
ATCCAGGCGC GCATCAGGCA ACTGAAGCAG CAGATCGAAA CCACAACGTC GGACTATGAC
CGCGAGAAGT TGCAGGAGCG CGTTGCCAAG CTGTCGGGCG GCGTGGCGGT GATCAAAGTC
GGCGCACCGA CCGAACCGGC GCTAAAGGAG CGCAAGGCGC GCGTCGAGGA CGCGCTGAAC
GCAACCCGCG CGGCAGTTGA GGAGGGCATT GTGCCTGGCG GCGGCGTTGC GCTGCTGAAT
GCCATTTCAG CGCTCGATAA CGTCCAGACG CAGTTCGAGG AAGAGCGCAT GGCATTGAAC
GTGTTGCGGC GTGCGCTGGA AGAGCCGCTG CGCCAGCTGG CGATCAACGC CGGTGAGGAT
GGCTCAGTGG TAGTGAATCA GGTGCGAACC TTGCAGCGTG AGCACAATAA TCCGCACTAC
GGGTTTGATG TGATGACCGG CAACTATGTC GATCTGATGC AGGCAGGCAT CATCGACCCG
GCGAAAGTTG TGCGCAGCGC GCTGGAGAAC GCCGTCAGCG TCGCAGGCAT CGTCCTGACA
ACCGATGCGC TGATTACCGA AGCGCCGGAG CCGAAGAAGA ACGGCGCACG CACGCCGTCG
ATGCCGGATG AGGAGTTCTA A
 
Protein sequence
MAKQIIFNEQ ARAALKHGVD TMALAVKTTL GPRGRNVAMG KKWGSPAVTH DGVTVAKEVE 
LKDPFENMGA QLLKEAASKT NDVAGDGTTT ATVLAQAMID EGLKLVAAGA NPMILKRGLD
KGREALVARI KEQAISLKSR DEIRQVATIS AQDPEIGELL ATIMDKIGHD GVVTIEEGKG
TTLEYELVEG MQFDRGYISP YFVTDSSRME AVIDEPYILI TDKKISAVND LLPVLEAVLA
TGKKDLVIIA EDVDGEALAT LVVNKMRGTL NPLAVKAPGF GDRRKAMLQD IAILTGGTLI
SEEVGRKLDS AKVQDLGRAR RVKSDKDNTV IVEGFGDKQA IQARIRQLKQ QIETTTSDYD
REKLQERVAK LSGGVAVIKV GAPTEPALKE RKARVEDALN ATRAAVEEGI VPGGGVALLN
AISALDNVQT QFEEERMALN VLRRALEEPL RQLAINAGED GSVVVNQVRT LQREHNNPHY
GFDVMTGNYV DLMQAGIIDP AKVVRSALEN AVSVAGIVLT TDALITEAPE PKKNGARTPS
MPDEEF