Gene Haur_3679 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3679 
Symbol 
ID5735555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4621519 
End bp4623162 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content52% 
IMG OID641280828 
Productchaperonin GroEL 
Protein accessionYP_001546443 
Protein GI159900196 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.59981 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAGC AAGTTGCTTT TAATGAAGAA GCCCGCCGCG CACTCAAACG TGGCGTGGAC 
GTGGTTGCAG ATGCAGTTAA GACGACCTTG GGCCCACGCG GTCGCAACGT CGCTATCGAT
AAAAAATTCG GCTCACCAAC CGTAACCCAC GACGGCGTTA CCGTTGCTAA AGAAATCGAA
TTAAAAGACC CATTTGAAAA CATGGGCGCT CGCTTGTTGG TTGAAGCCGC AACCAAAACC
AACGATGTTG CTGGTGACGG TACCACCACC GCCACCGTTT TGGCTCAAGC AATTGTCCAC
GAAGGCTTGC GCCAAGTGGC TGCTGGCGCT AACTCGATGA TGATCAAGCG TGGCTTGGAC
AAAGGCACTG CCGTTTTGTT GCAAGCAATC CGCGATTTGG CCAAGCCAGT CAACGATCGC
ACCGACATCT CAAGCGTTGC CACCATCTCA GCTGCCGATT CGTCAATTGG CGATTTGATT
GCTGAAGTGA TGGACAAAGT TGGCAAAGAC GGCGTTATCA CCGTTGAAGA AGGCAAAGGC
TTGGGCTACG AAACCGAGTA TACCGAAGGT ATGCAATTCG ATCGTGGCTA CATCTCAGCC
TACTTCGTCA CCAACAGCGA TCGCATGGAA TCAGATTTGG AAGACCCCTA CATTTTGATC
ACCGACAAGA AGATCAGCTC GATCCAAGAA ATCTTGCCAG TGCTCGAAAA AGTCTTGCAA
TTCACCAAGA ACTTCGTCAT TATCGCTGAA GATATCGACG GCGAAGCCTT GCCAACCCTC
GTGTTGAACA AATTGCGCGG CACGATCAAC GTGTTGGCAA TCAAAGCTCC TGGCTTCGGC
GATCGCCGCA AAGCCATGTT GCAAGATATC GCCATCCTCA CCGGTGGTAC GGTTATCAGC
GAAGAAATTG GCCGCAAGCT TGATAGCGCC ACGGTCGAAG ATTTGGGCCG CGCTCGCCGC
GTGATTGCCA ACAAAGACGA AACCACGGTT ATCGAAGGCC GCGGCGACGA AGATGCAATC
AAAGCTCGGA TCGAACAAAT TCGTGCTCAA ATTGAAACCA CCACCAGCGA TTTCGATCGC
GAGAAACTGC AAGAACGCTT GGCCAAATTG GCTGGTGGCG TAGCAGTGCT CAAAGTTGGT
GCTGCAACCG AGCCAGAATT GAAAGAACGC AAGCACCGCG TCGAAGATGC CCTCTCAACC
GCTCGTGCAG CTGTTGAAGA AGGTATCGTG CCTGGTGGCG GGATTGCCTT GTTGAGCGTA
TTGCCAGCCT TGGATAGCGT TGTGCCAGCC AACCAAGACG AAAAAGCTGC TGTCTTGATT
CTGCGCCGCG CCTTGGAAGA ACCAATTCGC CAATTGGCCC GCAACGCTGG TGAAGATGGT
GCTGTGATTA TCGACACCGT GCGCCGCTTG CAAAAAGAAA AAGGCGATTC AACCCTTGGC
TACAACGTCA TCACTGGCGA ATATGGCTCA ATGGTTGAAA TGGGCATCAT CGACCCAGCC
AAGGTAACTC GCTCGGCCTT GCAAAACGCC GTTTCGATTG CCTCGATGAT CTTGACCACC
GATGCTTTGG TCGCCGATAT CCCAGAAAAA GAAGCTGCTC CAGCTCCTGG TGGTATGGGT
GGCATGGGCG GCATGGATTT CTAA
 
Protein sequence
MAKQVAFNEE ARRALKRGVD VVADAVKTTL GPRGRNVAID KKFGSPTVTH DGVTVAKEIE 
LKDPFENMGA RLLVEAATKT NDVAGDGTTT ATVLAQAIVH EGLRQVAAGA NSMMIKRGLD
KGTAVLLQAI RDLAKPVNDR TDISSVATIS AADSSIGDLI AEVMDKVGKD GVITVEEGKG
LGYETEYTEG MQFDRGYISA YFVTNSDRME SDLEDPYILI TDKKISSIQE ILPVLEKVLQ
FTKNFVIIAE DIDGEALPTL VLNKLRGTIN VLAIKAPGFG DRRKAMLQDI AILTGGTVIS
EEIGRKLDSA TVEDLGRARR VIANKDETTV IEGRGDEDAI KARIEQIRAQ IETTTSDFDR
EKLQERLAKL AGGVAVLKVG AATEPELKER KHRVEDALST ARAAVEEGIV PGGGIALLSV
LPALDSVVPA NQDEKAAVLI LRRALEEPIR QLARNAGEDG AVIIDTVRRL QKEKGDSTLG
YNVITGEYGS MVEMGIIDPA KVTRSALQNA VSIASMILTT DALVADIPEK EAAPAPGGMG
GMGGMDF