Gene Gmet_3534 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGmet_3534 
Symbol 
ID3739793 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter metallireducens GS-15 
KingdomBacteria 
Replicon accessionNC_007517 
Strand
Start bp3967975 
End bp3969006 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content62% 
IMG OID637780823 
Productheat-inducible transcription repressor HrcA 
Protein accessionYP_386464 
Protein GI78224717 
COG category[K] Transcription 
COG ID[COG1420] Transcriptional regulator of heat shock gene 
TIGRFAM ID[TIGR00331] heat shock gene repressor HrcA 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.000167088 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAGAAG GGCTTTCAGA GCGCAATAAA AAGATCCTTG AGGCGATCAT CGAGGACTAT 
ATTGTGACCG CCGAGCCGGT GGGAAGCCGG GCCGTCACGC GGCGGCACGG TCTCAACCTG
TCGCCGGCCA CGGTCCGCAA CGCCATGGCC GATCTGGAGG AGATGGGGCT CCTTACCTCG
CCCCACACCT CTGCCGGCCG TGTTCCCACT GACCGGGCGT ACCGTTTTTA CGTGGATTCC
CTTCTGGCGG TGGGGCGCAT CGACAAGGCC GAGCGGGAGC GGATACGGAA GCGCTACACC
GGCGCGGGGC GCGACATGGG TGCGGTCCTC CACGAGACGA CCCGGCTCCT GTCGTCGGTT
TCCCACTACA TGGGGATTGT CCTAGCTCCC CGCTTCTCCT CCACCACCCT CCGGCACATG
GAATTCGTGA AGCTGGGGGG GCGCCGGATC CTGGCCATCC TGGTGGCCGA TAACGGCGCG
GTCCAGAACC GGCTCATCGA GTCAGAGGAA GAGTTCGCCG AGGCCGACCT GGTCCGGATG
TCCAACTACC TGAACGAGCT TCTTCAGGGG CTTCCCGTCG CCCAGGTGCG GACCCGTATC
CTGGAGGAGA TGCGGAACGA AAAGGTCCTC TACGACACGC TCTTCTCACG GGCGCTCAAA
CTTTCGGAGC AGACCATCGT GGAGGGTAGC GCCCAGATCT TCATGGAAGG GCAGACCAAC
ATTCTGGATC AGCCTGAATT CGCCGATGTG GGGCGCATGA AGGATGTGTT CCGGGCTTTC
GAGGAGAAGA ACCAGCTGGT GAACCTCCTG GACCGCTGCA TCTCGGCCCA GGGAGTCCAG
ATTTTCATCG GTTCCGAGAC CCACCTGAAC CACATGGAGG GGTTGAGCGT CATCACCTCC
GCCTATGTGT CGGGGAAAAA CACCCTCGGC GTCCTCGGGG TGATCGGCCC GACCCGCATG
GGGTACGGCA AGGTGATCCC CATCGTCGAC TACACCGCGA AGCTTGTGAG CAAGCTGCTC
GAAGACGAGT AG
 
Protein sequence
MGEGLSERNK KILEAIIEDY IVTAEPVGSR AVTRRHGLNL SPATVRNAMA DLEEMGLLTS 
PHTSAGRVPT DRAYRFYVDS LLAVGRIDKA ERERIRKRYT GAGRDMGAVL HETTRLLSSV
SHYMGIVLAP RFSSTTLRHM EFVKLGGRRI LAILVADNGA VQNRLIESEE EFAEADLVRM
SNYLNELLQG LPVAQVRTRI LEEMRNEKVL YDTLFSRALK LSEQTIVEGS AQIFMEGQTN
ILDQPEFADV GRMKDVFRAF EEKNQLVNLL DRCISAQGVQ IFIGSETHLN HMEGLSVITS
AYVSGKNTLG VLGVIGPTRM GYGKVIPIVD YTAKLVSKLL EDE