Gene GM21_3576 details

Gene Information

Locus tag: GM21_3576
Symbol:
ID: 8138949
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4157884
End bp: 4158915
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 61%
IMG OID: 644871196
Product: heat-inducible transcription repressor HrcA
Protein accession: YP_003023355
Protein GI: 253702166
COG category: [K] Transcription
COG ID: [COG1420] Transcriptional regulator of heat shock gene
TIGRFAM ID: [TIGR00331] heat shock gene repressor HrcA
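
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4157884
end_bp = 4158915

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the listed gene length of 1032 bp and protein length of 343 aa.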


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 137
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGAAC AGCTTTCCGA GCGCGGCAAA CGGATCCTCG AGGCGGTGAT CGAGGATTAC 
ATAGCAACAG CCGAACCGGT GGGAAGCCGG ACCATAACGC GCAGCCATGC GCTCGCGCTT
TCCCCGGCCA CGGTGAGAAA CGTCATGTCG GATCTGGAGG AGATGGGGCT TTTGACCTCC
CCGCACACCT CCGCCGGGCG CATCCCGACC GATAAGGCCT ACCGCCTGTA CGTAAACTCC
ATTCTCGAGG TGAAGAACCT CGCCCGGGAC AACCGGGAGG AGATCAGAAG GCGCTGCAGG
ATGGCTGGAA GGGATATCGC AGAGGTGCTC AAGGAAACGA GCCGGCTCTT GTCCTCGACC
TCTAGCTACA TGGGCGTGGT AATGGCGCCG CACCTGGCCG CGAACGTCTT TCACCAGATG
GAATTCGTCA AGCTGTCGAG CCGCAGGGTG CTCGCCATCC TGGTGTCGCA AAACGGCACG
GTGCAGAACC GGCTCCTGGA GACCGGCGAG GAGATCGCGC AGGAAGACCT GGTCCGGATG
GCCAACTACC TGAACGGGAT GCTGCAGGGG CTCACCATCG CCCAGGTGCG CGATAGGCTT
TTAAGCGAGA TGCAAAGCGA GAAGGTGCGT TACGACACCA TGATGGCGCG CGCTTTGACC
CTCTCGCAGC AGACCATACA GGCCGACGGG GCCGAGATCT TCCTGGAAGG GCAGGCGAAT
ATCCTGGAGC AGCCCGAGTT CGCCGACACG GCCAAGATGC GGGAGATCTT CCGGACCTTC
GAGAAAAAGA GCCTGCTCTT GGACCTTTTG GACCGTTCGC TGTCGGCCGA GGGGGTGCAG
ATCTTCATAG GCTCGGAGTC GAACCTCCTC AAGATGGAGG GGATGAGTCT CGTCACCTCC
ACCTACATGA CCGGAAAGGA CACGGTAGGC GTCCTCGGGG TGATAGGTCC TACCCGCATG
GGGTACGGCA GGGTGATCCC CATCGTTGAT TACACAGCCA AGCTGATCAG CCGTCTGCTC
GAAGCGGAGT AG
 
Protein sequence
MEEQLSERGK RILEAVIEDY IATAEPVGSR TITRSHALAL SPATVRNVMS DLEEMGLLTS 
PHTSAGRIPT DKAYRLYVNS ILEVKNLARD NREEIRRRCR MAGRDIAEVL KETSRLLSST
SSYMGVVMAP HLAANVFHQM EFVKLSSRRV LAILVSQNGT VQNRLLETGE EIAQEDLVRM
ANYLNGMLQG LTIAQVRDRL LSEMQSEKVR YDTMMARALT LSQQTIQADG AEIFLEGQAN
ILEQPEFADT AKMREIFRTF EKKSLLLDLL DRSLSAEGVQ IFIGSESNLL KMEGMSLVTS
TYMTGKDTVG VLGVIGPTRM GYGRVIPIVD YTAKLISRLL EAE
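
As a rough consistency check between the two sequences, the gene sequence can be translated and compared with the protein sequence. The sketch below is a minimal translator, not the tool used to produce this record. It builds the codon table by hand; translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical helpers, and only the first and last lines of the gene sequence are used to keep the example short.

```python
# Codon-to-amino-acid assignments, enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGAAGAAC AGCTTTCCGA GCGCGGCAAA CGGATCCTCG AGGCGGTGAT CGAGGATTAC"
last_line = "GAAGCGGAGT AG"

print(translate(first_line))
print(translate(last_line))
```

The first line translates to the first 20 residues of the protein sequence, and the last line translates to its final three residues followed by a TAG stop codon. The last line is in frame because the 17 preceding lines hold 60 bases each, a multiple of three.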