Gene GSU0031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0031 
SymbolhrcA 
ID2685607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp40721 
End bp41749 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content61% 
IMG OID637124693 
Productheat-inducible transcription repressor HrcA 
Protein accessionNP_951093 
Protein GI39995142 
COG category[K] Transcription 
COG ID[COG1420] Transcriptional regulator of heat shock gene 
TIGRFAM ID[TIGR00331] heat shock gene repressor HrcA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.820963 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGATA TATCCGAGCG CAACAAGCTT ATCCTCGAAG CCATCATCGA GGATTACATC 
ACGACGGCGG AGCCGGTCGG CAGCCGCGCC GTGACCAAGC GGCACGGCCT GACCCTCTCG
CCGGCAACGG TGCGCAACGT CATGGCCGAC CTGGAGGAGA TGGGGTATCT CGTCTCTCCC
CATACTTCGG CGGGGCGGGT TCCCACCGAC AAGGCCTATC GGCTCTACGT GGACTCGCTT
CTGGCGGTTC GGCGCATCGA CAAGGTGGAG CGGGAGCGGA TCCGCAAGCG CTACGCGGAG
GCCGGCCGGG ATATTGGCGC AGTCCTTCAT GAAACGAGCC GATTGCTCTC CTCGGTCTCC
CACTACACGG GGATCGTCCT CGCCCCCCGC TTCTCCGCCA CCATTTTCCG GCACATTGAG
TTCGTAAAGC TGGGAGGGCG CCGTATCCTC GTCATCCTGG TGGCCGACAA CGGCACCGTC
CAGAATCGGC TCATCGAATC CGACGAGGAG TTTTCCTCCG AAGAACTGAT CAGGATGTCC
AACTACCTGA ATGAGCTTCT GGTGGGGGTG CCGGTGGGCC AGGTCCGCAC GCGGATACTG
GAGGAGATGA GAAACGAGAA GGTCCTTTAC GACAAGCTCC TCGCCCGTGC GCTCCAGCTT
TCCGAGCAGA GCCTGAATGA CGACGGCGCC CAGATCTTCA TCGAGGGACA GACCAATATC
CTTGAACAGC CCGAGTTCGC CGACTCGCGG CGGATGAGGG ATCTCTTCCG GGCGTTCGAG
GAAAAGAACC AACTGGTGGG GCTCCTGGAC CGGTGCCTCA ATGCCCAGGG CGTGCAGATA
TTTATCGGCG CCGAAACCCA TCTCAGCCAG ATGGAGGGTT TGAGCATCAT CACCTCCACG
TACCTCACGG GAAAGAACAC TCTCGGCGTT CTGGGGGTCA TCGGACCGAC CCGGATGGGC
TATGCCAAGG TGATCCCCAT CGTCGACTAT ACCGCGAAGC TGGTCAGCAG GCTCCTGGAA
GGGGAATAG
 
Protein sequence
MTDISERNKL ILEAIIEDYI TTAEPVGSRA VTKRHGLTLS PATVRNVMAD LEEMGYLVSP 
HTSAGRVPTD KAYRLYVDSL LAVRRIDKVE RERIRKRYAE AGRDIGAVLH ETSRLLSSVS
HYTGIVLAPR FSATIFRHIE FVKLGGRRIL VILVADNGTV QNRLIESDEE FSSEELIRMS
NYLNELLVGV PVGQVRTRIL EEMRNEKVLY DKLLARALQL SEQSLNDDGA QIFIEGQTNI
LEQPEFADSR RMRDLFRAFE EKNQLVGLLD RCLNAQGVQI FIGAETHLSQ MEGLSIITST
YLTGKNTLGV LGVIGPTRMG YAKVIPIVDY TAKLVSRLLE GE