Gene Mpe_A3333 details


Gene Information

Locus tag: Mpe_A3333
Symbol: hrcA
ID: 4786432
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3539374
End bp: 3540402
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 69%
IMG OID: 640091906
Product: heat-inducible transcription repressor
Protein accession: YP_001022521
Protein GI: 124268517
COG category: [K] Transcription
COG ID: [COG1420] Transcriptional regulator of heat shock gene
TIGRFAM ID: [TIGR00331] heat shock gene repressor HrcA
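
The coordinate and length fields above are consistent with one another. As a minimal sketch of that arithmetic, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the final codon is a stop codon (the function name `feature_lengths` is hypothetical, not part of any database API):

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive on both ends.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # Assumption: the last codon is a stop codon and encodes no residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


print(feature_lengths(3539374, 3540402))  # (1029, 342)
```
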


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGACG AACGTGCCAA GACCCTGTTG AAGACGCTGG TCGAGCGCTA TATCGCCGAT 
GGTCAGCCGG TCGGGTCGCG CACGCTGTCG CGGGCGTCCG GGCTGGAGCT GAGCCCGGCC
ACCATCCGCA ACGTGATGGC CGACCTGGAG GAGCTGGGCC TGATCGCCAG CCCGCACACC
AGCGCCGGGC GCATCCCCAC GGCGCGCGGC TACCGGCTGT TCGTCGACAC CATGCTCACC
GCGCGGCCGC TCGACATGGC GCGCAGCGAG CCGGGCCTGG CCGCTGCGCA AGAACAGCTG
CAGCCTGACC AGCCGCAGCG CGTGATCACC CATGCGGCCC AGGTGCTGTC CAACCTGTCG
CAGTTCGTCG GCGTGGTCAC CGCGCCGCGC AAGGCCGGCG TGTTCCACCA CATCGAGTTC
ATGCGCCTGG GCGAGCGGCG CGTGCTGGTG ATCCTGGTCT CGCCCGACGG CGACGTGCAG
AACCGCGTGA TCTTCACCGC GCGCGACCAC AGCCCGGCCG AGCTGGTGGA GGCCAGTAAC
TTCATCAACG CCCACTACAG CGGCCTCAGC ATCGAGGCGG TGCGCGAGCG GCTCAAGACC
GAGATCGACG CGCTGCGCGG CGAGATCGCC CTGCTGATGC AGGCCGCGGT GCAGTTCGGC
AGCGAAGCCG CCGACGGCGA GAACGAGCAG GTGGTGGTCT CGGGCGAGCG CAACCTGCTG
ACGATGCAGG ACTTCTCCAG CGACATGGGC TCGCTGCGCA AGCTGTTCGA CCTGTTCGAG
CAGAAGACCC AGCTGATGCG CCTGCTCGAC GTGAGCAGCC GCGCCGAGGG CGTGCGCATC
TACATCGGCG GCGAGAGCCA GATCGTGCCG TTCGAGGAGC TGTCGGTGGT GTCGGCGCCG
TACGAGGTGG ACGGCAAGAT CGTCGGCACG CTGGGCGTCA TCGGCCCCAC GCGCATGGCG
TACGACCGCA TGATCCAGAT CGTCGACATC ACCTCGCGGC TGGTGACGCA GGCGCTGAGC
CAGAAGTAG
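
The GC content field can in principle be recomputed from the gene sequence as printed. The following is a rough sketch, assuming the sequence is pasted as text in 10-base blocks separated by whitespace; the helper name `gc_content` is illustrative only.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks used to lay out the 10-base blocks.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in bases)
    return 100.0 * gc / len(bases)


# Toy input, not the gene above: 6 of 8 bases are G or C.
print(gc_content("ATGC GGCC"))  # 75.0
```
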
 
Protein sequence
MLDERAKTLL KTLVERYIAD GQPVGSRTLS RASGLELSPA TIRNVMADLE ELGLIASPHT 
SAGRIPTARG YRLFVDTMLT ARPLDMARSE PGLAAAQEQL QPDQPQRVIT HAAQVLSNLS
QFVGVVTAPR KAGVFHHIEF MRLGERRVLV ILVSPDGDVQ NRVIFTARDH SPAELVEASN
FINAHYSGLS IEAVRERLKT EIDALRGEIA LLMQAAVQFG SEAADGENEQ VVVSGERNLL
TMQDFSSDMG SLRKLFDLFE QKTQLMRLLD VSSRAEGVRI YIGGESQIVP FEELSVVSAP
YEVDGKIVGT LGVIGPTRMA YDRMIQIVDI TSRLVTQALS QK
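
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal sketch of that translation follows, assuming the pasted sequence contains only A, C, G and T plus layout whitespace. Table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons, which this sketch does not model. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()  # remove block spacing and newlines
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCTCGACG AACGTGCCAA GACCCTGTTG"))  # MLDERAKTLL
```

Applied to the last nine bases of the gene, `translate("CAGAAGTAG")` gives `QK`, matching the end of the protein sequence.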