Gene Acid345_3244 details


Gene Information

Locus tag: Acid345_3244
Symbol:
ID: 4072579
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3841095
End bp: 3842138
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 60%
IMG OID: 637985265
Product: heat-inducible transcription repressor HrcA
Protein accession: YP_592319
Protein GI: 94970271
COG category: [K] Transcription
COG ID: [COG1420] Transcriptional regulator of heat shock gene
TIGRFAM ID: [TIGR00331] heat shock gene repressor HrcA
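
The coordinate and length fields above are consistent with one another, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3841095
END_BP = 3842138


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming one trailing stop codon is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1044
print(protein_length(length_bp))  # 347
```

Both results agree with the Gene Length and Protein Length fields in the record.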


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.488784
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0534413
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGAAC CTGTCCAAAT CGGTAGACGC GAGCGGGAGA TCCTTACCGC CATTGTCGAG 
ACCTATATCT CGACCGGTGA ACCCGTTGGC TCGCGCACGC TTTCTCGCGG TAGTCGCGAG
GGCCTGAGCG CCGCCACCAT TCGCAACGTG ATGGCCGACC TCTCCGATGC CGGGCTCCTC
GATCAGCCAC ATACTTCTGC CGGACGCGTT CCCACCGCCG CCGCCTATCG CTATTACGTG
AAGGGACTCA CCGGCGAAGC CAGCCTCGCA CCCGCCGAAG AAGGCCTGAT CAAAGACACT
TTCCACGGCG TCAACGATAT GCAGGAGTTC ATGGAGCGCA CCTCGCACGT GCTCTCGCTC
CTTTCGCAAA ACGTTGGAGT CGCCGTCGCG GGCATCGGCC CGAAGAATGC CCTCGAACAC
GTGCACTTCC AGCGTCTCGC GGAATCCAAG GTCCTCTGTG TCGTCGTCAA TAAGAACGGC
ATTGTGCGCG ATCGCATCAT GCGGCTCGGC AAAGACATTC CGCAGCTCGA ACTCGACGCG
GCCGCGCGTT TCCTCAACGA GAACTACCGC GGCTGGATCA TGGAAGAAAT CCGCGTCGAT
CTTGCGCGCC GCCTCGACCA GGAGCGCAGC GAATACGACC GCCTGATGCA CTCGGTCGAA
GAGCTCTATA AAAAAGGTGC TCTCGAATCC GAGACTACGC AGGATGTCTA CATCGAGGGC
ACTTCGAACC TGGTGGTTGA CGACCACGAC CGCGACCGCC TGCGCGAGCT GCTCAAGACG
CTTGAAGAAA AGCAGCGTCT CGTAAACCTG CTCTCCGCCT ATGTGGACGT GCGCCAGGAG
GCCGTCCGCG TGGTTGTGGG TCTTGAAGAC ACCTTGCCGC ATCTAAGTAA TTTCGTACTG
ATCGGAGCGC CCGCTCGCGT CGGCAACGAG GTCATGGGTT CTCTCGCAGT CATCGGGCCT
ACCCGCATTG ATTACGAGCA CACCATCTCG GCCGTTTCGT ATATCGCGCG CCTCTTCGAT
CACATCTGGA ATGATTCGGA ATAA
 
Protein sequence
MSEPVQIGRR EREILTAIVE TYISTGEPVG SRTLSRGSRE GLSAATIRNV MADLSDAGLL 
DQPHTSAGRV PTAAAYRYYV KGLTGEASLA PAEEGLIKDT FHGVNDMQEF MERTSHVLSL
LSQNVGVAVA GIGPKNALEH VHFQRLAESK VLCVVVNKNG IVRDRIMRLG KDIPQLELDA
AARFLNENYR GWIMEEIRVD LARRLDQERS EYDRLMHSVE ELYKKGALES ETTQDVYIEG
TSNLVVDDHD RDRLRELLKT LEEKQRLVNL LSAYVDVRQE AVRVVVGLED TLPHLSNFVL
IGAPARVGNE VMGSLAVIGP TRIDYEHTIS AVSYIARLFD HIWNDSE
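
As a rough cross-check of the two sequences above, the sketch below translates a nucleotide string codon by codon and computes its GC percentage. It assumes the spaces in the listing are display grouping only, and it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons, which this sketch does not model). The helper names `clean`, `translate` and `gc_content` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove whitespace used for display grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing above.
first_line = (
    "ATGTCGGAAC CTGTCCAAAT CGGTAGACGC GAGCGGGAGA TCCTTACCGC CATTGTCGAG"
)
print(translate(first_line))  # MSEPVQIGRREREILTAIVE
print(round(gc_content(first_line), 1))
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Running the same functions on the full gene sequence would allow a comparison against the reported protein and the rounded GC content field.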