Gene Acid345_0504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0504 
Symbol 
ID4069403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp620494 
End bp621777 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content60% 
IMG OID637982508 
Productresponse regulator receiver modulated metal dependent phosphohydrolase 
Protein accessionYP_589583 
Protein GI94967535 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3437] Response regulator containing a CheY-like receiver domain and an HD-GYP domain 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGTTTCG CCGGTGGTGA ACTCCTTGAG GGCGCCGTCC ACGTAGAGAC CGCGGACTTT 
CATGTCCATG GTTTTGTCTT CGGTGGGGCC GAGGAAGAGC TTGACCGTCA TGGGTTTGCC
GAAGGTGATG ACCTTCGGCT TGCCCGCCGC CGATGCGAGT CCGGTGAACA TAACCGCGAA
CTGGAGGAGT GCGATACATC TGCGCATACT ACGAGGCTAA GCCCGCAAGC GCCCGCACTT
CAACCGAAAC CCGCTCCCGG CGCGATACAA CCAATCCAGA GCAGAACCCT TCGTAAAGAC
GTGACCCACG AATTTCCCGC GAAACGTTCC CAGCACATCC TGGTGGTTGA CGACTCGCCT
GAGCTTGCCA TGCTCATGCA GGAGCTGCTG GCTGCGCATG GCTACCTGGT GCAGCTTGCC
TATAGCGCAG CCGAGGCGGA GCGGGAAATT GCGGGGCGTC CGCCGGACCT GATCTTGCTG
GATGTGCAGA TGCCGGGCAA GAACGGCTAC GACCTTTGCC GTGAGCTGAA GGACAATCCG
GAGACGCGCC TGATCCCTAT AGTGATGATT ACCGGGCTTA CCGATCGCGA GAACCGGATC
AGGGGCATAG AATCCGGCGC CGATGAATTT CTGAACAAGC CGATTTTTCC GGAAGAACTG
TTTGCGCGGG TGAAGTCGCT ATTGCGGCTG AAGGAGTTTA CCGACGAACT CGACAACGCG
GAGGCAGTGT TGTGCACCCT CGGGTTGAGC GTGGAGGCGC GTGATCCTTA TACGGAAGGG
CACTGTGAGC GGCTTTCACA ATACGCGAGC GAACTCGGTG CGTTCCTGCG GATGGGCGCC
GATGCAATCC TTGCGCTGAA GCGCGGCGGA TACCTGCACG ATCTCGGCAA GATCGCGATT
CCGGACGAGA TCCTGAAGAA GGGATCCGAT CTGACTGCAG ATGAGTGGTC GATCATGCGG
CAGCACCCGA TCATCGGGGA GCGCATTTGT CAGCCGCTGC GTTCGCTGCG AAAAGTACTG
CCGATCATTC GCCACCATCA TGAACATTGG GATGGGAGCG GGTATCCAGA CCAGTTGGTG
GGGCTTGAGA TTCCGCTGCT GGCGCGCACG CTGCAGGTTG TGGATGTTTA CGATGCATTG
CGAACGGCGC GTCCTTACAA AGCGGCCCAG TCGCATGAAC AGGCTCGCGA GACGATGTTG
CGCGAGGCGG AGCGTGGCAG GTGGGACCGC GAACTTGTGC GCGCCATGTT CGCCATGCTG
ACTGAGCGGC GAAAGGCCGC TTGA
 
Protein sequence
MGFAGGELLE GAVHVETADF HVHGFVFGGA EEELDRHGFA EGDDLRLARR RCESGEHNRE 
LEECDTSAHT TRLSPQAPAL QPKPAPGAIQ PIQSRTLRKD VTHEFPAKRS QHILVVDDSP
ELAMLMQELL AAHGYLVQLA YSAAEAEREI AGRPPDLILL DVQMPGKNGY DLCRELKDNP
ETRLIPIVMI TGLTDRENRI RGIESGADEF LNKPIFPEEL FARVKSLLRL KEFTDELDNA
EAVLCTLGLS VEARDPYTEG HCERLSQYAS ELGAFLRMGA DAILALKRGG YLHDLGKIAI
PDEILKKGSD LTADEWSIMR QHPIIGERIC QPLRSLRKVL PIIRHHHEHW DGSGYPDQLV
GLEIPLLART LQVVDVYDAL RTARPYKAAQ SHEQARETML REAERGRWDR ELVRAMFAML
TERRKAA