Gene Acid345_2035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2035 
Symbol 
ID4073204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2437727 
End bp2438752 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content61% 
IMG OID637984049 
Productserine phosphatase 
Protein accessionYP_591110 
Protein GI94969062 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG1734] DnaK suppressor protein
[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACCG CAGAAACCGC CTACATCAAC CAGATCCGCG AACAACTGGT AACGCGTCGC 
CACAATCTCG AGACCGCGAT CACCAAGCAT GAATCGGCGC AGATTACGCA CCTCTTGCAC
GATGTGGATC AGGCGTTGGC TAAGCTCGAG ACCGGGAACT TTGGGGTCTG CCAAAACTGT
CATGAGTCCA TCGAAGTGGA CCGCGTGATG GCTGACCCGT TGGTGACGTT CTGCCTCGGC
TGCCTCACAC CCGCGCAGCA GCGCAGCCTC GAACAGGACC TCGAACTCGC AGCCCGCATG
CAGATCGGCC TGCTGCCGCC CGACGATTCC GCGGTAGCTG GTTGGGAGAC AGCCTTCCAT
TTCCGTCCCG CGCGTGTGGT CAGCGGCGAC TACTTCGACA TCATCGGCGA CGATCACGGC
GGAATGTACT TCATCATGGC CGACGTCGCC GGCAAAGGCG TTGGTGCCGC GATGCTTACC
GCCAGCCTCC GCTCGGTATT CCGCGCGCTC ATCCCAACCG CCGATTGCGT GGGCGAACTC
CTCACCCGCG CCAACCGCCT CTTCTGCGAG AGCGCCATGT CCGGCCAGTA CGCCACCCTG
GTGTTCGGCC ATGTGAATTG TGACGGCGCA CTCGACGTCG CCAACGCCGG TCATTTGCCA
TTACTACTAG CGAAGGGAGC GGATTTGGAG GTTATCGAGA GCACCGACTT GCCCTTTGGC
ATGTTCTGCT CTCAGCAGTT CACCGTGCAA CGGACTTCCC TGCAACCAGG CGATACGCTG
GTGCTCTACA CCGACGGAAT TTCCGAGGCG CTGAACGAAG CGGGCGAAGA ATTTGGAGTC
GAACAGATGC GCGAGTTCGT CCAGTCGCAC GGAACGAAGT TGCCCTGCGA GATGGTGAAG
AACTGCCGCG AGCGCCTCGA TGGCTTCCGC GGAAACGTCG AGCGCTTCGA CGACGAGACG
ATGCTGGCGA TCCAATTCGC TCCCGCCAGC AAGCTGAGCG AACCGCGGCA TCACGCCGTG
ATGTAA
 
Protein sequence
MTTAETAYIN QIREQLVTRR HNLETAITKH ESAQITHLLH DVDQALAKLE TGNFGVCQNC 
HESIEVDRVM ADPLVTFCLG CLTPAQQRSL EQDLELAARM QIGLLPPDDS AVAGWETAFH
FRPARVVSGD YFDIIGDDHG GMYFIMADVA GKGVGAAMLT ASLRSVFRAL IPTADCVGEL
LTRANRLFCE SAMSGQYATL VFGHVNCDGA LDVANAGHLP LLLAKGADLE VIESTDLPFG
MFCSQQFTVQ RTSLQPGDTL VLYTDGISEA LNEAGEEFGV EQMREFVQSH GTKLPCEMVK
NCRERLDGFR GNVERFDDET MLAIQFAPAS KLSEPRHHAV M