Gene Acid345_4131 details


Gene Information

Locus tag: Acid345_4131
Symbol:
ID: 4072322
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4889602
End bp: 4890711
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 58%
IMG OID: 637986162
Product: diguanylate cyclase
Protein accession: YP_593205
Protein GI: 94971157
COG category: [T] Signal transduction mechanisms
COG ID: [COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
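
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function name `expected_lengths` is hypothetical and not part of any database API.

```python
def expected_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not counted as an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


gene_len, protein_len = expected_lengths(4889602, 4890711)
print(gene_len, protein_len)  # 1110 369
```

Under these assumptions the result agrees with the listed 1110 bp gene length and 369 aa protein length.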


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCCCTG ACGACGCTAC CGACTCCATT CCCCCAGAGC GGTTCGTGGT GGAAGTGTTC 
TCCGAACTCG ACCAGGCGAC GGTGCAGCGT CGCTGGCGAG AGGTGAATAC GATCCTGCGC
CTCGCCATGC TCGGCGGAAT GCAGATCCAA CTCGAAGCCA CACTCAACAT GCTGCTCGAC
TTCGCGGCCG AGATCTCATT CTTCGAAAAA TCTCTCGTCT ATTTCTGGGA AGAAGACAGC
GAACAAGTGA AGCTGCGCCT CGCAGGAGGT ATGGACCGCG AGACCGCCGA ACCCTTTGTG
CGCGGCAACA TCTTTAATTT TTGGGCGACA CGGTTTGGGC GTCCGCTGCT GGTGACGGCG
GGACATAACC TGCTGTCGGA TGCAGCACTG GAATCGCTGG GAGCGCGCTC AGCTTTGATC
GTGCCGCTGG TGGTCAGCAA TAAAGTCATC GGCTCCATGC AGTTGTTCTC CGCCGAGCAC
GAATCCTTCA CACGTGAAGA CGCGCAGCTA TTTTGGATGC TGACGCTCGT CGCCGAGAAC
CAACTCACCC GCGACTACGA AAACGAAGGC CTGATCAAGT TTGCATTCAC CGACTTCCTG
ACTGGGCTGA AGACCCGCGG ATACTTCGAG CAGCAGTTGG AACTCGAGAT CAAACGCGCG
GAACGAAAGC GCACGCCGAT CGCGATGCTC ATGCTCGATC TCGATCACTT CAAGACCTTG
AACGATACGT ATGGGCATCA CGTGGGCGAC CAGGTATTGC GCGATGTTTC CGCAGTGTTG
ATGAAAGACT TGCGCGAAGT GGACACCGCC GCGCGATACG GCGGTGAAGA GTTCGTCATT
GTGCTGCCGG AGACGAACAC CACGGGTGCG ATGCAGGTGG CGAACCGTAT TCGTCGCGGC
GTGGAGCAGG CAAAGTTCTT TGCAGGCTCG CCGCGGCAGG TCGAGCGACT GACCATCAGT
ATCGGGATTG CGATTTACGA CCAGGATGCG CAATCGAAGC GTGAACTGAT TGAAGCGGCG
GATGCGGCGC TGTACCAGGC AAAGGGCCAG GGTCGCAATC AGGTGGTGAC GCACGCGGAA
CTCGGTGTGA AGAAGAAGGA AGTGTCGTAG
 
Protein sequence
MLPDDATDSI PPERFVVEVF SELDQATVQR RWREVNTILR LAMLGGMQIQ LEATLNMLLD 
FAAEISFFEK SLVYFWEEDS EQVKLRLAGG MDRETAEPFV RGNIFNFWAT RFGRPLLVTA
GHNLLSDAAL ESLGARSALI VPLVVSNKVI GSMQLFSAEH ESFTREDAQL FWMLTLVAEN
QLTRDYENEG LIKFAFTDFL TGLKTRGYFE QQLELEIKRA ERKRTPIAML MLDLDHFKTL
NDTYGHHVGD QVLRDVSAVL MKDLREVDTA ARYGGEEFVI VLPETNTTGA MQVANRIRRG
VEQAKFFAGS PRQVERLTIS IGIAIYDQDA QSKRELIEAA DAALYQAKGQ GRNQVVTHAE
LGVKKKEVS
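
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: it assumes the sequence is pasted with the space-separated 10-base blocks shown here, uses the codon assignments of translation table 11 (which match the standard code; the table differs only in its permitted start codons), and stops at the first stop codon. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for translation table 11 in NCBI order: codons are enumerated
# with T, C, A, G at each of the three positions; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq):
    """Remove whitespace (block spacing, line breaks) and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate in frame 1 until the first stop codon or the end of input.

    Alternative start codons are not treated specially here; the gene
    above begins with ATG, so that does not matter for this record.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above (60 bases).
first_line = ("ATGCTCCCTG ACGACGCTAC CGACTCCATT "
              "CCCCCAGAGC GGTTCGTGGT GGAAGTGTTC")
print(translate(first_line))  # MLPDDATDSIPPERFVVEVF
```

The printed residues correspond to the first two 10-residue blocks of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the listed GC content.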