Gene Acid345_0204 details


Gene Information

Locus tag: Acid345_0204
Symbol:
ID: 4069673
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 216440
End bp: 217801
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 57%
IMG OID: 637982204
Product: diguanylate cyclase
Protein accession: YP_589283
Protein GI: 94967235
COG category: [T] Signal transduction mechanisms
COG ID: [COG3300] MHYT domain (predicted integral membrane sensor domain)
        [COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
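
The start/end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given inclusive 1-based coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Number of amino acids, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(216440, 217801)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the span 216440 to 217801 gives 1362 bp, and 1362 / 3 = 454 codons, one of which is the stop codon, leaving 453 aa.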


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.209858
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.343798
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACACAA TTGCTGTGAC GTATGTCCAT TGGCTGGTTG CTGCTTCGAT CGCGACTTCC 
GTTGTTGCCT CCTACGCAGC GTTCAGTTTT GCTGAGCGTG TCGCGGCGTC AGACCGTCAA
CGCTCCCTTG GATGGCTGAT TGCCGGCGCC TTCGCCATGG GACTCGGCAT CTGGTCGATG
CACTATCTCG GCATGCTCGC GGTACAACTG CCGGTGCCAG TCGTTTATCA CGTTCCTACG
GTAATCATTT CCTTGTTGCT CGCGGTTGCG GCCTCCGCAG CGGTGTTGTG GGTGGTGAGC
CGTGAGAGTC TTTCGGGGCG GGCGATCGTC TTGGGCAGCG TTGCCATGGG CGGCGGCATC
GGCGCGATGC ACTATACGGG TATGGCAGCA ATGCGTTCTT ATGCCATGCA TCGCTATAAC
CCGTCACTGG TTTTGCTCTC GCTTGTAATC GCAGTGGCTT TTTCCTGGAT GGCGTTGCGC
ATCACCTTCC TGCTTCGAAG GGAGCCGGGC GCCCATGAAC TGCGCCGCAT GGGCGGAGCG
GTCCTGATGG GCATTGGCAT CGCCTCGATG CACTACACCG CTATGTTTGC CGTGACATTC
GAACGGGGCA ACACAGAATT CTCCACCGTC CATACGGTTC CGGTGACCCA TATCGACCAG
TTGGGCATCG TCGTAATGGC TGGCATGGTT TTATTTGGCG CGCTCATTTC CGCATACTTC
GATCGGCAGA TGAGCCGTGA CCTGCGCGTC TCGAATGAGC GACTGGCTGA GATGCAGGTG
GCGCTTCTTC AGAGAGAGAA AGAACTGAAG GAGGCGGTGG CCAAGCTGGA AGAGTTGTCG
ACCCGCGACG GATTAACGGG TTTGTACAAC CGCAGATTTT TCGATACGAC CCTGACAGCC
GAGTGCAAAC GCGCCGCACG TGCCAACTAT CCAATTAGCC TTTTGATTAT TGATGTTGAC
TGCTTCAAAG CACTCAACGA CCACTACGGC CATCTCGTGG GAGACGACTG CCTTCAGAAG
GTGGGAAAGA GCTTGGCGAG TGCGGTTCGG CGCACCTCGG ATGTAGTGGC GCGATACGGA
GGAGAAGAGT TTGCCCTCAT CCTCCCCAAT ACCAGCGAGG AGAGCGCCAC AACCATCGGA
GAAAATATTC GCCGTGCAGT TCTCGGCCTG GAGATCGCAA ACGCCAACTC CACTGCGGGC
CCCTTCGTTA CTCTGAGCGT AGGTGTATGC ACGCGTCGCC CTAGTCATCC TCGATCAAGC
GAGGATATTC TCGTGACGAC CAATGAGATT ATCCGCGCGG CGGATGAGGC GCTGTATCGA
GCGAAACGCG AAGGACGGAA TCGAGTGCTG CTGGGGGCTT GA
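
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. The sketch below shows one way to compute a GC percentage that can be compared with the GC content field of the record. It assumes GC content means the share of G and C among all bases; `gc_percent` is a hypothetical helper name, and how the database rounds its reported value is not stated in the source.

```python
def gc_percent(seq):
    """Percentage of G and C bases in a sequence, ignoring whitespace."""
    dna = "".join(seq.split()).upper()  # drop the block spacing and newlines
    if not dna:
        raise ValueError("empty sequence")
    gc = dna.count("G") + dna.count("C")
    return 100.0 * gc / len(dna)


# First block of the gene sequence above: 2 G + 2 C out of 10 bases.
print(gc_percent("ATGGACACAA"))
```

Passing the complete gene sequence as one string (blocks and line breaks included) gives the value for the whole gene.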
 
Protein sequence
MDTIAVTYVH WLVAASIATS VVASYAAFSF AERVAASDRQ RSLGWLIAGA FAMGLGIWSM 
HYLGMLAVQL PVPVVYHVPT VIISLLLAVA ASAAVLWVVS RESLSGRAIV LGSVAMGGGI
GAMHYTGMAA MRSYAMHRYN PSLVLLSLVI AVAFSWMALR ITFLLRREPG AHELRRMGGA
VLMGIGIASM HYTAMFAVTF ERGNTEFSTV HTVPVTHIDQ LGIVVMAGMV LFGALISAYF
DRQMSRDLRV SNERLAEMQV ALLQREKELK EAVAKLEELS TRDGLTGLYN RRFFDTTLTA
ECKRAARANY PISLLIIDVD CFKALNDHYG HLVGDDCLQK VGKSLASAVR RTSDVVARYG
GEEFALILPN TSEESATTIG ENIRRAVLGL EIANANSTAG PFVTLSVGVC TRRPSHPRSS
EDILVTTNEI IRAADEALYR AKREGRNRVL LGA
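
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own procedure. It uses the standard codon assignments, which translation table 11 shares with the standard genetic code (the two differ in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the usual T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # remove block spacing and newlines
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGACACAA TTGCTGTGAC GTATGTCCAT"))  # MDTIAVTYVH
```

The first 30 bases of the gene give MDTIAVTYVH, which matches the first block of the protein sequence listed above.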