Gene Acid345_4523 details

Gene Information

Locus tag: Acid345_4523
Symbol:
ID: 4070201
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5365752
End bp: 5367251
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 57%
IMG OID: 637986562
Product: diguanylate cyclase with GAF sensor
Protein accession: YP_593597
Protein GI: 94971549
COG category: [T] Signal transduction mechanisms
COG ID: [COG1956] GAF domain-containing protein; [COG2199] FOG: GGDEF domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
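
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5365752
end_bp = 5367251

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1500 499
```

Under these assumptions the computed values match the reported gene length (1500 bp) and protein length (499 aa).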


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.967133
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.833222
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACTCCGC AGCAGGGCAT GCAAAAAATC GCGATTCTCT ACGACGCGAG CCAGGCGGTT 
CTCTCGACCT TCCAACTGGA TGAGGTCCTG AATCAGATCC TGGCCATCGT GCGCGATTAC
TTCCATTTGG AAGGCGCGTC GGTCCTGCTA TTTGACAAAG ACCGAACCGT TCTACGCGTG
CGACTGTCGT TCGGGAAACA ACCTAGCAGT ATGGAAGTTC CTCTTGGTAA AGGGCTTACA
GGCGCGGCCG TGCATCAGAA GAGGCCCATC TACTCTCCCG ATGTCACTAA GGACAAGCGC
TATATCTGCG CCGTGGAGAG CACCCGGTCG GAACTTGCCA TTCCCCTGAT GGTCCGCGAC
GAAGTGGTGG GGGTGCTCGA TTGCCAGAGC GATCAGGTGG ACTTTTTCGA TACGGAAACC
GAGGACTTAC TGACTCTCTT CGCGACTCAG GCGTCCATCG GCATCAGTAA CGCCAAGATT
TATTCACTGG AACAGAAGCG CGCCCTCCAA ATGAGCGCGA TCGGCGCCAT CGCGAGCAAG
GCCACGGCAT CGTTAAAGTT GGAAGAGTTG CTGGTGCAGC TCTGTATGGC GATCGCGACG
AACCTGGCGT TGGATGGGAT CTCTGTTCTC ACGTGTGAGC CCGATGGAAC GTTAATTCTC
CGCGCCCAGG AAGGGTCGTT GAAGCCGGTG ATCGAAACCA GTCAGGCTTT GTTGCCGGAG
GGGAATCCCG CGGAGCGCGC GCTGTCGCGG AAAAAGCCTG TCGTGCTTCA CCCCGATCTC
GACTCTCGAC CAGTGCTCTA CGAAGGCGCC GGTTCGGAGC TGATGCTGCC CCTGGTCTCC
GCTGGGAAAC CGCTTGGCAT GATCGTAATG GGGGCAAGGA GCGATACCGG CTTTCCGGAA
GAAGATATGC AGACGCTGGT GTCGGTGGCT GACATTTGTG CAACCGCCAT CCAAAACGCC
TTTTACTTCC AGCAAGTAGA AGCACTCGCT TACGTGGACG GCCTCACCGG CGTGTACAAC
CGCCGTTTTT TCGAGCGCAA GATCACCGAA GAACTCGAGC GGGCGGCCCG CTATGAAGGC
ACGCTTTCGG TGATCATGGT GGACATTGAT AACTTCAAGA AGGTGAACGA CGAGTTCGGT
CACCTGCTCG GCGACGAGGT GCTCCGCACC GTTGCACAGA TCTTCGCGGG TGCACTTCGC
AAGCCGGACC ACTGTTGTCG TTATGGTGGT GAAGAGTTCT CAATCATCCT TCCTGAAACA
TCGGGGCCAA AGGCTCTAAA AGTCGCAGAG AAGTTGAGAG GCTTGATAGC GGATTACGAT
TTCCCTGGCA TTCCTCGCCG GATAACGATC AGCGCCGGTG TAGCGGACTT CCCGACCTGC
GGTACAACCC GCGATGACAT CGTCGGCGCC GCTGACAACT GCTTGTACCT GGCGAAGCAA
TCGGGACGCA ATTGCGTGAT GTCGCCCTAC AACATGAATA CGCCGAAGTT GTTCACTTAA
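
The gene sequence is printed in blocks of 10 bases, 60 per line, so it needs to be joined before any analysis. The following is a minimal sketch of how the blocks might be cleaned and how a GC percentage could be recomputed; the helper names `clean_sequence` and `gc_content` are hypothetical and not part of the source database.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in seq (0.0 for empty input)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "TTGACTCCGC AGCAGGGCAT GCAAAAAATC GCGATTCTCT ACGACGCGAG CCAGGCGGTT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

Passing all 25 sequence lines through `clean_sequence` and then `gc_content` would give a value to compare against the reported GC content.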
 
Protein sequence
MTPQQGMQKI AILYDASQAV LSTFQLDEVL NQILAIVRDY FHLEGASVLL FDKDRTVLRV 
RLSFGKQPSS MEVPLGKGLT GAAVHQKRPI YSPDVTKDKR YICAVESTRS ELAIPLMVRD
EVVGVLDCQS DQVDFFDTET EDLLTLFATQ ASIGISNAKI YSLEQKRALQ MSAIGAIASK
ATASLKLEEL LVQLCMAIAT NLALDGISVL TCEPDGTLIL RAQEGSLKPV IETSQALLPE
GNPAERALSR KKPVVLHPDL DSRPVLYEGA GSELMLPLVS AGKPLGMIVM GARSDTGFPE
EDMQTLVSVA DICATAIQNA FYFQQVEALA YVDGLTGVYN RRFFERKITE ELERAARYEG
TLSVIMVDID NFKKVNDEFG HLLGDEVLRT VAQIFAGALR KPDHCCRYGG EEFSIILPET
SGPKALKVAE KLRGLIADYD FPGIPRRITI SAGVADFPTC GTTRDDIVGA ADNCLYLAKQ
SGRNCVMSPY NMNTPKLFT