Gene Acid345_3659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3659 
Symbol 
ID4072262 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4329780 
End bp4330868 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content58% 
IMG OID637985682 
Productdiguanylate cyclase with GAF sensor 
Protein accessionYP_592734 
Protein GI94970686 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain
[COG2203] FOG: GAF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCAAC TCGACCCTCG AGGCCATCAT TCCACCATGT TTGACTCCGC AGGCGCCGAC 
CACAAGCGGC AGTTACAGGA GTTGGGTATC TTTCACGACG TCGCCAAAGC GCTGACGTCG
TCGCTGAACC TCGACTCCAT CCTGCAGACC ATTATGGAAA AGATGGCGGA ATATTTCCGC
CCCGATACCT GGTCACTCCT CATGGTGGAT GAGGAGAAAT CTGAGCTTTA TTTCGCCATC
GCTGTCGGTG ACGCCGCGGA GGCCCTCAAA ACCGTTCGCC TGAAACTCGG CGAAGGCATA
GCGGGATGGG TCGCACAGCA CGGCGAAAGC CTTCTCGTTC CCGACGTCTA TACCGATCCG
CGTTTTGCCA AGCGCATCGA CGAGATGACC AAGTGGCAGA CGCGCTCGAT TATCTGCATT
CCGCTCAAAT CGAAGCACCG CGTCCTCGGC GTGATCCAGT TGATCAACGT GGATATGCAA
GGCTTCGGCG GAAACGAGAT GCTGCTGCTG CAGGCGCTCG CCGATTACGC CGCCATCGCC
ATCGATAATG CCCGCGCCGT CGAGAAGATC CAGGAGTTGA CGATCACCGA CGACTGCACC
GGCCTCTACA ACGCGCGCCA TCTCTACAAA ACGCTCGAAG CCGAGGTCTA CCGCTCCACC
CGTTTCGGCT ACGAGTTCAG CATCCTGTTC CTCGACCTCG ACCACTTCAA GTCGGTGAAC
GATACGCACG GCCACCTCGT CGGCAGCCGC TTACTCGCGG AGATCGGCTA CGCCATTAAG
GCGCACCTTC GCTTGATTGA CTATGCCTTC CGCTATGGTG GCGACGAATT TGTCGTGCTG
TTGCCGCAGA CGTCGAAAGA GTCGGCGTTA GTTGTAGCCC GCCGCTTGCG CGATGTTTTC
CGTAATGAAT ATTGGCTGAA ACCAGAGGGT CTGAATCTCA ATGTGCGCGC CAGCATGGGT
GTCGCAACGT TCCCGGAGGA CGCAAAGTCC TCGCACGAAA TCATCCGCCA GGCCGACGAA
ATGATGTACG CGGTAAAAAA CACCACGCGC GATAACGTAA GCGTCGCCCG CCAAGGCATG
CTGACCTAA
 
Protein sequence
MSQLDPRGHH STMFDSAGAD HKRQLQELGI FHDVAKALTS SLNLDSILQT IMEKMAEYFR 
PDTWSLLMVD EEKSELYFAI AVGDAAEALK TVRLKLGEGI AGWVAQHGES LLVPDVYTDP
RFAKRIDEMT KWQTRSIICI PLKSKHRVLG VIQLINVDMQ GFGGNEMLLL QALADYAAIA
IDNARAVEKI QELTITDDCT GLYNARHLYK TLEAEVYRST RFGYEFSILF LDLDHFKSVN
DTHGHLVGSR LLAEIGYAIK AHLRLIDYAF RYGGDEFVVL LPQTSKESAL VVARRLRDVF
RNEYWLKPEG LNLNVRASMG VATFPEDAKS SHEIIRQADE MMYAVKNTTR DNVSVARQGM
LT