Gene Acid345_2398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2398 
Symbol 
ID4071396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2838735 
End bp2840255 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content55% 
IMG OID637984414 
Productperiplasmic sensor diguanylate phophodiesterase 
Protein accessionYP_591473 
Protein GI94969425 
COG category[T] Signal transduction mechanisms 
COG ID[COG4943] Predicted signal transduction protein containing sensor and EAL domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.625554 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAAAGT CTCCCATCAC CCCAGGATCG ACAGCTCTGC TCGTCGGACT TTTGGTCTTG 
CTGACGTCCT ATTTGGCCAT GGGCTCACTC GAGCGGTCGG TTGCGCGCCG GGAACTTGCG
CTGCAGGCAG CCATGATGCT TCGAGGATTT CAGGACGGCT ACCAGAATGC GCGCGCCGAC
CTGGCGAAGC TACCGTCCAT GGAAGAAATG AATTGCCGCG ATGGAATCAG TGACACGCTG
GCGCGCCTCA ATTTCGATAA CCAATACGTC CGATGGTACG GAATCGCGCA AGAGGGCAAG
GTGATCTGCC GTGGGCCACG GGTTGGCGTT GACTTGTCCG ATGCACGCTT CCATCGCATT
GATGATGAGT GGTTCCTGAT CTCGACAGAG TCGCCCACGA AAGCAAGCAA CTTGTTGCTG
GCGCAAAAAC GCGGCAACCT TTTGTATCTC GCCATGCTCG AACCCTTGCT GTTCGACTTT
ATGCATGAGG TGGATTGCAA AGCATGTGTG TCGTTCGAAT TCCTGGTGAG CGCCCAGCCG
AACGTTGAAA TGGAGTCTGC TCCCGCATCC GGCCCGTCTG TAATCCATTA CGTGGTGGAA
AAAACGCGGC TCAACGCGCA AATGAAATTC ACGCTGAATG CTACGCAGGA GTATGTGGAT
GCGTTCGCGT TCCCTGGGCG CGTACTGTCG ATGACGATCG CTGCCGCCTT CGGTCTTGTG
ATCGGACTTT CGGTGTACGG GAATTTAACA AAATACACAT CGACGGCGTT TCTCATCGAA
CAGGGCCTGA AGCGAAACGA GTTCCTTCCT TTCTATCAAC CCATTATTGA CAGTCGTGAC
GGATCGATTC TTGGGGCGGA GGCACTCGTC CGCTGGCAGC CGAAAGGCGG AAACCTTATT
CCGCCCGGGC AGTTCATTCC GTTTGCCGAA GAGAACCATT TAATTGATCC CATTACCGAC
CAATTGGAAG AGAAGGTGCT GGACGATATC AAACAATTCG GCTGGCAAGA CTCCAGTCGA
TTCGTCAGCA TTAATGCGGT CGCGGAGCAG ATAACGGACA CGCCCTTTTG TGCGAACCTG
CTGCGACGAC TCGCAGAGAA GCGCATCCCG GCGAAGAATT TCTCAGTGGA GATTACGGAG
CGGCATCAAT TCCCCGATCT CGACCGCGGG CGAGCCGCGC TGCAGTCCTT GGTAGAAGCC
GGTATCGAGA TCAAGCTCGA CGACGCAGGC ACCGGATTCG GCGGCTTTTC CTACATCCAG
GAATTGCCGA TCACCACATT GAAGATCGAC AAGATGTTCA TTGATACGCT TCGGCAAGAG
AAGCAGGACC CCAAACGCGC GGTTCTGCAG GCGATTATTG AGTTCGCAAA GACTGCCAAT
CTTCACGCAA TAGCCGAGGG TGTAGAGACC AAAGAACAAG TCAGCCAGCT GAGCGCGGCC
GGGGTCTTCG CCATACAAGG CTACGTGTAT TCCAAGCCGA TGCCGGCAGA AGAGTTCATT
CGCTGGATGA ACGCGCGCTA G
 
Protein sequence
MKKSPITPGS TALLVGLLVL LTSYLAMGSL ERSVARRELA LQAAMMLRGF QDGYQNARAD 
LAKLPSMEEM NCRDGISDTL ARLNFDNQYV RWYGIAQEGK VICRGPRVGV DLSDARFHRI
DDEWFLISTE SPTKASNLLL AQKRGNLLYL AMLEPLLFDF MHEVDCKACV SFEFLVSAQP
NVEMESAPAS GPSVIHYVVE KTRLNAQMKF TLNATQEYVD AFAFPGRVLS MTIAAAFGLV
IGLSVYGNLT KYTSTAFLIE QGLKRNEFLP FYQPIIDSRD GSILGAEALV RWQPKGGNLI
PPGQFIPFAE ENHLIDPITD QLEEKVLDDI KQFGWQDSSR FVSINAVAEQ ITDTPFCANL
LRRLAEKRIP AKNFSVEITE RHQFPDLDRG RAALQSLVEA GIEIKLDDAG TGFGGFSYIQ
ELPITTLKID KMFIDTLRQE KQDPKRAVLQ AIIEFAKTAN LHAIAEGVET KEQVSQLSAA
GVFAIQGYVY SKPMPAEEFI RWMNAR