Gene Caul_4493 details

Gene Information

Locus tag: Caul_4493
Symbol:
ID: 5901954
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4865489
End bp: 4867156
Gene Length: 1668 bp
Protein Length: 555 aa
Translation table: 11
GC content: 73%
IMG OID: 641565012
Product: diguanylate cyclase/phosphodiesterase
Protein accession: YP_001686111
Protein GI: 167648448
COG category: [T] Signal transduction mechanisms
COG ID: [COG2199] FOG: GGDEF domain
        [COG2200] FOG: EAL domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
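
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the last codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1  # inclusive span
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Coordinates copied from the record.
gene_bp, protein_aa = cds_lengths(4865489, 4867156)
print(gene_bp, protein_aa)  # 1668 555
```

Under these assumptions, the result agrees with the listed gene length of 1668 bp and protein length of 555 aa.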


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTTTA AGATCAGTCA GCGACGCGTA TGGGACGCCA CGGCCACGCT GGAGGCCTTG 
GGCGCCGCCG AAGTCGCGCT TTGGCTCTGG GAGCCGGAAA TCGACCGGCT GCGCGTCAAC
GGCGCGGCGC GCGCCCTGGG CCTGGGTCCG CTGGCCCCCG ACTGTTCCTC CGCCGCCTTC
CGCGCCCTGA CCCTGCCGCA GGACCGCGCC CAGGCCGAAG ACATCCTGAA GGTCCGCGAG
CCGGGCAGCG AGATCATGGC CCGCCTGCGC ATGCGCGGCG GCGGCACCTG CATCTGGCGC
GGCGTGTGGC TGGAAGACGG CGTCCGCGCC GCCGGCGTCG CCGCCCCCGA GGTCAAGTTC
GCCGCCTCCG AGCGCGACAG CCTCACCGGC CTGATGGATC GCAAGAGCTT CATCGCCCGC
GCCCGCGAGC GGCTGGCGCG GCCGGGCATG CACGAGCTGG TCGTCGCCGA CCTCGACCGC
CTGCGCCGCC TGAACGAGGC CCTGGGCCAC GAGCGCGCCG ACCTGGTGCT GGCCGCCCTG
GGTTCGCGCC TGGCCGCCGC CTTCCCGTCC GAGGCCCTGC TGGGCCGGAT CGGCGAGGAC
GAATTCGCGG TGCTCTGCGA CCCGGCCGGC TTCGAGCCCG CCGAGCTGCT GCGCAACGCC
TTGGAGCAGC CCCTGCGGGT GGCCGGCTTC GACATCCATC CCACCCTGTC GATCGGCGCC
GTCTCGGCCG AAGGCGGCGA GGACGCGCCA GAAGCCGCCG AGCTGCTGCG TCGCGCCGAG
CTGGCCGTCG AGGCCGCCGC GTCCGGCGGT CGTGGCGGGG CCGCCGCCTA CGGGCGCTCG
ATGGAGACCG ACGGCCTGTC GCGCCTGGCC CTGGAGGCCG ATCTGCGCGG CGCCATCGCC
CGCGGCGAGA TCCGGCCCTA TTTCCAGCCC GTGGTGCGGC TGTCGACCGG CGCCCTGTCT
GGCTTCGAGG CCCTGGCCCG CTGGATCCAT CCGCGCCGCG GCATGCTGCC GCCCGACGAG
TTCCTCCCGC TGGTGGAGGA GATGGGCCTG ATGAGCGAGC TGGGCGAGCA CATGATGCGC
ACCTCGGCCC GGCAATTGGC TGCCTGGCGC GACACCCATC CGGCCGTCGG CGCCCTGACC
GTCAGCGTCA ACCTGTCGAC CGGCGAGATC GACCGCCCCG GCCTGGTCCA CGACGTGCGC
CAGGTGCTCA AGGAGCACAA CCTGCCGCGC GGCGCGCTGA AGCTGGAAGT GACCGAGAGC
GACATCATGC GCGATCCCGA GCGGGCCGCC GTGATCCTGC GAACCCTGCG CGACGCCGGG
GCCGGCCTGG CGCTGGACGA CTTCGGCACC GGCTTCTCGT CGCTGTCGTA CCTGACCCGC
CTGCCGTTCG ACACGCTGAA GATCGACCGC TACTTCGTGC GGACCATGGG CAGCAACGCC
GGCTCGGCCA AGATCGTCCG CTCGGTGGTG AAGCTGGGCC AGGACCTGGA CCTGGAAGTC
GTCGCCGAGG GCGTCGAGAA CGCCGAGATG GCCCGCGCCC TGCAGGCCCT GGGCTGCGAC
TACGGCCAGG GCTTCGGCTA CGCGCCGGCC CTGTCGCCGC AGGAGGCCGA GGTCTATCTG
AACGAGGCCT ATGTCGACGG CGCCGCGCCG GTGAAGGCTC GGGGTTGA
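
A GC content figure such as the one in the gene information can in principle be recomputed from a sequence printed in this layout. The following is a minimal sketch, assuming GC content means the percentage of G and C among the A, C, G, and T bases. The helper name `gc_percent` is hypothetical, and the example call uses only the first two 10-base blocks of the gene sequence.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; spaces and line breaks are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence, kept in the 10-base layout.
print(gc_percent("ATGTCGTTTA AGATCAGTCA"))
```

Passing the full gene sequence instead of the two sample blocks would give the value for the whole gene.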
 
Protein sequence
MSFKISQRRV WDATATLEAL GAAEVALWLW EPEIDRLRVN GAARALGLGP LAPDCSSAAF 
RALTLPQDRA QAEDILKVRE PGSEIMARLR MRGGGTCIWR GVWLEDGVRA AGVAAPEVKF
AASERDSLTG LMDRKSFIAR ARERLARPGM HELVVADLDR LRRLNEALGH ERADLVLAAL
GSRLAAAFPS EALLGRIGED EFAVLCDPAG FEPAELLRNA LEQPLRVAGF DIHPTLSIGA
VSAEGGEDAP EAAELLRRAE LAVEAAASGG RGGAAAYGRS METDGLSRLA LEADLRGAIA
RGEIRPYFQP VVRLSTGALS GFEALARWIH PRRGMLPPDE FLPLVEEMGL MSELGEHMMR
TSARQLAAWR DTHPAVGALT VSVNLSTGEI DRPGLVHDVR QVLKEHNLPR GALKLEVTES
DIMRDPERAA VILRTLRDAG AGLALDDFGT GFSSLSYLTR LPFDTLKIDR YFVRTMGSNA
GSAKIVRSVV KLGQDLDLEV VAEGVENAEM ARALQALGCD YGQGFGYAPA LSPQEAEVYL
NEAYVDGAAP VKARG
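
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. This is a minimal sketch, not the database's own method. It builds the codon table from the NCBI ordering of bases (`TCAG`) and amino acids, which is the same for translation table 11 as for the standard code. It does not handle the alternative start codons that table 11 allows, which is sufficient here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence.
print(translate("ATGTCGTTTA AGATCAGTCA GCGACGCGTA"))  # MSFKISQRRV
print(translate("GTGAAGGCTC GGGGTTGA"))               # VKARG
```

Both fragments reproduce the corresponding ends of the protein sequence, with the final TGA codon read as a stop.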