Gene Saro_1106 details

Gene Information

Locus tag: Saro_1106
Symbol:
ID: 3916402
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1152536
End bp: 1154830
Gene Length: 2295 bp
Protein Length: 764 aa
Translation table: 11
GC content: 65%
IMG OID: 640443841
Product: periplasmic sensor diguanylate cyclase/phosphodiesterase
Protein accession: YP_496385
Protein GI: 87199128
COG category: [T] Signal transduction mechanisms
COG ID: [COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
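
The start, end, gene length and protein length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the coordinates are 1-based and inclusive and that the gene length counts the stop codon; neither convention is stated on this page. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(1152536, 1154830)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the listed gene length of 2295 bp and protein length of 764 aa.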


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0307907
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGACGG GACCCTTCAT TCGGCAACGC CGGTCGCGGC AGACAGCCGG CGCGGCGAGC 
GCGGGGTGGA TCGTGGCAGC CACGTTCGTC GCCCTGCTGG CGCTTACGAC CGCCCTGATC
GTGGCGTTCC GAAATTCGAC CAACATCGCT GACCACTTCG CCCGCACCGA AGAACAGCGG
CTGGTCGAAC GCTTCATGGA GCGCACCGAG AAACAGATCC TCGAAGCGGA CAAGCTGCAG
GTCGTGTGGG ACGATGCCGT CACCATGCTG AACAGCCCCA AGGCCGAGGT GTGGGCGCGC
AACTTCCTGG CAGGTTACTT CTGGGGAAGT CACCGCATCG ACCGCATATT TCACGTCAGG
TTCGACGGCA GCCTCGTACG CTGCTGGCAC GGCGTGAAAC TCTCGCAGGA CTGCCGCTAC
CGGCCCCTTT CCAGGACGAT TTCGGGCCTG ATTCGCCAGT CCCTGAAAGA CCAGACCCAG
CGCGGGCAGG TGCGCGATTG GCGAAAGCAC GGCAGCGTGA ACTGGCCCTA CGATTCCAAG
GGCTTGCCCA TCGGACTCGG CCAGTCGTCG ATTGCCAGCG TCGAAGGCCA GCCGGCCATC
GTCGCGGTGG CTTCCGTCGT GCCCGACGTC ACGCCTTCAT TGCTCCAGGC AGAGCCCGAC
TACATCGTCC TCGTGCGCTT CATCGACGAG CGCATCATAT CGGACCTGCA TGAATCCCTG
GTGCTCGATG ACGTGCGCTT CGAGACTTCG GCGACCGACG ACAAGAATCG CAATTCCCTG
GCGATCAGGG ACCTGCACGG AGACCGGATC GGCTGGATAT CGTGGCTGTC AAAGCCGCCG
GGACCGGCCA TCCTGCGGCA GACGGCGCCG CTGCTGGCGG TCTACATCCT GTTCTTCGTC
GGCGTCGTGG CGGGCGGGGC GATCATCGTG CGCCGGATGC GCCGGACGAC AAGCGAACTG
ATCGCCAGCG AAGCGCAGGC GCAGCACAAT GCCCTGCACG ATGCCATGTC GGGCCTGCCG
AACCGCGCCC ACTTCATGCA ACGCCTGCGG CAGGAACTGA ACGCCTGCGT CGAACGACGC
GAACTGGGCG ACGTCTTCGT CGCCTATGTC GACATCGACC GGTTCAAGAT CGTCAACGAT
ACGCTGGGGC ACCATGTAGG CGACGAACTG GTGCGGCAGG TGGCGCTTCG CCTGCGTCGC
TCGCTCCCGC CAGGCGACTT CCTGTCGCGC TTCGGCGGCG ACGAATTCGT GCTCATGCGC
CGCACCACGG GTGGCCGCGC GGCGGCCGAC ATGCTTGGCA AGCAGATCAT GGCATTGACC
CGCGAGCCGT TCGTCATTTC CAGCAACAAC CTGGAAGTGA GCCTTTCGTG CGGGATAAGC
TGGGGCCCCG AACAGAGCGA GGACCCCGGC GAACTCCTCC GGCGGGCGGA CATCGCTCTC
TATCGCGCGA AGCAACGGGG CCGCGCGCGC TATCGCCGCT TCACGCGCGA CATGGATGCT
TCGGTCAAGC TGCGCCGCGA GATGGAAGTC GAACTGCGCC GCGCGATCGT CCGCGACGAA
CTGACGCTTG CCTACCAGCC CATCGTCCAT GCCGGGAGTG GCGCCATCGA GGGTTTCGAG
GCACTGCTGC GCTGGCCCCA CCCCGAGCGC GGCTCGATCA GGCCCGGCCT GTTCGTGCCT
GTCGCCGAAC AGGCGGGCAT GATGGTACCG CTCGGGTCAT GGGTGCTGCG ACGCGTGTTC
ACCGAAAGCC GGCAATGGCC GGATTGCGAC ATTTCGGTGA ATCTTTCGCC CCTGCAGATC
ATGTCGAGCG ACTTCCTCCA GGCGATGGAC GAACTGGTGC GCGAGACCGG GGCCGACCCG
CGGCGTTTCA TCCTCGAGGT CACCGAAGGG GTCATGCTCG ACCGCAGCGA CCATGTGCTC
GACGTGCTGA AGGGGCTCAA CTACCGGGGC TTCCGCATCG CGCTCGACGA TTTCGGCATC
GGCTATTCCT CGCTCAGCTA CCTGCGCTCG TTCCAGTTTG ACCGGATCAA GATCGACAGG
TCGTTCGTCC AGAACATCGA GGGCGATCTC GACGCCCATT CGATCCTGAA GGCCATCGTC
TCGCTCGGGC ATACCTTGCG CATGAAGGTC GTGGCGGAAG GGGTGGAGAC GCCGATGCAG
CGCGCGCTGG TCCAGGCAGC CGGCTGCCAG ATGATCCAGG GACACCTGTT CTGGGAGGCG
CTTCCGGTCG ACGAGGCGAA GGCGCTGGTC CGGCCCGCGA AAGTCCGCGG CCTGAGCAAG
GTCCGCGTCG GCTGA
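
The GC content figure listed under Gene Information can be recomputed from the gene sequence. A minimal sketch, assuming the sequence is pasted as printed here, in whitespace-separated blocks of ten bases; `clean_sequence` and `gc_content` are hypothetical helper names, and only the first printed line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First printed line of the gene sequence; paste the full sequence
# to compare against the GC content reported for the whole gene.
first_line = "ATGGGGACGG GACCCTTCAT TCGGCAACGC CGGTCGCGGC AGACAGCCGG CGCGGCGAGC"
print(len(clean_sequence(first_line)), f"{gc_content(first_line):.0%}")
```

The same cleaning step is useful before any downstream use of the sequence, since the ten-base blocks are a display convention rather than part of the data.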
 
Protein sequence
MGTGPFIRQR RSRQTAGAAS AGWIVAATFV ALLALTTALI VAFRNSTNIA DHFARTEEQR 
LVERFMERTE KQILEADKLQ VVWDDAVTML NSPKAEVWAR NFLAGYFWGS HRIDRIFHVR
FDGSLVRCWH GVKLSQDCRY RPLSRTISGL IRQSLKDQTQ RGQVRDWRKH GSVNWPYDSK
GLPIGLGQSS IASVEGQPAI VAVASVVPDV TPSLLQAEPD YIVLVRFIDE RIISDLHESL
VLDDVRFETS ATDDKNRNSL AIRDLHGDRI GWISWLSKPP GPAILRQTAP LLAVYILFFV
GVVAGGAIIV RRMRRTTSEL IASEAQAQHN ALHDAMSGLP NRAHFMQRLR QELNACVERR
ELGDVFVAYV DIDRFKIVND TLGHHVGDEL VRQVALRLRR SLPPGDFLSR FGGDEFVLMR
RTTGGRAAAD MLGKQIMALT REPFVISSNN LEVSLSCGIS WGPEQSEDPG ELLRRADIAL
YRAKQRGRAR YRRFTRDMDA SVKLRREMEV ELRRAIVRDE LTLAYQPIVH AGSGAIEGFE
ALLRWPHPER GSIRPGLFVP VAEQAGMMVP LGSWVLRRVF TESRQWPDCD ISVNLSPLQI
MSSDFLQAMD ELVRETGADP RRFILEVTEG VMLDRSDHVL DVLKGLNYRG FRIALDDFGI
GYSSLSYLRS FQFDRIKIDR SFVQNIEGDL DAHSILKAIV SLGHTLRMKV VAEGVETPMQ
RALVQAAGCQ MIQGHLFWEA LPVDEAKALV RPAKVRGLSK VRVG