Gene Saro_1435 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1435 
Symbol 
ID3916099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1478581 
End bp1481271 
Gene Length2691 bp 
Protein Length896 aa 
Translation table11 
GC content64% 
IMG OID640444178 
Productdiguanylate cyclase/phosphodiesterase with PAS/PAC sensor(s) 
Protein accessionYP_496713 
Protein GI87199456 
COG category[T] Signal transduction mechanisms 
COG ID[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.665756 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGCAG AGCAGTTTCC GGAAGTAACG AGCACCGCAG GCCGCGCGCG CCACTTTGCG 
CGTACGCGGC GCCGTGAGCG TTCGGCACAG CCGGCGTTCC GCGAAGCGGG GGCGACTTTC
CTGTTCGGAG GCGACCCCTC GAACCGTCTT GTGGCACCGG AATGGCCGAA GATGGCAATT
CCGATGGCGC TCGCACTGGC CGGGCTTGCC CTTTTCAACT TGTCCATCCC CACGATCCTT
CCGGCACTGC TCGCGCTTCT GGCGCTGGTT GCCACGCACT TCTCATCCAT TGCCGTCAAG
ATCGAGTCGG AGCGGAAGCT TGAAACTGGA CAGCGCCTTG CGTTGATGGG ACTGGCGGTT
GCCTTGCCGA TGTTCCTGTT TGGGTCGGCG ATGGGCTTCT GGGGGGAGTC AGGGCAGGTC
GCATGGTTCG GCTGTCTCGC CGTCGTGGTC ACGACTGGTG TCCTGGCGAC GGTGGTCCAG
TCAGGCCGCC TTGTCGCGGT GATCGCCGCC CAGGTAGGCA TCTGGTCCGG TGTGACCTGT
GTCGCGAGCC GTGCCGGCGG CTATTTTGCG CTGGTTCTCG GCGTTGCAAT TGCGATTGCA
GCCTATCTGC GGCAGCGGAA AATCGACTTG CATGCCAGGA AGAAATCCGA AAGCGACCAG
CGCGCCCAAA CCCGGGCCGA GGAGATTCTC GCAGACTACG AGGAAACCGG GCAGGGCTGG
TTCTGGGAAA CAGACAGGCG GGGACACCTT GCCTACCTTT CCAGTCCGGT CGCCGCGATC
CTCGGCCGCC GGGTGGAGGA GCTTGTCGGC CGACCGTTTT CGGAACTGTT CAACCTGACT
GAGCAGTCCG GGGAAGGCGA GCGCACACTG GCGTTCCACC TTTCCGCCCG GTCCAGCTTC
AGTGAACTCG CGGTGCGCGC GGCGACGCGC GATGAGGAGC GATGGTGGTC GATTACGGGA
CGGCCAACCT ATGACACATT CAACAACTTC GCGGGGTTCC GCGGATCCGG GACCGACCTG
ACGGAGAAGC GGCGCTCGCA GGAGCACGCA TCGCGGCTGG CCCATTTCGA TTCGCTCACC
GGTTTGTCGA ACCGGTTTCG CATGTCGCAA ACTCTCGAGA AGATCCTCCT TGCGCCGCAA
GTGCAGCACC GCGCTTGCGC GGTGTTTCTG CTCGACCTGG ACCGTTTCAA GCAGGTCAAC
GATACGCTCG GGCACCCTGC AGGCGACGCT CTGCTCAAAC AGGTGGCGCA GCGCCTGGAG
CGCGTCGTAG GCAAGATCGG GCGGGTCGGC CGCCTTGGCG GCGACGAGTT TGAAGTGATC
CTGCCCGGGA AGATGGAGCG TGGGCAGCTC GGGCACATGG CCGGGCGTAT CATCGAAAGC
CTTTCGCAGC CCTATTCGAT CGAGGGCAGC CGCGTCATGA TCGGGGCATC GGTCGGCATT
GCGCTCGCTC CTGACGATGG TGTGACCAGC GAGGCGCTGA TCCGCAACGC TGATCTGGCG
CTTTATGCCG CCAAGGATGG CGGGCGAGGT CGCTTCCATT TCTACGCCCC CGACCTCCAC
AGCGATGCCG AAGAGCGTCG TCAGATGGAG GAAGACTTGC GCGACGCGGT CTCGAATGGA
GGGCTCGAAT TATACTACCA GCCCGTCGTC CGGACCGCGA CCGAGAAGAT CACCGGGTTC
GAGGCCCTGC TGCGCTGGAA TCATCCGGAG CATGGATGGA TCAGTCCTGC TCGCTTCATC
CCGATTGCCG AGGATACCGG CCTCATCGCC ACCATCGGCG AGTGGGCTTT GCGCACTGCA
TGCCAGGATC TCGCACGCTG GCCGGAGAAC GTCCGGGTGG CGGTCAACGT GTCTCCGCTT
CAGTTCGCCA ACCCCTCGTT GCCGGTCGTC GTCACCAATG CCCTAGCCAC GGCCCAGGTG
GCGGCCGAGC GGCTTGAGCT GGAAATCACC GAGAGCGTGT TCCTCAACGA CGACGAGGGC
ACCGACCAGA TGTTCAAGTC GTTGAAGGCG ATCGGCGTGC GGCTAGCCCT GGACGATTTC
GGAACCGGCT ATTCGTCACT TGGCTATCTT CGCTCGGCGC CCTTCGACAA GATCAAGATC
GACCAATCCT TCGTTCGCGG TGCAACGCAG CCGGGCAGCC GAAACGGCGC GATCATCGCG
TCCATCGTCA GCCTTGCCGA AGCGCTGGGT ATGGAAACGA CCGCCGAGGG CGTGGAAACG
CTCGACGAAC TCGATCTCGT GCGCATGCTC GGGTGCAGCC ACGTCCAGGG GTATATCTAC
GAACGACCGC TATCCGCGAT GAACGCCGCT GCGCGGCTCT CCACAGGATT GATGGCCATC
GCGCAAGGCC CGCGTTCCGC GCGTGCCGCC CGCCAGTCCA TGCTGCGCAA GGTCATCCTC
GACCATGGCG GTCACCGCTA TGATGCCATG GTGCGTAACA TCAGCCAGAC CGGCGCACTC
ATCGAAGGGC TGTGGAATGT GCCGGCCGGA ACGATCTTCC GCATCCTGAT TGCCGACAAT
CACGTCGTGA CCGGGACCTG TCGCTGGTCG GCCGATGATC GGATGGGCGT TGAATTCTCG
GTGCCGCTAC GCCTGGATGA GAACGGGCGG ATCGCAGCCG TTGCCGCCCC CGTCGGGGTG
CGGTTCGCGA TTGCAGAACA AGAGGTCGTT GCGTCGCGCA AGGTTGGCTA A
 
Protein sequence
MSAEQFPEVT STAGRARHFA RTRRRERSAQ PAFREAGATF LFGGDPSNRL VAPEWPKMAI 
PMALALAGLA LFNLSIPTIL PALLALLALV ATHFSSIAVK IESERKLETG QRLALMGLAV
ALPMFLFGSA MGFWGESGQV AWFGCLAVVV TTGVLATVVQ SGRLVAVIAA QVGIWSGVTC
VASRAGGYFA LVLGVAIAIA AYLRQRKIDL HARKKSESDQ RAQTRAEEIL ADYEETGQGW
FWETDRRGHL AYLSSPVAAI LGRRVEELVG RPFSELFNLT EQSGEGERTL AFHLSARSSF
SELAVRAATR DEERWWSITG RPTYDTFNNF AGFRGSGTDL TEKRRSQEHA SRLAHFDSLT
GLSNRFRMSQ TLEKILLAPQ VQHRACAVFL LDLDRFKQVN DTLGHPAGDA LLKQVAQRLE
RVVGKIGRVG RLGGDEFEVI LPGKMERGQL GHMAGRIIES LSQPYSIEGS RVMIGASVGI
ALAPDDGVTS EALIRNADLA LYAAKDGGRG RFHFYAPDLH SDAEERRQME EDLRDAVSNG
GLELYYQPVV RTATEKITGF EALLRWNHPE HGWISPARFI PIAEDTGLIA TIGEWALRTA
CQDLARWPEN VRVAVNVSPL QFANPSLPVV VTNALATAQV AAERLELEIT ESVFLNDDEG
TDQMFKSLKA IGVRLALDDF GTGYSSLGYL RSAPFDKIKI DQSFVRGATQ PGSRNGAIIA
SIVSLAEALG METTAEGVET LDELDLVRML GCSHVQGYIY ERPLSAMNAA ARLSTGLMAI
AQGPRSARAA RQSMLRKVIL DHGGHRYDAM VRNISQTGAL IEGLWNVPAG TIFRILIADN
HVVTGTCRWS ADDRMGVEFS VPLRLDENGR IAAVAAPVGV RFAIAEQEVV ASRKVG