Gene Sbal223_3687 details

Gene Information

Locus tag: Sbal223_3687
Symbol:
ID: 7089621
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 4378430
End bp: 4380649
Gene Length: 2220 bp
Protein Length: 739 aa
Translation table: 11
GC content: 49%
IMG OID: 643462567
Product: diguanylate cyclase/phosphodiesterase
Protein accession: YP_002359588
Protein GI: 217974837
COG category: [T] Signal transduction mechanisms
COG ID: [COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
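
The start, end, gene length, and protein length listed above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length_bp = gene_length(4378430, 4380649)
print(length_bp)                  # 2220
print(protein_length(length_bp))  # 739
```

Both results match the listed Gene Length (2220 bp) and Protein Length (739 aa).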


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAGTAG GTTTAGCAAA TCCACTCCAA CAGGCCTTGG TGAGTATTTT TAGTGAGTCT 
CCGCTAGAAA CCTTAGAGCA AAATGTTGAC CGAGCACTTG AGACCATCAC ATTGCAGCTG
CATTGTGACG GAGTATTTGT ATTGACCGGC AGCCAGTCTC TGGATCGTTT ACGCACGCGT
AATTTGTATC TAAAACCCCA ATTTGCCAAA GGCCAACAGA CTCGAGTCTG GCCTTTAGCA
CGCATGCCAT TCTTTCGCTC CCTTGTCCGC ACCCCGCGAC TATTAAATCT GCCCGATGTG
AATACCTTGC CCGCCGATGC GCAGGCCGAG CGTGCCCTAT TAAGGGACTG GGACGTGAAG
AGTCTATTGG TATTGCCACC TGTGGTATTT GGCGAAACGC GTATTGCGCT GGGCGCAGTC
AATTGCTCCG AGTGTTGCGA ATGGAGCGAA GAGTTTATAA GTGAGTTTAA CCACGCCGCT
GTGATGATTG GCTCGGCGAT GGAACTGACG CGTATCGCCC TGAGCATGCT CGCCAGTGAG
CATAAATACT GTGAAGTCTT CAATCAGTTA CCCTTAGCCT GCGCCTTGTT AGATAAATAT
AATCAATTGA CTATGCTCAA TAAAGTCGCC CTGCAAACGC TGCCAATCCA ACATGGTTAT
GATTTATTCG ACATGGTGCG TGAGGAAGAA CATGCCATGT TAACTGATAC CTTGCATGTG
GTACGTGAAG GTGTGCTTGG TCAGGCTTGG TGTGAATTAC CGTTGAAATC TATCCATCAA
CTGGCTTGGT TGAAACTCAG TTTTAGCCAG ATCAGTGGCG ATAAAGATAC CTTAGTCATG
ATTGCCGAAG ATGTCAGCGA GAAGTATCGT CTAGCCGATG AGCTGTCGTT CCATGCCAAT
TATGATGCGC TCACAGGATT GCCCAATCGG TTACATTTCG AAGCCCTATT AGAAAACCTG
CTGCACGCCC ACGACGATAT GCCTATTTGC GTGGCGTTTC TCGATTTAGA CCAATTCCAA
GTCATCAACA ATATCAGTGG CCATCAGGCG GGCGATAAGC TGCTTTGCCA AGTGGCATTA
CGCTTAAAAC AGCTCGTGCG TAAGGGCGAT ATCGTCGCCC GTTTAGGTGG CGATGAGTTT
GGTATCTTGA TGCATTACTG CAATGTGGAC TCGGCGAAAC AAATCGCAAA ACGTATCTGT
ACCCAATTGG CGAACCATGA ATTTATTTGG GAAGGCCGCA GTCACAATGT CAGTGTCAGC
ATGGGGATTG CTAAACTCGA TAAAAAAGCC GCCGACATTT ATACCGTGAT GAGTCAGGCT
GATGCTGCTT GTCGCTTGGC GAAGGATCAA GGCCGCAATG GCTGGCATTT ATACAGTGCG
AGCGATCCTA AAATGAACCG TCTCTATACC GAGATGATGG CGTCGGTGGA CATAGTCGGC
GCACTGGCGT TAAACCAATT TGAGCTTTAT TTTCAAAGCA TAGTGCCATT AAATCGCGAG
GAGTCTGGTC TGCATTTAGA GATCTTACTG CGTATGGTGC AGGCCAACGG CACTATCGTG
TCTCCTGCCA TTTTCTTGCC CGCCGCCGAG CGATATAACT TAGCTTCTAA GGTCGACTTA
TGGGTGATCG ATAACTTGCT CAAGTGGGGC GGTTGCCATT TAGATATCTG GCAGCAATTG
GATCTCGTGT CGGTGAATTT GTCGGCGACC TCTTTGGGTG ACTTTGAGTT TATGAACTGG
CTAGAAATGC GTTTGATGGC CGAACCTGAG CTGGTGGACA AGCTTTGCAT CGAGATCACT
GAAACTGCGG CCGTGAGTCA GCTCGATCAA GCGACAAAGT TACTCGATAT ATTGCGTCCG
CTCAATTGTA AGTTAGCCCT CGATGACTTT GGGGCTGGCT TTTCTAGCTT TGCCTACCTT
AAGCGCCTTA ATGTGGACTT TGTGAAGGTG GATGGTCAGT TTGTAGTGAA CATCTGCGAA
GACAGTGCGG ATCAGGCGAT CGTTAAATCG ATTTGCCAAC TCGGCCAAGA CATGGGCTTT
GATGTGGTTG CCGAATTTGT CGAATCCCAA GATATTGGCC GGAAACTGCA AACCCTTGGC
GTCGACTATG CCCAAGGTTA CGCCATCAAT AAACCGATAC GGTTAGCTGA ATTACAGTCT
GGACTCAGTC AGCCTTGGCT CGAAAAACGT GAGACCTTTG CGGCCTATCC ACAACTCTAG
 
Protein sequence
MLVGLANPLQ QALVSIFSES PLETLEQNVD RALETITLQL HCDGVFVLTG SQSLDRLRTR 
NLYLKPQFAK GQQTRVWPLA RMPFFRSLVR TPRLLNLPDV NTLPADAQAE RALLRDWDVK
SLLVLPPVVF GETRIALGAV NCSECCEWSE EFISEFNHAA VMIGSAMELT RIALSMLASE
HKYCEVFNQL PLACALLDKY NQLTMLNKVA LQTLPIQHGY DLFDMVREEE HAMLTDTLHV
VREGVLGQAW CELPLKSIHQ LAWLKLSFSQ ISGDKDTLVM IAEDVSEKYR LADELSFHAN
YDALTGLPNR LHFEALLENL LHAHDDMPIC VAFLDLDQFQ VINNISGHQA GDKLLCQVAL
RLKQLVRKGD IVARLGGDEF GILMHYCNVD SAKQIAKRIC TQLANHEFIW EGRSHNVSVS
MGIAKLDKKA ADIYTVMSQA DAACRLAKDQ GRNGWHLYSA SDPKMNRLYT EMMASVDIVG
ALALNQFELY FQSIVPLNRE ESGLHLEILL RMVQANGTIV SPAIFLPAAE RYNLASKVDL
WVIDNLLKWG GCHLDIWQQL DLVSVNLSAT SLGDFEFMNW LEMRLMAEPE LVDKLCIEIT
ETAAVSQLDQ ATKLLDILRP LNCKLALDDF GAGFSSFAYL KRLNVDFVKV DGQFVVNICE
DSADQAIVKS ICQLGQDMGF DVVAEFVESQ DIGRKLQTLG VDYAQGYAIN KPIRLAELQS
GLSQPWLEKR ETFAAYPQL
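
The gene and protein sequences above are printed in space-separated blocks of 10 residues. The sketch below shows one way to strip that formatting, translate the DNA, and compute GC content, using the first line of the gene sequence as sample input. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code. The tables differ only in which codons may initiate translation, and that does not matter here because this gene starts with ATG.

```python
BASES = "TCAG"
# Amino-acid assignments in TCAG codon order (same for tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGTTAGTAG GTTTAGCAAA TCCACTCCAA CAGGCCTTGG TGAGTATTTT TAGTGAGTCT"
dna = clean(first_line)
print(len(dna))        # 60
print(translate(dna))  # MLVGLANPLQQALVSIFSES
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence through `clean` and `gc_percent` would allow the listed GC content to be checked in the same way.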