Gene Sala_2109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2109 
Symbol 
ID4080084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2216760 
End bp2218472 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content66% 
IMG OID638010485 
Productdiguanylate cyclase/phosphodiesterase 
Protein accessionYP_617151 
Protein GI103487590 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain
[COG2200] FOG: EAL domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0707223 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGAAAG GTCGCACAAA ACCCGCTGTT ATGCCTATTG ATCGAACCGG CGAAGACCGG 
CCCTTGCTGT TTGTCGGCTG GATCGACGCG CTGCCCGTTC CGGCGGCGCT GGTCAGGCCG
ATGGCGCGCG GCAATTTCCG GCTCCACGCG AGCAATGCCG CCTTCGACCG ACTGAAGCTG
TCGCCGGTCG GCGCCGACGC GCCGATCGAG CTTCTGCGCG CAGTCGAACG CGCATCGCAG
CATCCCGACG AGGTGCAGGA GTTTTCGTGC CAGCTCGGCG AGGGACCGGC GGCGCGCGAC
CTGCGCGGAC ATATCGGCCC GCTGCCCACC GAAACGGATG ACGACGGGCT GTTCCTGCTG
ACGCTGATCG ATCGCACCCA GGAAATGATG ACCGAGCGCA ATCTGCGCCG CGAGCTGGTG
TCCGACAGCC TGACCGGCCT GCCCAACCGC GCCGGGTTCG AGGAGCTGGT CGAACAGCGG
GCGCACCGCG ATGCGCCGGG CGCCGATCAC GCCATACTGC TGCTCGACCT CGCGCGCTTT
TCGCGCATCA ACGAACATAT CGGCCCGATG GCGGGCGACG AGCTGATCAT CACCGTCGCA
CGGCGGTTGA AATCGAGCCT GCGCAGCGGC GACATACTGG CGCGCACCGG CGGTGACGAA
TTCGCCATTT CGACACGAAT CGCGGGCGGG CGCGCCGATG TGCGCGAAAT GGCGCGGCGC
ATCCGCGGCT GTTTCGACCA TCCCTTTCGT ATCGGCGAAC TGAAGGTCAG CGTCGATTGC
GCGCTCGGCT GTTCGATCCA GCCGACCAGC GAGATCGACG TCGCCAACCA GATCCGCCAC
GCGCAGATCG CGCTCAAGCG GGCGAAGCAG ACCGACCGCA TCGAAATCTA TGAACCCGAA
GCGGCGATGC TGTCGGACAA TCGTTTCGGG CTCGAGACCG AACTGCGCAA CGCGATCGAG
GAAGACCGGC TGCATCTCGC CTTCCAGCCG CTCATCGAGA TGGCGAGCGG GCGCGTCGCG
GGGTTCGAGG CGCTCGCGCG CTGGGACAAT AGCAGCGGGG TCGCGGTGTC ACCGACCGAA
TTCATTCCGA TCGCCGAGGA TTCGGGGCTG ATCGTCCCGC TGGGCCAATG GGCGATCGGC
AAGGCCGCGG CGGTGCTCGC CGACTGGGAC CGGCAGAATG GCGGGCAGGT CGTCAACGCC
TATTTCTCGG TCAATGTCTC GGCGATCCAG CTGGTGCGTG ACGACGTCGC GGGGGTGGTT
CGCCAGGCGC TCCAAAAGCA TGGCATCGGC GGCGAGCGGC TGATGATCGA GCTGACCGAA
AGCGCGATCA TCGGCGATCC CGACCTGGCG CTGTCGGTGC TGCGCGAACT GAAGGCACTC
GACGCGCGCG TCGCGATGGA CGATTTCGGC ACCGGCTATT CGAACCTTGC CTATCTTCAG
CGGCTGCCGA TCGACGTGCT CAAGATCGAC CGCAGCTTTG TCGAACATAT GGTCGACGAC
CGCGACAAGG TGGCGATCGT CCGCACGATC CAGAGCCTCG CCGAAGTGCT GGGGATGAAG
ACCACCGCCG AGGGGGTCGA GACAACCGAT CAGGCGCGGC TGCTGTCGGC GCTCGGCTGC
GACTTCGGGC AGGGCTATCT GTTCGCGCGG CCGATGGATG GCAAGGCGGC GCTCGATTAC
TGGCGTCAGT CGCTGACGCG GCCGATCTTC TGA
 
Protein sequence
MGKGRTKPAV MPIDRTGEDR PLLFVGWIDA LPVPAALVRP MARGNFRLHA SNAAFDRLKL 
SPVGADAPIE LLRAVERASQ HPDEVQEFSC QLGEGPAARD LRGHIGPLPT ETDDDGLFLL
TLIDRTQEMM TERNLRRELV SDSLTGLPNR AGFEELVEQR AHRDAPGADH AILLLDLARF
SRINEHIGPM AGDELIITVA RRLKSSLRSG DILARTGGDE FAISTRIAGG RADVREMARR
IRGCFDHPFR IGELKVSVDC ALGCSIQPTS EIDVANQIRH AQIALKRAKQ TDRIEIYEPE
AAMLSDNRFG LETELRNAIE EDRLHLAFQP LIEMASGRVA GFEALARWDN SSGVAVSPTE
FIPIAEDSGL IVPLGQWAIG KAAAVLADWD RQNGGQVVNA YFSVNVSAIQ LVRDDVAGVV
RQALQKHGIG GERLMIELTE SAIIGDPDLA LSVLRELKAL DARVAMDDFG TGYSNLAYLQ
RLPIDVLKID RSFVEHMVDD RDKVAIVRTI QSLAEVLGMK TTAEGVETTD QARLLSALGC
DFGQGYLFAR PMDGKAALDY WRQSLTRPIF