Gene Sare_4025 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4025 
Symbol 
ID5706429 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4576389 
End bp4577939 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content73% 
IMG OID641273450 
Productdiguanylate cyclase 
Protein accessionYP_001538806 
Protein GI159039553 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000068225 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGGGTTGGC TGGACCGCGT CGGCGACCAG GTCGACACCC TGACACAGGC ACGCGGCCTG 
CAGGAGTCGA GCCGCTCCGC CGAGGCGTAC CGGATCCTCA CTGGCGTACT CACCACCACC
ACCGACCGGT ACGCGCGTGC CGACGCCCTG GTGCAACGCC TCTCGGCGTT GCTCAACCTG
GGCCGCACCG CGGAATACAC CCGGGCCATC GAGGAGGCCA CCACCGCGGT CCGAGACATC
GCCGAGCCGT ACCCGCACGG CCACCTCCAC GCGCTCGCCG CCCTCGCCGC CCACCACCAG
GGTGCGCTGG ACCGCTGCGT CATGCATCTG GTACGGGCCG CTCGTGCGCT GGGGGCCGTC
GAGGATACCG ACCGAGACAC CGCGTGGGGC TGGCACGACC TCGCAATGGC CTACTCCTAC
CTCAGCTTTC ACGGGTACGC GCTGGGCGCG ATCGAGCGAG CACGGCAGCT CGGGCTCGCC
GCCGGCATCC CGGAGGAGAC CTTCGCCGCC CCCGGCATCC GGCTACGCAA CGCCGTCGCG
CTGGACCACA CCGGCGACAG CGACGGCTGC CTGCGGGTGC TCCGCGACGT GGCCGGGGAC
CTGGGGCAGT TCCTGCGCGC CGGACGGGCC AGCCGACTGC GCCCGAGCAG CCTCGCCGCG
TACGGTTACG CCGCCGCGCG GCAGGCCGCC CTGGGCGACC GGTTGGCGGT GGGAACAGAC
GGTGCCCCGG CTCGACTGCT GAGCCACGGC GCCGACAGTG CCCGAGCTCG GGACATGCGC
CAACTCGGCG AGGTCTGCCT GGCCATCGCG GACGACCGTC CGATCGAGGC GGTCACCCGG
CTGGACACCG TACGGGTGTC CACCGAGACG CTGGGCGCGG CCGAGCCCGC CCGGCTACGC
AGCATCGCGC TGAGCCGGGC CGGGGAGCAC GCCGCCGCGC ACCGGGCCGA CCGGCGGGCG
TTCCGGCTCG CCGCGCAGCG CAACGATCGG CTCCGGGACG TCTACATCGA CGGGATCGCC
GCCCGGATCG ACCACGAGGA GATGCGTCGC GAGGCAGCCC GCTTCGAGGG CGAGGCACTC
ACCGATCCGC TGACCGGGCT ACCCAACCGG CGCCGGTTGG AGCGACACAT CGCCGCCGTG
ATGGCCCAGG GCGAACGGGT GGTGATCGGC GTGTGCGACC TGGACGGTTT CAAGGCGGTG
AACACACACC ACGGGCACCA CTCCGGTGAC CTGGTGCTGC AACGGGTCGC CGGCGTGGTC
AACCGGATGA TGCGGCGAAA CGACTTCGTG GCCCGCTACG GCGGCGACGA GTTCGTCGTG
GTGCTGCTCG GCACCGGCAT CGACGAGGCG GACGAGGTGG CACGCCGGAT CGAGTCCGCC
ATTCGGACCG AGGACTGGGA ATCCCTCGTA CCCGGCACCC CCGTCGGAGT CAGCATCGGC
TTCGCCGAGG TGGCTGCCAC CGGGCCCGAC GTTCAGGACG CCCTGAGCAC CGCCTTCGAG
GTCGCCGACC GGGAGATGCT CCGCGCGAAG ACCCGTCCCC GCGCGTCCTG A
 
Protein sequence
MGWLDRVGDQ VDTLTQARGL QESSRSAEAY RILTGVLTTT TDRYARADAL VQRLSALLNL 
GRTAEYTRAI EEATTAVRDI AEPYPHGHLH ALAALAAHHQ GALDRCVMHL VRAARALGAV
EDTDRDTAWG WHDLAMAYSY LSFHGYALGA IERARQLGLA AGIPEETFAA PGIRLRNAVA
LDHTGDSDGC LRVLRDVAGD LGQFLRAGRA SRLRPSSLAA YGYAAARQAA LGDRLAVGTD
GAPARLLSHG ADSARARDMR QLGEVCLAIA DDRPIEAVTR LDTVRVSTET LGAAEPARLR
SIALSRAGEH AAAHRADRRA FRLAAQRNDR LRDVYIDGIA ARIDHEEMRR EAARFEGEAL
TDPLTGLPNR RRLERHIAAV MAQGERVVIG VCDLDGFKAV NTHHGHHSGD LVLQRVAGVV
NRMMRRNDFV ARYGGDEFVV VLLGTGIDEA DEVARRIESA IRTEDWESLV PGTPVGVSIG
FAEVAATGPD VQDALSTAFE VADREMLRAK TRPRAS