Gene Smed_1471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1471 
Symbol 
ID5322329 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1554179 
End bp1555375 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content62% 
IMG OID640790419 
Productdiguanylate cyclase 
Protein accessionYP_001327151 
Protein GI150396684 
COG category[T] Signal transduction mechanisms 
COG ID[COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0238873 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGGTG CAATTTCACT TCTGGCAGTG AATTTCGTCA TTGCCCAGAT ATTCGTGGCG 
GCCTTTCTCG TCATTGCGGT AAAAAGCCGG CACGGCCGGG CCGCCGCCTG GTGCGCCGCT
GGATTCGCCA TAGCCTCTCT GGCTGCGATT TGCGAAGCCG TCCTGCCGTT TACACCGGTG
CCCAAATTTT TTGCCGTCGG GGCCTTTGCA AGTGTCCTTG CCGGTTTCTG CCTGCTGCGT
TTTGGCCTGG GACTCTTTTA TGGGGTGCCG GCAAAACCAG CGGCGCTTGC GGCTTTCCTG
ATCGTTTCGG TGGTAGTCGA TCTGTTGATT TACGACCTTC CGCGCGGAAC GCTGCAGCAC
GCCTTCTTCT ATCAAATGCC CTTCTTCCTC ATTCAGGCCT GGACCGCGGC CGCGATCATG
CGTTCCCCGC GCCGTTCCTA TGCAGACAGG ATTCTCGTTG GACTCCTGGT GCTGAATGCT
CTTCATTTCC TCGGGAAGGT TTATGCCGCG ATAGCCGCGG GGGCGGGCTC CACGGCATCC
GACTATCTTC AGAGCCCATT CGCCTTGATC TCGCAGGCGC TCGGCGCCGC GTTGATCGTC
GGGACTGGTG TGGCGATCCT CGGCGTCATG GTCAAGGATA TCGTCGACGC GGCCCGCGCC
AGCTCCGAGA TCGACTCCCT CTCGGGCCTC TGGAACCGGC GCGGCTTCAT CGAGCGCGTC
GCGCCCTGGT TGCTGAGCCG CAACGCCAAG AGCCCCGGCG CGTTGATCCT TACGGACCTC
GACCGGTTCA AGGGCGTCAA CGACACCTAT GGCCATCATA CCGGCGATGA GGTCATTCGG
CAGTTCGCGC GTGTGCTTCT CGACCTGATC CCCAGCCGCG CCGCGGCGGC ACGCCTGGGC
GGCGAGGAGT TTGCCGTTTT CCTCCCCGGC GTCGACCTGG CCGATGCACG AGTGGTTGCC
CAAGGAATGC GAGCCGCGAT CGCCTCGACG CCGATTGCCG ACCTTCCGGA GACGGCCACG
ATTACCGCAA GTTTCGGCGT TGCGGCGATA GCTCCGGGAG AGTCACTGGA AATGGCGCTG
CAAAGGGCGG ATAAGGCGCT TTATGTGGCC AAGGCCGCCG GGCGAAACCG TGTCGAATGT
GCCGAGCCGA CGGCCCGCGT CGTCACACCG GTTGTCTCCC AGTGGATGAG GCAATAG
 
Protein sequence
MGGAISLLAV NFVIAQIFVA AFLVIAVKSR HGRAAAWCAA GFAIASLAAI CEAVLPFTPV 
PKFFAVGAFA SVLAGFCLLR FGLGLFYGVP AKPAALAAFL IVSVVVDLLI YDLPRGTLQH
AFFYQMPFFL IQAWTAAAIM RSPRRSYADR ILVGLLVLNA LHFLGKVYAA IAAGAGSTAS
DYLQSPFALI SQALGAALIV GTGVAILGVM VKDIVDAARA SSEIDSLSGL WNRRGFIERV
APWLLSRNAK SPGALILTDL DRFKGVNDTY GHHTGDEVIR QFARVLLDLI PSRAAAARLG
GEEFAVFLPG VDLADARVVA QGMRAAIAST PIADLPETAT ITASFGVAAI APGESLEMAL
QRADKALYVA KAAGRNRVEC AEPTARVVTP VVSQWMRQ