Gene Rleg_1998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1998 
Symbol 
ID8013034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1995298 
End bp1996473 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content57% 
IMG OID644824585 
Productdiguanylate cyclase 
Protein accessionYP_002975817 
Protein GI241204721 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.462389 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0959773 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCTTG CGACAGTCCT TCTTCTACAC AAGTCTTCCT TCATCGTCGG AGCGATCTGC 
TTCTTCTATG TGAGATGGCG TTCCGGAGCG ACGCCGGGGC TCGGCGTACT TGCCGTGGGT
TTTACTTTGC TCGCGATCGC CTCAACGCTG GCAGGGTGGG ACGAGAGCCT TCACATGTCC
GATAATGCCA GGACGTTCTG GAGTTTTTCA CTGGGCGCCA ATGGTTACGG GCTGATGGCG
GTTGGTCTTT TCGGCCTCAG TCGGCGTCAG AATTCTCTAC GAGACTGGTG GCCGTTGCTC
CTGCCCGTCG TCCTGATGCT GAGTGCGGCG ATCACGCCAT GGTATTTGAA CAATGGATCG
AGAGCATCCG TGTTCAATGG CAACGCCACC ATCCTGCTTG CCTTGTCAGG TTTCGTGATC
GCCCGCGACT TCTTCCACGA GCGTCTTACC GCGCGGCTCG GCCTCTCCGC TTCAATTTGG
GTGGCGACGT CTCTCTCGGC TTTGGTCGTC GTTGGTTTCA TCTTTCCAGA CGATGCGCCT
CTTCCCCCGC GCTACGCTTT CTTTCTGCTG ATCATCTGTC ATTTCGCAGT GGCTCTGTTC
GTGCTCGTCC TCGTGCAGGA AAGAGCCGAA GAAAAGCTTA TCAGGCTTGC AAATACCGAT
ATGCTGACCG GCATTCCCAA CCGGCAGCAT TTCTTCAATT CCCTTCCGAA AAGCCTCGGC
TCCGGAGACG CCTTCATCCT CATCGATATC GACTTTTTCA AACGTGTTAA CGATATGCAT
GGACACGACA AAGGCGATGT CGTTCTCATA AATGTTGCTC GGACGATTGC TCTGAGTGTC
CCCCCTTCCT GTGTGTTCGG CCGGCTGGGC GGCGAGGAGT TCAGCCTCTT TTTTCGCGGC
CAAACGGCAG CCTCTGCTTT TGCGCTCGCT GAGCGGATAC GCGAGGCTGT CAACGCCGTG
AGCCTTGTCT TGGAAGGAAA TCAGGTCACC CCTTCCGTCA GTGCAGGTGT GGCTCTTTGG
GAAGCGGGCC TGACCGAACA AGACATTCAG AAGCGTGCAG ATCAGGCGCT CTACATTGCC
AAAAACAAGG GTCGCAACAG GGTTGAGTTG TTCAGCAGCG TCGGGTTGAC CTCCAGCGTC
CTTGCGCCCG ACCCGCTACC GGCTCGCGCG GGCTGA
 
Protein sequence
MDLATVLLLH KSSFIVGAIC FFYVRWRSGA TPGLGVLAVG FTLLAIASTL AGWDESLHMS 
DNARTFWSFS LGANGYGLMA VGLFGLSRRQ NSLRDWWPLL LPVVLMLSAA ITPWYLNNGS
RASVFNGNAT ILLALSGFVI ARDFFHERLT ARLGLSASIW VATSLSALVV VGFIFPDDAP
LPPRYAFFLL IICHFAVALF VLVLVQERAE EKLIRLANTD MLTGIPNRQH FFNSLPKSLG
SGDAFILIDI DFFKRVNDMH GHDKGDVVLI NVARTIALSV PPSCVFGRLG GEEFSLFFRG
QTAASAFALA ERIREAVNAV SLVLEGNQVT PSVSAGVALW EAGLTEQDIQ KRADQALYIA
KNKGRNRVEL FSSVGLTSSV LAPDPLPARA G