Gene Rleg_4234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4234 
Symbol 
ID8015017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4329249 
End bp4330868 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content62% 
IMG OID644826804 
Productdiguanylate cyclase 
Protein accessionYP_002978013 
Protein GI241206917 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.256126 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTCG AAACCATTAA AAAGGTTATC CTGGCCGCCA TCATGCTGCC CTTCGTCTTC 
ATGCTCATTG AAGGCGTCGT CGTCATACGT TCTTCGGTCA ATCATTACTG GAACCTCGAA
AAGGACCGGC AGTTCGCCGA TGTGCTTGCC CGTGGCGGGT CGATCGCCGC CACGGAGATT
CTGACTGAAA TTGGCGCCAC CCGCCGCTAC CTCGCAGATC CCAGCGACAT GACTGCTATC
GACATGCAGC AAAGCCGGGT GACGCTCGAT CGCGAACGCC GCGCTTTTTA TGCCAGCCTG
CCCTCCCGCG AGGCGCTCGA CGAGGGGCTC GTCGGCGAAT TATCGATTCT CAGCCTTGCC
TACAGCCGTA TCGTCGCGGC GCGCAGCGCC GTCGACCAGG GCCGTTATGC CGGCAGCGAT
CCCGGTTCCA TTTACTGGTA TGCGGCTCTC AAGCAGCTTG CCGTCGTCGA TGCGCTTTCA
CCGCTGATCA GCGATCCGGT GCTGCTTGAG AAATCCAACC AGCTGATGGG CATCCTGCTG
ACCTATTACG GCGAAAGGCT GATCACCGGG ATCGGCACCC GTTATCTCAA CCAGGGGGTT
TCCGCCAGAT TGCCGGTCGA GCTCTTCGTG CAGGGCAAGG TCATGCTCGG CGAGGGCATG
GATCACATGG TTTTCCATTC CTCCGCGCCG GTCGTGCGCA ACATCGTCGC TTATCTTGGC
AGCGCCAGCC AGGTAAAGGC GAATGCGATC ACCGATGCCA TTCTCGCCGG AGCGCGGCCG
ACACGCGCGG TGCATGACGT CTGGGCCGCC GCGCAGAGCG AGCGCATGAG CTTCCTGCAG
CAGAGGATGA TCGAGGCCGC ACAGGATATT CACGAGACCG GCGAAAACCT GTCGACGCGC
TCGCACATAC ACCTGACGCG GATCCTGGCG CTGTGTGCCG GCCTGCTGAT TCTCGCCACA
TTGGTGCTGC TGCTGGCGGC AAAGGGCCTT CGCCTGATCG ACCGACTGAC CCAGGATCGG
GAGACGCTGG TCGGCGAGCT GCGCAGCGCC GCCCAGACCG ATCTTCTGAC CGGGCTTTAC
AACAGGCGCG GCTTCGAGGT CGCCGCATCC GCACTTCTCA CACAGGCCGA GCACGGATCA
CGCTGGATTT CCGTCGTGCT CTTCGACCTC GATCATTTCA AGAAGATCAA CGACGTTCAC
GGGCATGACG CCGGCGATGC TGTGCTCCGG CATGTCGCGG GCGTCGCGCG TAAGAATTTT
CGTTCCTTCG ATCTGCTGGT GCGCCATGGC GGCGAGGAGT TCCTGGCGCT TCTGCCGGAT
TCGACGCCTG ACGATGCTGC AATCGTTGCC GAGCGTGTGC GGCTGGCGAT CGAGGCGGCG
GAAATCCCCC TGCCGAGCGG CGATGTTCTC AAGGTGACGG CAAGTTTCGG ATGCGCCGGA
CGGGCAAATG AAGCCACCAA CCGGAACTTC GAGGATCTGG TCAAACGCGC CGACCTGGCG
CTTTACGCCG CCAAGGCCTC CGGCCGCAAC TGCGTCGTCT CGGGACCGAC CCTGCCGGCC
CCGGCCCAGG AGGAGCGACG CAAGGCGGTG TCGGGCGGTG GCTTTGATTC CCGCATATGA
 
Protein sequence
MKFETIKKVI LAAIMLPFVF MLIEGVVVIR SSVNHYWNLE KDRQFADVLA RGGSIAATEI 
LTEIGATRRY LADPSDMTAI DMQQSRVTLD RERRAFYASL PSREALDEGL VGELSILSLA
YSRIVAARSA VDQGRYAGSD PGSIYWYAAL KQLAVVDALS PLISDPVLLE KSNQLMGILL
TYYGERLITG IGTRYLNQGV SARLPVELFV QGKVMLGEGM DHMVFHSSAP VVRNIVAYLG
SASQVKANAI TDAILAGARP TRAVHDVWAA AQSERMSFLQ QRMIEAAQDI HETGENLSTR
SHIHLTRILA LCAGLLILAT LVLLLAAKGL RLIDRLTQDR ETLVGELRSA AQTDLLTGLY
NRRGFEVAAS ALLTQAEHGS RWISVVLFDL DHFKKINDVH GHDAGDAVLR HVAGVARKNF
RSFDLLVRHG GEEFLALLPD STPDDAAIVA ERVRLAIEAA EIPLPSGDVL KVTASFGCAG
RANEATNRNF EDLVKRADLA LYAAKASGRN CVVSGPTLPA PAQEERRKAV SGGGFDSRI