Gene Rleg2_1101 details


Gene Information

Locus tag: Rleg2_1101
Symbol:
ID: 6979820
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 1123302
End bp: 1124363
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 61%
IMG OID: 643395813
Product: putative adenylate/guanylate cyclase
Protein accession: YP_002280621
Protein GI: 209548704
COG category: [T] Signal transduction mechanisms
COG ID: [COG2114] Adenylate cyclase, family 3 (some proteins contain HAMP domain)
TIGRFAM ID:
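
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1123302
END_BP = 1124363
GENE_LENGTH_BP = 1062
PROTEIN_LENGTH_AA = 353


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS, minus one for the stop codon (assumed included)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


coords_ok = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_ok = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print(coords_ok, protein_ok)  # prints: True True
```

Under these assumptions the record is internally consistent: 1124363 - 1123302 + 1 = 1062 bp, and 1062 / 3 = 354 codons, of which one is the stop codon, leaving 353 amino acids.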


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.969933
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.21412
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGGAAA TATCTCCGAC GCAGAACTGG ATCCTGATCA CGATGGTTCT GGCTGCCAGC 
GGCGTCGTCT ATGATCTGAT GTTCTACTCC AATCAGACGC CCGTTGTCGG CGCGATCTTC
GCGCTGTTCA TCGGCATGCC GATCATCGCC TTCGAGCGCA AGGCGCTGTT CCGCACGCTG
TACAGGCGTA TCCAGAAGCT GCCGACCTTC GCTTTCATCA TCAGCGAGCT GGTGATCTAC
GAGATCCTGA TGAGCATCGG CTTTGCCTGC GCCGCGCTGC TGCTCTCGTC GCTCGGCATG
GTGAAGCCAA CATCGTTCCT CGATCTCGTC ATCATGCCCT ACGAGGTCTT CCTCTATGCG
CTTGCCGTCT GCTCGGCGCT GATCTTCATC CTGCGCGTGC GGGAGCTGCT CGGCCGCGAG
GTATTCGTCA GCATGCTGGT CAGCCGCTAC CGCAATCCAG TCAGGGAAGA GCGTGTCTTC
CTGTTCATCG ACCTGGTCGA CTCGACGGCT TTTGCCGAAA AGCACGGCGA CCTTCGTGCG
CAGCAGCTGC TGAGCTCGCT GTTTGCGACC TTCGCCGAGC CCGTCAGGCG CCATAAGGGC
ATGATCAACG ACTATGTCGG CGATGCGGCG ATCATCACCT GGCCGCTTGC CCGCGGCATC
AAGGGCGCGC GCTGTGTGCG CTGCATCTTC GACATCCTCG CCGATATCGA AGCCAACGCC
GCCGGCTGGC GGAAAAGCTA CGGACAGGTG CCGAAGCTGC GCGCCGCCCT TCACGGCGGC
GAGATCATCA CCGCCGAAAT TGGCGTCGAT CATCACAAGA TCAGCTATTT CGGCGACACG
GTGAACACCA CCGCCCGGCT GGAAACGCTC TGCCGCAGCC TCAATCGGCC AGTGCTGATT
TCGGCCGACC TTGCGCAGCG CATGAAATTT CCCGACGATA TATCCTGCGA GGATCTCGGC
ACCCATGCCG TCAGGGGGCG CGGCCAGGCG CTCGGCGTCA TGGCGCTTTC CTCACGCGCG
GTGACTGTGC TGAACACGCC TGCCGTCATT CTGCACGGCT GA
 
Protein sequence
MREISPTQNW ILITMVLAAS GVVYDLMFYS NQTPVVGAIF ALFIGMPIIA FERKALFRTL 
YRRIQKLPTF AFIISELVIY EILMSIGFAC AALLLSSLGM VKPTSFLDLV IMPYEVFLYA
LAVCSALIFI LRVRELLGRE VFVSMLVSRY RNPVREERVF LFIDLVDSTA FAEKHGDLRA
QQLLSSLFAT FAEPVRRHKG MINDYVGDAA IITWPLARGI KGARCVRCIF DILADIEANA
AGWRKSYGQV PKLRAALHGG EIITAEIGVD HHKISYFGDT VNTTARLETL CRSLNRPVLI
SADLAQRMKF PDDISCEDLG THAVRGRGQA LGVMALSSRA VTVLNTPAVI LHG
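
The gene and protein sequences above can be checked against each other, and against the reported GC content, with a short script. This is a minimal sketch rather than the database's own method: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares for internal codons (table 11 differs in its set of permitted start codons, which is not handled here because this gene starts with ATG).

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino acid assignments for non-initiator codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the gene sequence, as displayed in the record.
print(translate("ATGCGGGAAA TATCTCCGAC GCAGAACTGG"))  # prints: MREISPTQNW
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence, or passing it to `gc_content` and comparing with the reported 61%, gives a quick consistency check for the record.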