Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_1786 |
Symbol | |
ID | 6980523 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 1829446 |
End bp | 1832691 |
Gene Length | 3246 bp |
Protein Length | 1081 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643396508 |
Product | adenylate/guanylate cyclase with TPR repeats |
Protein accession | YP_002281298 |
Protein GI | 209549381 |
COG category | [R] General function prediction only |
COG ID | [COG3899] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.145933 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTGCG CCGGCTGCGG TTTCGATATT CAGAGTGGTT TTGCCTTCTG CCCGCGATGC GGCGCCAGGC AGCCCCTCTC TTGTGCCGCC TGCGGCTGTG CTTGTCAGCC GGATTTTGCC TTCTGCCCGA GGTGCGGCGC GGCGATCTCA GGCCAAGTCC CGCCTGCGCC GAAGCCAAGC ATCGAGGTGG TGCCGCATAA ATCAGACAGC GATGCCGACC GGCGGCCGGT CACGGTGCTC TTTGCCGATC TCTGCGGCTT CACCACGCTG AGCGAGCAGA TCGACCCTGA AGTCATGCGG GTGCTGCAGA ACGAATTGTT CGAAGAGATG ACCCAGGCGG TCGAGGCTTA TGGCGGCTTC GTCGACAAGT TCGTCGGCGA TGCGCTGCTG GCGCTGTTCG GCGCGCCGGT TGCCCATGAA GACGACCCGG TCCGGGCGCT CGGTGCCGCG CTCGACATGA TCCGTCGGGC GACTGACGTC GGCGACCGCT GGCGCTCACG CGCCGGCGTA CCGCTGCGTC TGCATATCGG CATCAATAGC GGACCCGTCG TCACCGGCGG CTTCGGCGCG GTCAGCACCA AATCCTATTC GGTGACCGGC GACACGGTGA ACACCGCCCA GCGGCTGCAG TCGATGGCCG GCGAAAACGA TATCCTCGTC GGGCCGCTGA CCTATCGCCT CACCCGCCAT GCCTTCGCCT TCGACAGTCT TGGCGCGCAG ACGCTGCGCG GCAAAAGCGG AAATGTCATC GTCCATCGGC TGGCGGGGCT GCTGGAAGCG CCGCATACGG CGCGCGGGCT CGAAAGTTTC GGCCTTCAGG CGCCTATGAT CGGACGCGAT GGCGAGTTGT CGCGGCTGCT GACCTGCCTC GATCTCGCCT GCGGCGGCGC GGCGCAACTC GTCCGCCTGA TCGGCGAAGC CGGTATCGGC AAATCCCGGC TGGTCAACGA ATTCGTCGGC ACCGCCGGCG ATGCGGACCG TTTCCCTGGC CTTGCCATCC GCAAAGCCAC CTGCTCTCCC CTCGGCGAAC AATCCTATGG CACGCTCGCC GCGGTCGTGC GCAGCGCTTA CGGCATCGGC GAGCGGGATG ATCTCGCCAA AACACGGCAG CTGCTGACAA CCGGATTCCG CGCTCTCGAT CTCACCCAAG AGGATGTCGA GGGGCTTCTG CCGCTCTTTC TGCATGTCCT CGGTCTCGGC GATCCCGATG GGGCATTACG GCACATCGAG CCGGAACAGC TGCGGCGACA AATCTTCTAT GCCGTGCGTA CGGTCTTCGA GCGGCGGCTG GCGCAAGGCC CTCTGCTGCT CGTCATCGAG GATCTGCACT GGGCGGACGT TGCCTCGCTG GAAGTGCTCC GCTTCATGAT GGATCGGCTG GAGCGCAGCC GCCTGATGCT GCTGGCGATC TACCGGCCGA CATCACAGAC CGACCCGCTG AACTCCGATC GCGTCAATGT CACCGTGCAA CGTCTCGACC CGCTCTTTGC AGCCGATGGG CAGAAGCTGC TTGCTGCCTT TTTTGGCGAG AGCCATAGCA AGCTGCCGGT TGCCATGCGC AAGCGCATCC TGGATCGGGC CGGCGGCAAT CCGCTTTTCA TCGAAGAAAT CCTGCGCGGA TTGATCGACA TGGGCACGCT GCACCATGAT GGTCATCGCT GGCATGTCGC AGCCGACGAC AGCGATGTCG ATATCCCGGT CAATCTGCAG GCGCTGCTGC TTGCCCGCGT CGATCGCCTG CCGCAAGAGA TGAGACGCTT GGCGCAGGAG GCGGCGGTCG TCGGCCCAAA GTTCGATACC GCCCTGCTCC GCACCGTCGC ATCCGACCCG GCCGGCGTCG ACGCGGCGCT GGATTTTCTC TGCGATGCCA ATATCATCGA GGAATTGCGC GGCCCGGATG CCGCCGGGTC ACCTGGCTAT CGCTTCAGCC AGACGCTGAT GCATGACGTC ATCTACCACA ATCTGCTGCT GCAGCGACGC ATGGAGCTGC ATGGCAGGAT CGGCCGGGTT CTGGAACGCC AATATGGCGA GGCGCCCGAC CGGGCCGAGC ATCTGACGCA GCTCGGCCAC CACTTCAGCC TGACCACCGA GAAGGCCAAG GGCGCCGGCT ACCTGATGGC GGCGGGCGAT CTGGCCCGCA AGACCTACGC CAATGACGAT GCCATGCGGC TCTACCATCA GGCCCTTGCC GCCTTCGCCA ACGAGCCCGA GGTGTCGCCG GAACAATTGG CGCTTATGGA GCGTCTTGGC GATCTCTGCG GCCCCGCCGG CCGGCGCGAT GCAGCCTTGA CCCATTATCA GCGGGCGCTT GCGATCCATC GCACCAAGGA CGATCGGATC GCCGCAGCAC GGATCCTGCG AAAGACCGGC CGTCTGCATC TCGACGCGGG GCGCCGCGAC CAGGCGGAGA CCCATTGCGC TGCGGCCGAA GCGTTGATCG CGACGATCGA TGCGCCGGTC GAACATGCGC ACCTTTTGCA GGAGCGCGGC CATCTGGCCT TCCGCATGGG CGATCAGGCC GCTGCCGCCG AATGGGCGAC GCAGGCGCTG CAACGCCTGC AAACGCTGCC GGTCGACGGA ACGACCGAGG CCGGACGGGA GGCAGCGCGG GCGATGGCGG AGGCATTGAA CACCAAGGGT GCGGCGCTGG CGCGGCTCGG GCGCCGCCGC GATGCGGTGC AGGAAGTCGA GCGCAGCCTA TCGGTCGCTG AAAAGGCCGA TCTGCAAAGT GCGGCGTGCC GCGCCTATTC CAATCTCGGT GTTCTCTACA CCATCGTCGA TCCGGCCAAC GCCATCAGGA TTTGCCGGCG CGGACTGGAG GTCGCCACCC GTATCGGCGA TCTCGGCTTC CAGGCGCGGC TTCTCGCCAA TCTCGCGGTT TCCTGCTGCA CCTTTACCGA TCGCTGCGCG GCCGAGGGTG TTCCGGCGGC GGAGAAGGCC GTCGAAATCG ATCGGGCGCT CGATCAGCGC GACCACCTCT CCGTGCCGCT GATCGTCCTC GGGCAGATTC ATCAATGCCA CGGGCAGCCG AAGCTAGCTC GTAAGTATTA CGAGGAAGCC CTTGAGGTTG CGAAGGAAAT TGACGAGCCG CAGCTGCTCT TTCCGTGCTA TGATGGCTTG GCGACACTGA GCCTCGAACA TGACGATATG GACGAGGCGG AACGGTACTT CACGCTGGCG CAGGATGTGT GCATCCGCCA CAACCTCGAT CCAGGCACAC TCGTCGTTTT GCCGTTTCTC GACTGA
|
Protein sequence | MECAGCGFDI QSGFAFCPRC GARQPLSCAA CGCACQPDFA FCPRCGAAIS GQVPPAPKPS IEVVPHKSDS DADRRPVTVL FADLCGFTTL SEQIDPEVMR VLQNELFEEM TQAVEAYGGF VDKFVGDALL ALFGAPVAHE DDPVRALGAA LDMIRRATDV GDRWRSRAGV PLRLHIGINS GPVVTGGFGA VSTKSYSVTG DTVNTAQRLQ SMAGENDILV GPLTYRLTRH AFAFDSLGAQ TLRGKSGNVI VHRLAGLLEA PHTARGLESF GLQAPMIGRD GELSRLLTCL DLACGGAAQL VRLIGEAGIG KSRLVNEFVG TAGDADRFPG LAIRKATCSP LGEQSYGTLA AVVRSAYGIG ERDDLAKTRQ LLTTGFRALD LTQEDVEGLL PLFLHVLGLG DPDGALRHIE PEQLRRQIFY AVRTVFERRL AQGPLLLVIE DLHWADVASL EVLRFMMDRL ERSRLMLLAI YRPTSQTDPL NSDRVNVTVQ RLDPLFAADG QKLLAAFFGE SHSKLPVAMR KRILDRAGGN PLFIEEILRG LIDMGTLHHD GHRWHVAADD SDVDIPVNLQ ALLLARVDRL PQEMRRLAQE AAVVGPKFDT ALLRTVASDP AGVDAALDFL CDANIIEELR GPDAAGSPGY RFSQTLMHDV IYHNLLLQRR MELHGRIGRV LERQYGEAPD RAEHLTQLGH HFSLTTEKAK GAGYLMAAGD LARKTYANDD AMRLYHQALA AFANEPEVSP EQLALMERLG DLCGPAGRRD AALTHYQRAL AIHRTKDDRI AAARILRKTG RLHLDAGRRD QAETHCAAAE ALIATIDAPV EHAHLLQERG HLAFRMGDQA AAAEWATQAL QRLQTLPVDG TTEAGREAAR AMAEALNTKG AALARLGRRR DAVQEVERSL SVAEKADLQS AACRAYSNLG VLYTIVDPAN AIRICRRGLE VATRIGDLGF QARLLANLAV SCCTFTDRCA AEGVPAAEKA VEIDRALDQR DHLSVPLIVL GQIHQCHGQP KLARKYYEEA LEVAKEIDEP QLLFPCYDGL ATLSLEHDDM DEAERYFTLA QDVCIRHNLD PGTLVVLPFL D
|
| |