Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_1790 |
Symbol | |
ID | 8012851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 1782599 |
End bp | 1784863 |
Gene Length | 2265 bp |
Protein Length | 754 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644824381 |
Product | ComEC/Rec2-related protein |
Protein accession | YP_002975614 |
Protein GI | 241204518 |
COG category | [R] General function prediction only |
COG ID | [COG0658] Predicted membrane metal-binding protein |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0814214 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00430403 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGCATGC TGGGCGAGGA GACGGACCAT GGCCGCCTGC TGCTGTTCTC GCCGGTCTTC CTCGGCGCCG GGTCAGCCAT CTGGTTTCTC GCGGCATCGG ATTTTCCGCT CACTGCATCG CTGCTCTGCC TGTTGGTGCT GACCGTCGCA GTTCTCATCG CCAGCCGCAA CCGGGCGGCA TTGCGGGCAG CGCTGCTGGC GCTTTCGCTC GTGGCTTGCG GCATAGTCTC GGCGCAGTTC GAGAGCTGGC GGGCTTCGAC CGTGATCCTC GATTCATCAG TGACGACGAC GGTGACCGGC CGCGTCGAGC GGCGCGAAGG CGATGGTCGC GGCCGGTGGC GCTACATTCT TGCCGTCACC GGCACTGAAG CGCCTCAGGT CAAGCGGCCG CCGGAACGCA TCACCGCCAT CGCCCGCGGT GCGGACGCAG CCTTCGAGAT CGGCGATATC ATCACCGGCA GGGCGCGGCT GACGCCGCCG GCGGGCCCAG CACTTCCCGA GCTCAACGAC TTCTCCTTCA GTGCCTATTT CGACGGCATC GGCGCCAACG GCTTCTTCTA CGGTGCACCG ACGAAGGTCG ATGCGCAGGC AGGTCCGCAG GCTGCAAGAT CGGCGGTGGA AGCGCTGCTC GAAGGGCTCT ACCGGCTGCG CAGCGGCATT GGCGACCGGA TAAGGTCGAT CCTGCCCGGC GATACCGGCG CTTTTGCCGC CGCACTGGTG ACGGACGAGC GGCGCGCCAT CTCGGATGCG ACGACGGAGG CACTACGCCA GTCCGGTCTG GCTCATATCA TCGCCATCTC CGGTCTCAAC ATGGCGCTGT CGGCCGGCAT CTTCTTCGTC GGCTTTCGGA TGCTGCTCAG CATCTTTGCC GGTATCGCTG AGGCCTACCC GACGAAGAAG ATCGCTGCCG CCGGCGCGCT CATCGCCGTT ACTGCCTATT TCCTGATTTC GGGCTTTGCG GTCTCTGCCG AGCGCGCTTT CATCATGATG GCCATCATGT TGATTGCCGT CTTCTTCGAC CGGCCGTCGA TCAGCCTGCG CAATGTCGCG CTTTCCGCCC TCGTCATCAT CCTCATTTCA CCGTCCGAGG TCTTAGGTCC GAGCTTCCAG ATGTCCTTTG CCGCGACATT GGCGCTGGTC TCGGGTTATC AGTTGTGGAA GGACCGGCGC GTCCGCGAAA ACGCCTTTCT GAAGCTACCT ATCATCAGGC CCGTCGTTAC GGTTGCCGGT TTCTTCGGCG GCGTCTTCCT GACTTCGCTG ATCGGCGGTT TTTCCACCGC GCTGTTTTCG ATCGAGCATT TTCATCGCCT GACCGCCTAC GGCCTGCCGG CAAACCTCGC GACGATGCCG ATCATCTCCT TCATCGTCAT GCCGGCCGGC TTGCTGGCGA TGCTGCTCAT GCCTTTCGGT CTGGATGTTT TGCCCTGGAA GGTCGTCGGA TTCGGCCTCG ATCTGGTGAT CGCGGTCGCA AAGACGGTGT CCGGCTGGGA TGGCAATATC GACGTCGGCC GCTTGCCCGC CTGGTATTTC GCGGTCGCCG TGGCAGGCTT TCTGCTGCTG ACGCTGCTCC GCACCCGGCT GCGCCATATC GGCACATCAA TCATCGCGGT CGCAACGCTC ATCCTGTTGC TTCTACCGGT TCCCCGGCCG CCGGATCTGG TGATTTCGGA GGATGGCAGT CTCGTCGCGG TGGTCGAAGC GGCAGCAATG GCTTCCAACC GTGAAAGACC ACCGGATTTC ATCTTCGACC AGTGGCAGAG AGCCCTGGTC CTGCCGAGAC ATGATCCGCC GAAGATGCTG GATGGTCCTG CTATCCCGCA GGTAGGAGAA GACCGCCGCG TCAAGCTTTC CCGCGATCAG CAGAACGAAG CGAGGACAGC GATGCGGGCG GCCGCAGCGG CCGGTGAAGC AAACCGCTTT TCCTGCGTCA AGAGGGCCTG GTGCGCATCA AGGCTTGGCA ATGGTCGTGT GGTTGCTGTC ATCGACAATG CCGCCTATCT CGGCCCGGCA TGCGACGCGG CCGATATCAT CGTGACGTCG GTCCGCCTGC GTTTCAACAG CTGCCGCTCA GGTGCGACGC TCTTCACCGG CGAGACGCTG CGCAGGACCG GATCCATCGA GTTGCGCTTC GCGGACGCCG GCCTGGAGGT GGCAACCGCA TTTGACGCAT TGTCGCGACC ATGGATGCGC CATCGCGCCT TTGACTGGCG CAGCAACAGC TTTACCGAAT CCGGCCTAAC CGATGTCAGT GATAGCGGCG AATGA
|
Protein sequence | MRMLGEETDH GRLLLFSPVF LGAGSAIWFL AASDFPLTAS LLCLLVLTVA VLIASRNRAA LRAALLALSL VACGIVSAQF ESWRASTVIL DSSVTTTVTG RVERREGDGR GRWRYILAVT GTEAPQVKRP PERITAIARG ADAAFEIGDI ITGRARLTPP AGPALPELND FSFSAYFDGI GANGFFYGAP TKVDAQAGPQ AARSAVEALL EGLYRLRSGI GDRIRSILPG DTGAFAAALV TDERRAISDA TTEALRQSGL AHIIAISGLN MALSAGIFFV GFRMLLSIFA GIAEAYPTKK IAAAGALIAV TAYFLISGFA VSAERAFIMM AIMLIAVFFD RPSISLRNVA LSALVIILIS PSEVLGPSFQ MSFAATLALV SGYQLWKDRR VRENAFLKLP IIRPVVTVAG FFGGVFLTSL IGGFSTALFS IEHFHRLTAY GLPANLATMP IISFIVMPAG LLAMLLMPFG LDVLPWKVVG FGLDLVIAVA KTVSGWDGNI DVGRLPAWYF AVAVAGFLLL TLLRTRLRHI GTSIIAVATL ILLLLPVPRP PDLVISEDGS LVAVVEAAAM ASNRERPPDF IFDQWQRALV LPRHDPPKML DGPAIPQVGE DRRVKLSRDQ QNEARTAMRA AAAAGEANRF SCVKRAWCAS RLGNGRVVAV IDNAAYLGPA CDAADIIVTS VRLRFNSCRS GATLFTGETL RRTGSIELRF ADAGLEVATA FDALSRPWMR HRAFDWRSNS FTESGLTDVS DSGE
|
| |