Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6301 |
Symbol | |
ID | 8017031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012854 |
Strand | - |
Start bp | 16554 |
End bp | 17579 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 644828097 |
Product | DGQHR domain protein |
Protein accession | YP_002979297 |
Protein GI | 241554084 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03187] DGQHR domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.483477 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGAAT CAATGTGGCT CAACTGCTCG ACTGGCGTCT CGGTCGACAG GCCGGTTCTC CTAGGATTTG CACCGGCAAA ACTCCTGCAT CGATACAGCT TTGCCGACGT GTTGAACGAA GACACAGGTC TTGGCTATCA GCGTCGCTTC AACTCGCAGC ACAGCCAAGA TTTCCGTCGG TACATTCGAC AGACCGGCGC CTCGACGATC CCCCTCACTT TGAATCTGCG GCCTGACGAA AAGGGTTGGA AAGTCGAGAA TGTCGGACCT GGACAGGCTC GGTTAGAGAT CGAGCTGGAC GCCGGCAAGA TCATGGCGCA GGTCGATTGC CAACATAGGC TCGGTTGTCT TGAGGATCTC GACATCCAGC TGCCGTTTAT GTGTTACGTG GGGCTCAGTC TCAAAGAAGA GATGGAAGTC TTCAGTACCA TTAACAGCAA GGCAAAAGGC CTGAGCAACA GTCTGCTGGA CTTTCATGAT GCACACCTGG CTGGAGACCT GGCGAAAGAT CGCCCGGAAA TCTTTATCGC TCTTCATCTG AACAATGACC CGGATTCGCC TTGGTGCCGA CAGCTTGATC TCGGCGGAGA GAGCACTTCC GGGATGACCC GGCGTGCGTC GCTTCGGACG ATGCAAAAGG CCATAAAGCG ATTTCTCAAC TCCACCCGGT CGCTCAAGAC GCGCTCACCG GAAACCGTCA CGCAGATCGT CATGTCCTTT TGGCGTGCAG TTGCCGAGGT GCTTCCTGCC CAGTGGAGCA CGCCGCGCAA GCACATCCTT ACCAAGGGTG TTGGCGTATA CGCGTTAATG GACATCGCTG CCGATCTTTA CAACGAGGCC GAGGATGGGG CCAAGCTGGA CCGTGGCTAT TTCGTCAATC GCCTCGCTGA CTTTGCCTAT GATATCGACT GGTCAACGAC CGGCCGCCTG AAAGGACTTG GCGGCGAGGG TGGGGTCAAC GAGGCCGTCG AATATATCCG CGAAACCCGC AAGCGCTCTC ATTTGAAAGT TGTCAGCAAT GGCTAA
|
Protein sequence | MAESMWLNCS TGVSVDRPVL LGFAPAKLLH RYSFADVLNE DTGLGYQRRF NSQHSQDFRR YIRQTGASTI PLTLNLRPDE KGWKVENVGP GQARLEIELD AGKIMAQVDC QHRLGCLEDL DIQLPFMCYV GLSLKEEMEV FSTINSKAKG LSNSLLDFHD AHLAGDLAKD RPEIFIALHL NNDPDSPWCR QLDLGGESTS GMTRRASLRT MQKAIKRFLN STRSLKTRSP ETVTQIVMSF WRAVAEVLPA QWSTPRKHIL TKGVGVYALM DIAADLYNEA EDGAKLDRGY FVNRLADFAY DIDWSTTGRL KGLGGEGGVN EAVEYIRETR KRSHLKVVSN G
|
| |