Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_0199 |
Symbol | |
ID | 8011429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 202580 |
End bp | 205489 |
Gene Length | 2910 bp |
Protein Length | 969 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 644822792 |
Product | protein of unknown function DUF1156 |
Protein accession | YP_002974049 |
Protein GI | 241202953 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.676396 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.41687 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCTA CTGTCAAAAC CCCGAAGAAG CTGATCGAGG TCGCGCTGCC ATTGGATGCG ATCAACGAAG CATGCGCGCA CGAAAAGCAG CCGGGTATAG GAGCGCATCC ACGAGGGTTG CATCTTTGGT GGGCACGACG CCCGCTGGCA GCAGCGCGCG CCGTGATCTT CGCGCAAATG GTCAACGACC CTTCATGGAA GTGGGAGCTT GAACGCCCTG GCGACATACC GCCGAACAAC ATTAAGGCAA GCTGGGCGGC CAGTCGCAAC CGCCTGTTCG CAATCATCAA AGAAATGGTC AAATGGGAGA ACTCGACCAA CGAGGCCGTC CTGCAAAAAG CCCGCGCCGA GATTCTTAGA TCATGGCGTG AGACCTGCGA TCTCAATAAG GACCATCCAC GAGCGGCGCA ACTGTTCGAT CCTGAAAGGC TGCCCGCCTT TCATGACCCC TTCGCTGGCG GGGGAGCATT ACCCCTGGAA GCGCAGCGTC TCGGTCTGGA ATCCTATGCG TCCGACCTCA ATCCAGTCGC CGTCCTCATC AACAAGGCGA TGATCGAGAT TCCAACGAAA TTTGCTGGTC GGCCGCCGGT TAGCCCGGTT GCCCGCGACA GTCAGGATGC TTGGAGTAGA CAATGGGCGG GTGTGCGGGG CCTCGCTGAG GACGTACGCC ACTACGGACA GTGGATACGG GATCAGGCGC AGAAGCGTAT CGGCAACCTA TATCCGCCTG TCGAAATCAC AGCCGACATG GCAAGAGAAC GGCCTGATGT AATGCCGCTT GTCGGACAGC GACTTAATGT TTTGACCACG ATTTGGGCGC GGACGGTCAA AAGCCCCAAC CCAGCCTTTC GCCATGTAAA TGTGCCGCTG GTCTCGACCT TCATATTGTC AAGCAAGGCG GGCAAAGAAG CCTATGTGGA ACCCATTGTC AGCGGTGATA CCTACCGCCT TACGGTAAAA GTCGGAAAAG CCCCAAAGGA CTCGGACGAA GGGACGAAGT TTTCGCGTGG CAACTTCCGA TGCCTACTCT CACAGGCACC TATTAGTGCT GATTACATCA GAAGCGAGGC CAAGGCTGGA CGTATGGGCG CCCGGCTATT GGCTGTTATC GCTGAAGGCC GAAACGGTCG CATATACCTC CCCCCAACTT CCGAGCAAGA AGATGCGGCA AACAAGGCCC AACCATTGTG GAAGCCTGAA CTGGAGTTCT TTCAGCAGGC CCTTGGATTC CGTATCGGTA ACTATGGAAT GACGGCCTGG AGTGATCTTT TCACTGCGCG CCAACTTGTT GCCCTCACCA CCTTTAGCGA TCTCGTATCC GATGTGATCG AAGTGATTAG ACGGGATGCC ATCAGTGCCG GCGTGGATGA CGATGGAATT CCGCTTAACG ATGGCGGCAA TTCAGCCTTA GCCTACGCGC AGGCTGTGGG CGTCTATTTA GCGTTCGCTA TAAGTCGTCT TGCAGACTAC GGAAGTTCAA TCGCTACCTG GAAGCCATCC GGCGAGCAGG TCATGCAGAC CTATAAGCGT CAAGCTCTTC CAATGACGTG GGACTTTCCT GATTCAAATC TTCTCGGAGA TAAGGCGATA TGTTGGACTA ATGCAGTAAA ATACGCTGCG GATAATTTAT TGTCTACGGC TGCAGCTTCA ACCCAAGCCG AAGGATTTGC GATCCAAAGC GATGCTCAAC AACAAACAAT AAGCCAAAAT AAAGTCGTAT CTACCGATCC CCCATACTAC GACAATATTG GATATGCTGA TCTATCAGAT TTCTTCTATG TTTGGCTGCG AAAAACACTG AAGCCAGTTT ATCCAGAACT TTTTGCAACT GTCGCCGTCC CCAAAGCGGA GGAGTTGGTC GCTACCCCCG CTCGCCATGG CGGCAGGGAG GGGGCGGAGG AGTTCTTTCT CCACGGTATG ACGCAGGCCA TGCAACGCCT AGCAACTCAG GCACACCCGT CATTTCCGGT CACAATTTAC TACGCTTTCA AGCAGTCCGA GACGCAAAAC GACACGGGCA CGTCTAGTAC GGGTTGGGAA ACTTTCTTGG ATGCGGTGAT CCGATCCGGG CTCGCTCTTA CCGGCACATG GCCGATGCGC ACCGAGCTGG GCAATCGGAT GCGCGGGCAG GAATCCAATG CGCTTGCGTC GAGCATTGTT CTGGTTTGTC GTCCGCGTTC GGCTACGGCG GATACCATTT CCCGCCGTGT GTTCCAACGG GAGTTGAACC AGGTTCTGCC CGAGGCGCTG GACGAGATGA CACGCGGCTC CGGAGAAGAC CGTTCCCCCG TCGCGCCGGT TGATCTCTCT CAAGCCATTA TCGGCCCCGG CATGGCGGTG TTCTCGAAAT ATGCTGCTGT CCTGGAGGCG GACGGCACTC CAATGACTGT GCAAACGGCG TTGCGGCTTA TCAATCGCTT CCTCGCCGAG GATGACTTCG ATCACGACTC CCAATTTTGC TTGCATTGGT TCGAGCAATA CGGCTGGAAG GAAGGCCGGT TCGGCGAGGC GGATACGCTC GCACGCGCCA AAGGTACGAG TGTTGACGGT GTGAAGCAGT CGGGCGTGCT GTTAGCCATG GGTGGCATTG TGCGGCTATT GAAGTGGGCT GAGTACCCTG CCGAATGGGA CCCAACGAAC GACGCACGCT TGCCCGTGTG GGAAGCCCTG CATCATCTGA TCCGCGTGTT CAAGACTGAC GGCGAAAGCG GCGCCGGCAA AGTGCTTGCG GCCATCGCGG CTAAGGCCGA GCCGACGCGT CAGCTTGCCT ATCGCCTCTA CACGCTTTGC GAGCGAGCAG GCTGGGCGGA GGATGCCCGT GCTTATAACG AAATCATAAC AAGCTGGGGC GCCATCGAGT CCGGCGCCGC AATGGCACCG AAGGCGCGTC AAAGCGACTT GTTTGGTTAA
|
Protein sequence | MTATVKTPKK LIEVALPLDA INEACAHEKQ PGIGAHPRGL HLWWARRPLA AARAVIFAQM VNDPSWKWEL ERPGDIPPNN IKASWAASRN RLFAIIKEMV KWENSTNEAV LQKARAEILR SWRETCDLNK DHPRAAQLFD PERLPAFHDP FAGGGALPLE AQRLGLESYA SDLNPVAVLI NKAMIEIPTK FAGRPPVSPV ARDSQDAWSR QWAGVRGLAE DVRHYGQWIR DQAQKRIGNL YPPVEITADM ARERPDVMPL VGQRLNVLTT IWARTVKSPN PAFRHVNVPL VSTFILSSKA GKEAYVEPIV SGDTYRLTVK VGKAPKDSDE GTKFSRGNFR CLLSQAPISA DYIRSEAKAG RMGARLLAVI AEGRNGRIYL PPTSEQEDAA NKAQPLWKPE LEFFQQALGF RIGNYGMTAW SDLFTARQLV ALTTFSDLVS DVIEVIRRDA ISAGVDDDGI PLNDGGNSAL AYAQAVGVYL AFAISRLADY GSSIATWKPS GEQVMQTYKR QALPMTWDFP DSNLLGDKAI CWTNAVKYAA DNLLSTAAAS TQAEGFAIQS DAQQQTISQN KVVSTDPPYY DNIGYADLSD FFYVWLRKTL KPVYPELFAT VAVPKAEELV ATPARHGGRE GAEEFFLHGM TQAMQRLATQ AHPSFPVTIY YAFKQSETQN DTGTSSTGWE TFLDAVIRSG LALTGTWPMR TELGNRMRGQ ESNALASSIV LVCRPRSATA DTISRRVFQR ELNQVLPEAL DEMTRGSGED RSPVAPVDLS QAIIGPGMAV FSKYAAVLEA DGTPMTVQTA LRLINRFLAE DDFDHDSQFC LHWFEQYGWK EGRFGEADTL ARAKGTSVDG VKQSGVLLAM GGIVRLLKWA EYPAEWDPTN DARLPVWEAL HHLIRVFKTD GESGAGKVLA AIAAKAEPTR QLAYRLYTLC ERAGWAEDAR AYNEIITSWG AIESGAAMAP KARQSDLFG
|
| |