Gene Rleg_0199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0199 
Symbol 
ID8011429 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp202580 
End bp205489 
Gene Length2910 bp 
Protein Length969 aa 
Translation table11 
GC content56% 
IMG OID644822792 
Productprotein of unknown function DUF1156 
Protein accessionYP_002974049 
Protein GI241202953 
COG category[L] Replication, recombination and repair 
COG ID[COG1743] Adenine-specific DNA methylase containing a Zn-ribbon 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.676396 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.41687 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCTA CTGTCAAAAC CCCGAAGAAG CTGATCGAGG TCGCGCTGCC ATTGGATGCG 
ATCAACGAAG CATGCGCGCA CGAAAAGCAG CCGGGTATAG GAGCGCATCC ACGAGGGTTG
CATCTTTGGT GGGCACGACG CCCGCTGGCA GCAGCGCGCG CCGTGATCTT CGCGCAAATG
GTCAACGACC CTTCATGGAA GTGGGAGCTT GAACGCCCTG GCGACATACC GCCGAACAAC
ATTAAGGCAA GCTGGGCGGC CAGTCGCAAC CGCCTGTTCG CAATCATCAA AGAAATGGTC
AAATGGGAGA ACTCGACCAA CGAGGCCGTC CTGCAAAAAG CCCGCGCCGA GATTCTTAGA
TCATGGCGTG AGACCTGCGA TCTCAATAAG GACCATCCAC GAGCGGCGCA ACTGTTCGAT
CCTGAAAGGC TGCCCGCCTT TCATGACCCC TTCGCTGGCG GGGGAGCATT ACCCCTGGAA
GCGCAGCGTC TCGGTCTGGA ATCCTATGCG TCCGACCTCA ATCCAGTCGC CGTCCTCATC
AACAAGGCGA TGATCGAGAT TCCAACGAAA TTTGCTGGTC GGCCGCCGGT TAGCCCGGTT
GCCCGCGACA GTCAGGATGC TTGGAGTAGA CAATGGGCGG GTGTGCGGGG CCTCGCTGAG
GACGTACGCC ACTACGGACA GTGGATACGG GATCAGGCGC AGAAGCGTAT CGGCAACCTA
TATCCGCCTG TCGAAATCAC AGCCGACATG GCAAGAGAAC GGCCTGATGT AATGCCGCTT
GTCGGACAGC GACTTAATGT TTTGACCACG ATTTGGGCGC GGACGGTCAA AAGCCCCAAC
CCAGCCTTTC GCCATGTAAA TGTGCCGCTG GTCTCGACCT TCATATTGTC AAGCAAGGCG
GGCAAAGAAG CCTATGTGGA ACCCATTGTC AGCGGTGATA CCTACCGCCT TACGGTAAAA
GTCGGAAAAG CCCCAAAGGA CTCGGACGAA GGGACGAAGT TTTCGCGTGG CAACTTCCGA
TGCCTACTCT CACAGGCACC TATTAGTGCT GATTACATCA GAAGCGAGGC CAAGGCTGGA
CGTATGGGCG CCCGGCTATT GGCTGTTATC GCTGAAGGCC GAAACGGTCG CATATACCTC
CCCCCAACTT CCGAGCAAGA AGATGCGGCA AACAAGGCCC AACCATTGTG GAAGCCTGAA
CTGGAGTTCT TTCAGCAGGC CCTTGGATTC CGTATCGGTA ACTATGGAAT GACGGCCTGG
AGTGATCTTT TCACTGCGCG CCAACTTGTT GCCCTCACCA CCTTTAGCGA TCTCGTATCC
GATGTGATCG AAGTGATTAG ACGGGATGCC ATCAGTGCCG GCGTGGATGA CGATGGAATT
CCGCTTAACG ATGGCGGCAA TTCAGCCTTA GCCTACGCGC AGGCTGTGGG CGTCTATTTA
GCGTTCGCTA TAAGTCGTCT TGCAGACTAC GGAAGTTCAA TCGCTACCTG GAAGCCATCC
GGCGAGCAGG TCATGCAGAC CTATAAGCGT CAAGCTCTTC CAATGACGTG GGACTTTCCT
GATTCAAATC TTCTCGGAGA TAAGGCGATA TGTTGGACTA ATGCAGTAAA ATACGCTGCG
GATAATTTAT TGTCTACGGC TGCAGCTTCA ACCCAAGCCG AAGGATTTGC GATCCAAAGC
GATGCTCAAC AACAAACAAT AAGCCAAAAT AAAGTCGTAT CTACCGATCC CCCATACTAC
GACAATATTG GATATGCTGA TCTATCAGAT TTCTTCTATG TTTGGCTGCG AAAAACACTG
AAGCCAGTTT ATCCAGAACT TTTTGCAACT GTCGCCGTCC CCAAAGCGGA GGAGTTGGTC
GCTACCCCCG CTCGCCATGG CGGCAGGGAG GGGGCGGAGG AGTTCTTTCT CCACGGTATG
ACGCAGGCCA TGCAACGCCT AGCAACTCAG GCACACCCGT CATTTCCGGT CACAATTTAC
TACGCTTTCA AGCAGTCCGA GACGCAAAAC GACACGGGCA CGTCTAGTAC GGGTTGGGAA
ACTTTCTTGG ATGCGGTGAT CCGATCCGGG CTCGCTCTTA CCGGCACATG GCCGATGCGC
ACCGAGCTGG GCAATCGGAT GCGCGGGCAG GAATCCAATG CGCTTGCGTC GAGCATTGTT
CTGGTTTGTC GTCCGCGTTC GGCTACGGCG GATACCATTT CCCGCCGTGT GTTCCAACGG
GAGTTGAACC AGGTTCTGCC CGAGGCGCTG GACGAGATGA CACGCGGCTC CGGAGAAGAC
CGTTCCCCCG TCGCGCCGGT TGATCTCTCT CAAGCCATTA TCGGCCCCGG CATGGCGGTG
TTCTCGAAAT ATGCTGCTGT CCTGGAGGCG GACGGCACTC CAATGACTGT GCAAACGGCG
TTGCGGCTTA TCAATCGCTT CCTCGCCGAG GATGACTTCG ATCACGACTC CCAATTTTGC
TTGCATTGGT TCGAGCAATA CGGCTGGAAG GAAGGCCGGT TCGGCGAGGC GGATACGCTC
GCACGCGCCA AAGGTACGAG TGTTGACGGT GTGAAGCAGT CGGGCGTGCT GTTAGCCATG
GGTGGCATTG TGCGGCTATT GAAGTGGGCT GAGTACCCTG CCGAATGGGA CCCAACGAAC
GACGCACGCT TGCCCGTGTG GGAAGCCCTG CATCATCTGA TCCGCGTGTT CAAGACTGAC
GGCGAAAGCG GCGCCGGCAA AGTGCTTGCG GCCATCGCGG CTAAGGCCGA GCCGACGCGT
CAGCTTGCCT ATCGCCTCTA CACGCTTTGC GAGCGAGCAG GCTGGGCGGA GGATGCCCGT
GCTTATAACG AAATCATAAC AAGCTGGGGC GCCATCGAGT CCGGCGCCGC AATGGCACCG
AAGGCGCGTC AAAGCGACTT GTTTGGTTAA
 
Protein sequence
MTATVKTPKK LIEVALPLDA INEACAHEKQ PGIGAHPRGL HLWWARRPLA AARAVIFAQM 
VNDPSWKWEL ERPGDIPPNN IKASWAASRN RLFAIIKEMV KWENSTNEAV LQKARAEILR
SWRETCDLNK DHPRAAQLFD PERLPAFHDP FAGGGALPLE AQRLGLESYA SDLNPVAVLI
NKAMIEIPTK FAGRPPVSPV ARDSQDAWSR QWAGVRGLAE DVRHYGQWIR DQAQKRIGNL
YPPVEITADM ARERPDVMPL VGQRLNVLTT IWARTVKSPN PAFRHVNVPL VSTFILSSKA
GKEAYVEPIV SGDTYRLTVK VGKAPKDSDE GTKFSRGNFR CLLSQAPISA DYIRSEAKAG
RMGARLLAVI AEGRNGRIYL PPTSEQEDAA NKAQPLWKPE LEFFQQALGF RIGNYGMTAW
SDLFTARQLV ALTTFSDLVS DVIEVIRRDA ISAGVDDDGI PLNDGGNSAL AYAQAVGVYL
AFAISRLADY GSSIATWKPS GEQVMQTYKR QALPMTWDFP DSNLLGDKAI CWTNAVKYAA
DNLLSTAAAS TQAEGFAIQS DAQQQTISQN KVVSTDPPYY DNIGYADLSD FFYVWLRKTL
KPVYPELFAT VAVPKAEELV ATPARHGGRE GAEEFFLHGM TQAMQRLATQ AHPSFPVTIY
YAFKQSETQN DTGTSSTGWE TFLDAVIRSG LALTGTWPMR TELGNRMRGQ ESNALASSIV
LVCRPRSATA DTISRRVFQR ELNQVLPEAL DEMTRGSGED RSPVAPVDLS QAIIGPGMAV
FSKYAAVLEA DGTPMTVQTA LRLINRFLAE DDFDHDSQFC LHWFEQYGWK EGRFGEADTL
ARAKGTSVDG VKQSGVLLAM GGIVRLLKWA EYPAEWDPTN DARLPVWEAL HHLIRVFKTD
GESGAGKVLA AIAAKAEPTR QLAYRLYTLC ERAGWAEDAR AYNEIITSWG AIESGAAMAP
KARQSDLFG