Gene Rleg_0198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0198 
Symbol 
ID8011428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp199729 
End bp202566 
Gene Length2838 bp 
Protein Length945 aa 
Translation table11 
GC content61% 
IMG OID644822791 
Producthypothetical protein 
Protein accessionYP_002974048 
Protein GI241202952 
COG category[R] General function prediction only 
COG ID[COG1483] Predicted ATPase (AAA+ superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.220084 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCTT GGAGAGAAGT GGCTGTACCC CATCGGGACG TGCTTGAAGG CACTTTCCAA 
CAGTCAGAGT TCGCCGCCGA CATCACAGCG GTCAACACTG GCAAGGCCAG CCGCGAGTAT
CAGGATGCCG GCGCGTTTTT CGACCGCACC TTTATCACCG AGGGCATGGC GCTTCTATTA
ACGCAGGTTG CGCAGCGCCT CACAGGACGG GGCGGTGAGC CGGTCGTGCA GCTTCAGACC
GCCTTTGGCG GCGGCAAGAC GCACACCATG TTGGCCGTCT ATCACCTTGC CACCCGCAAA
TGTGCCCTGT CCGACCTGGC TGGTATCTCT GCGCTTGTCG ACCGGGCCGG CTTGATGGAT
GTGCCGCAGG CCCGCGTCGC AGTGCTGGAC GGCGTTGCGC ACGCGCCGGG CCAGCCATGG
AAGCGCGGGA GCCAGACAAT CAAGACCCTG TGGGGCGAGA TGGCGTGGCA GCTTGGCGGG
GCGGAAGCCT TCGCGCTTCT CGCCGAGGCT GACGTCACTG GCACATCGCC CGGCAAGGAC
GTGCTACGCG ATCTTCTGGA ACGTCACTCT CCATGCGTAG TGCTGATCGA TGAGTTGGTG
GCCTATATCC GTCAATTCCC AGAATCACAG CCGATCAGTG GGGGTAGCTA CGATTCGAAC
CTCTCCTTCG TGCAGGCTCT CACCGAAGCC GCGAAGCTGG TGCCGCGTGC CATCGTGCTC
GCATCGTTGC CGGAATCCGA TCTTGAAGCG GGGAGTCAGC GCGGCGCCGC GGCTCTGCGA
GCACTCGAAA AGACCTTTGG GCGTGTACAA GCGCTTTGGA AGCCAGTGGC GACCGAGGAA
GCCTTCGAGA TCGTGCGCCG CCGGCTATTC GAGCCGGTAC GCGACGAGAA GACCCGCGAA
GGCGTTTGCC GCGCCTTCGC CGACGCCTAC ATCGCCGAGG GTGTGAAGCT TCCTGCCGAT
ACACAAGAAC GACACTACTA CGACCGACTG CTGCACGCTT ATCCGATCCA CCCTGAGGTT
TTTGATCGCC TCTTTGAGGA CTGGACGACC ATTGACGGTT TTCAGCGCAC GCGGGGCGTC
CTGAAGCTCA TGGCGAAGGT CATCTTCCGG CTGTGGAAGG ACGACAACAA AGACCTGCTC
ATCATGCCTG GTAGCCTGCC GCTTTATGAT GGCAGCAGCC GCAACGAGCT GACCTATTAT
CTGCCCGCAG GATGGGACGC TGTGATCGAG CGCGACATCG ACGGCGACCG CGCCGAGACG
ACCGCGCTTG AGAACAAGGA GCCGCGCTTC GGTCAAGTGG GTGCGGCTCG GCGCATTGCG
CGCACGGTCT TCCTCGGCAG CGCCCCGTCG TCGGTCGCGT CCAAGGTCGT CGCTCGCGGC
ATTGACCGCG CCCATATCAT TCTCGGCTGC CTTCAGCCGG GACAGGCGGC GTCCGTCTAT
GCCGATGCGC TCGGCCGGCT GGCCGACCGG CTGCACTATC TCAATTCCTC GGGCGACAAG
AGTCATGACG CGACGCGCTT CTGGTTCGAC ACCCGCGCAA ATCTTCGGCG GGAAATGGAG
GATCGGAAGC GCCGGTTCGA CGACCGCACG GAGGTGCGCG GCAAGATCGC AGGAGCGTTG
AAACAGACCG TTGGCAGCCT CACTTATTTC GACGGCGTGC ATATCTTCGC GCCGCATGGC
GACGTGCCGG ATGACACCGC CCTGCGCTTG CTCGTGCTGC CGCCGGAAAC TTGGTACGCC
CGCGACGAAA ATCGCCTCGC CTTTGAAGCG GTGCTGGAGA CAATTGGCAA AAACGGCCCC
AAGCCGCGAT ACCGCAGCAA CCGACTCCTT TTCCTTGCAC CAGATCATGC TGCGCTTTCC
CGTCTTATGG ACGCAACACG CGTCGCGCTC GCATGGGGTT CGATTGTCGA GGACGTGAAA
GAGGGCCGCC TGAACATCGA CCTCTTGCAG AAAAATCAGG CCGAGAAAGA ACTGAAGAGT
GCAGAGGACG CGCTGCCGCG CGTGGTCCGC GAATGCTACA AATGGCTACT CTGCCCGATG
CAGGATGCAG CAACCGATCC GAAGCCCGGA ATTGAGGCAT TCGCCCTAAA TACTGCCGGT
GGCTCGATCG CCGCGGACAT CGAGCGAGTT TGCATCGACA ACGAACTGGT TATCACCACT
TGGTCGCCGA TCCACCTGCG CACCAAGCTT AAAGAGCTTT ACTGGAAGGG CGGGAAGCGA
GCGGCAAACG CGGCCGGGTT CTTCGAGGAC ACCCTGCGCT ATCTCTACAT GCCGCGCCTC
AAGACGCGGG ATGTTCTGTC GCAGGCGATC CAGGCTGGGG TAGCAGGCAA GGACTTTTTT
GGCACCGCCT ATGGGGAAGC AGATGGCAAG TTTGAGGGCT TTTACTTCGG CGGTGGCACT
GTCATCTTTG ATGACACATT GCTTTTGATC GAGCCTCAAG CAGCTCAAGC CTATGAGGAA
GCAAACCGGG AGGCACAACC TGCCGCTACT CCGCCTGTTT CCACTGCCAC GGCGGCAGGC
GGCGTGGCCG AGGCGCCGAA TGTCTATGTT TTCAATGGCG GGAGCACATC GCCGCCGGTG
GCAATCACAC CTACGTCAGG CCCTACGAAG CCGAAAACCT TTTACGGTTC CGCCGAGGTT
CCGCCCGCGA CCGCAAAGAT GCGCCTTGTG CAGATCGCCG AGGAAATCGT GTCGGTGCTC
ACATCCGATC CAAATGCGAC CGTCCGCCTT GTCGTGGAAA TTTCGGCCGA GTTCCCAGAT
GGAGCAGGCG ATGGCTTGAA ACGTGCAGTC TCGGAAAACG CCCGCAGCCT TGGCCTGAAA
TCGGCGGATT GGGATTAA
 
Protein sequence
MKPWREVAVP HRDVLEGTFQ QSEFAADITA VNTGKASREY QDAGAFFDRT FITEGMALLL 
TQVAQRLTGR GGEPVVQLQT AFGGGKTHTM LAVYHLATRK CALSDLAGIS ALVDRAGLMD
VPQARVAVLD GVAHAPGQPW KRGSQTIKTL WGEMAWQLGG AEAFALLAEA DVTGTSPGKD
VLRDLLERHS PCVVLIDELV AYIRQFPESQ PISGGSYDSN LSFVQALTEA AKLVPRAIVL
ASLPESDLEA GSQRGAAALR ALEKTFGRVQ ALWKPVATEE AFEIVRRRLF EPVRDEKTRE
GVCRAFADAY IAEGVKLPAD TQERHYYDRL LHAYPIHPEV FDRLFEDWTT IDGFQRTRGV
LKLMAKVIFR LWKDDNKDLL IMPGSLPLYD GSSRNELTYY LPAGWDAVIE RDIDGDRAET
TALENKEPRF GQVGAARRIA RTVFLGSAPS SVASKVVARG IDRAHIILGC LQPGQAASVY
ADALGRLADR LHYLNSSGDK SHDATRFWFD TRANLRREME DRKRRFDDRT EVRGKIAGAL
KQTVGSLTYF DGVHIFAPHG DVPDDTALRL LVLPPETWYA RDENRLAFEA VLETIGKNGP
KPRYRSNRLL FLAPDHAALS RLMDATRVAL AWGSIVEDVK EGRLNIDLLQ KNQAEKELKS
AEDALPRVVR ECYKWLLCPM QDAATDPKPG IEAFALNTAG GSIAADIERV CIDNELVITT
WSPIHLRTKL KELYWKGGKR AANAAGFFED TLRYLYMPRL KTRDVLSQAI QAGVAGKDFF
GTAYGEADGK FEGFYFGGGT VIFDDTLLLI EPQAAQAYEE ANREAQPAAT PPVSTATAAG
GVAEAPNVYV FNGGSTSPPV AITPTSGPTK PKTFYGSAEV PPATAKMRLV QIAEEIVSVL
TSDPNATVRL VVEISAEFPD GAGDGLKRAV SENARSLGLK SADWD