Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_0198 |
Symbol | |
ID | 8011428 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 199729 |
End bp | 202566 |
Gene Length | 2838 bp |
Protein Length | 945 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644822791 |
Product | hypothetical protein |
Protein accession | YP_002974048 |
Protein GI | 241202952 |
COG category | [R] General function prediction only |
COG ID | [COG1483] Predicted ATPase (AAA+ superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.220084 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCTT GGAGAGAAGT GGCTGTACCC CATCGGGACG TGCTTGAAGG CACTTTCCAA CAGTCAGAGT TCGCCGCCGA CATCACAGCG GTCAACACTG GCAAGGCCAG CCGCGAGTAT CAGGATGCCG GCGCGTTTTT CGACCGCACC TTTATCACCG AGGGCATGGC GCTTCTATTA ACGCAGGTTG CGCAGCGCCT CACAGGACGG GGCGGTGAGC CGGTCGTGCA GCTTCAGACC GCCTTTGGCG GCGGCAAGAC GCACACCATG TTGGCCGTCT ATCACCTTGC CACCCGCAAA TGTGCCCTGT CCGACCTGGC TGGTATCTCT GCGCTTGTCG ACCGGGCCGG CTTGATGGAT GTGCCGCAGG CCCGCGTCGC AGTGCTGGAC GGCGTTGCGC ACGCGCCGGG CCAGCCATGG AAGCGCGGGA GCCAGACAAT CAAGACCCTG TGGGGCGAGA TGGCGTGGCA GCTTGGCGGG GCGGAAGCCT TCGCGCTTCT CGCCGAGGCT GACGTCACTG GCACATCGCC CGGCAAGGAC GTGCTACGCG ATCTTCTGGA ACGTCACTCT CCATGCGTAG TGCTGATCGA TGAGTTGGTG GCCTATATCC GTCAATTCCC AGAATCACAG CCGATCAGTG GGGGTAGCTA CGATTCGAAC CTCTCCTTCG TGCAGGCTCT CACCGAAGCC GCGAAGCTGG TGCCGCGTGC CATCGTGCTC GCATCGTTGC CGGAATCCGA TCTTGAAGCG GGGAGTCAGC GCGGCGCCGC GGCTCTGCGA GCACTCGAAA AGACCTTTGG GCGTGTACAA GCGCTTTGGA AGCCAGTGGC GACCGAGGAA GCCTTCGAGA TCGTGCGCCG CCGGCTATTC GAGCCGGTAC GCGACGAGAA GACCCGCGAA GGCGTTTGCC GCGCCTTCGC CGACGCCTAC ATCGCCGAGG GTGTGAAGCT TCCTGCCGAT ACACAAGAAC GACACTACTA CGACCGACTG CTGCACGCTT ATCCGATCCA CCCTGAGGTT TTTGATCGCC TCTTTGAGGA CTGGACGACC ATTGACGGTT TTCAGCGCAC GCGGGGCGTC CTGAAGCTCA TGGCGAAGGT CATCTTCCGG CTGTGGAAGG ACGACAACAA AGACCTGCTC ATCATGCCTG GTAGCCTGCC GCTTTATGAT GGCAGCAGCC GCAACGAGCT GACCTATTAT CTGCCCGCAG GATGGGACGC TGTGATCGAG CGCGACATCG ACGGCGACCG CGCCGAGACG ACCGCGCTTG AGAACAAGGA GCCGCGCTTC GGTCAAGTGG GTGCGGCTCG GCGCATTGCG CGCACGGTCT TCCTCGGCAG CGCCCCGTCG TCGGTCGCGT CCAAGGTCGT CGCTCGCGGC ATTGACCGCG CCCATATCAT TCTCGGCTGC CTTCAGCCGG GACAGGCGGC GTCCGTCTAT GCCGATGCGC TCGGCCGGCT GGCCGACCGG CTGCACTATC TCAATTCCTC GGGCGACAAG AGTCATGACG CGACGCGCTT CTGGTTCGAC ACCCGCGCAA ATCTTCGGCG GGAAATGGAG GATCGGAAGC GCCGGTTCGA CGACCGCACG GAGGTGCGCG GCAAGATCGC AGGAGCGTTG AAACAGACCG TTGGCAGCCT CACTTATTTC GACGGCGTGC ATATCTTCGC GCCGCATGGC GACGTGCCGG ATGACACCGC CCTGCGCTTG CTCGTGCTGC CGCCGGAAAC TTGGTACGCC CGCGACGAAA ATCGCCTCGC CTTTGAAGCG GTGCTGGAGA CAATTGGCAA AAACGGCCCC AAGCCGCGAT ACCGCAGCAA CCGACTCCTT TTCCTTGCAC CAGATCATGC TGCGCTTTCC CGTCTTATGG ACGCAACACG CGTCGCGCTC GCATGGGGTT CGATTGTCGA GGACGTGAAA GAGGGCCGCC TGAACATCGA CCTCTTGCAG AAAAATCAGG CCGAGAAAGA ACTGAAGAGT GCAGAGGACG CGCTGCCGCG CGTGGTCCGC GAATGCTACA AATGGCTACT CTGCCCGATG CAGGATGCAG CAACCGATCC GAAGCCCGGA ATTGAGGCAT TCGCCCTAAA TACTGCCGGT GGCTCGATCG CCGCGGACAT CGAGCGAGTT TGCATCGACA ACGAACTGGT TATCACCACT TGGTCGCCGA TCCACCTGCG CACCAAGCTT AAAGAGCTTT ACTGGAAGGG CGGGAAGCGA GCGGCAAACG CGGCCGGGTT CTTCGAGGAC ACCCTGCGCT ATCTCTACAT GCCGCGCCTC AAGACGCGGG ATGTTCTGTC GCAGGCGATC CAGGCTGGGG TAGCAGGCAA GGACTTTTTT GGCACCGCCT ATGGGGAAGC AGATGGCAAG TTTGAGGGCT TTTACTTCGG CGGTGGCACT GTCATCTTTG ATGACACATT GCTTTTGATC GAGCCTCAAG CAGCTCAAGC CTATGAGGAA GCAAACCGGG AGGCACAACC TGCCGCTACT CCGCCTGTTT CCACTGCCAC GGCGGCAGGC GGCGTGGCCG AGGCGCCGAA TGTCTATGTT TTCAATGGCG GGAGCACATC GCCGCCGGTG GCAATCACAC CTACGTCAGG CCCTACGAAG CCGAAAACCT TTTACGGTTC CGCCGAGGTT CCGCCCGCGA CCGCAAAGAT GCGCCTTGTG CAGATCGCCG AGGAAATCGT GTCGGTGCTC ACATCCGATC CAAATGCGAC CGTCCGCCTT GTCGTGGAAA TTTCGGCCGA GTTCCCAGAT GGAGCAGGCG ATGGCTTGAA ACGTGCAGTC TCGGAAAACG CCCGCAGCCT TGGCCTGAAA TCGGCGGATT GGGATTAA
|
Protein sequence | MKPWREVAVP HRDVLEGTFQ QSEFAADITA VNTGKASREY QDAGAFFDRT FITEGMALLL TQVAQRLTGR GGEPVVQLQT AFGGGKTHTM LAVYHLATRK CALSDLAGIS ALVDRAGLMD VPQARVAVLD GVAHAPGQPW KRGSQTIKTL WGEMAWQLGG AEAFALLAEA DVTGTSPGKD VLRDLLERHS PCVVLIDELV AYIRQFPESQ PISGGSYDSN LSFVQALTEA AKLVPRAIVL ASLPESDLEA GSQRGAAALR ALEKTFGRVQ ALWKPVATEE AFEIVRRRLF EPVRDEKTRE GVCRAFADAY IAEGVKLPAD TQERHYYDRL LHAYPIHPEV FDRLFEDWTT IDGFQRTRGV LKLMAKVIFR LWKDDNKDLL IMPGSLPLYD GSSRNELTYY LPAGWDAVIE RDIDGDRAET TALENKEPRF GQVGAARRIA RTVFLGSAPS SVASKVVARG IDRAHIILGC LQPGQAASVY ADALGRLADR LHYLNSSGDK SHDATRFWFD TRANLRREME DRKRRFDDRT EVRGKIAGAL KQTVGSLTYF DGVHIFAPHG DVPDDTALRL LVLPPETWYA RDENRLAFEA VLETIGKNGP KPRYRSNRLL FLAPDHAALS RLMDATRVAL AWGSIVEDVK EGRLNIDLLQ KNQAEKELKS AEDALPRVVR ECYKWLLCPM QDAATDPKPG IEAFALNTAG GSIAADIERV CIDNELVITT WSPIHLRTKL KELYWKGGKR AANAAGFFED TLRYLYMPRL KTRDVLSQAI QAGVAGKDFF GTAYGEADGK FEGFYFGGGT VIFDDTLLLI EPQAAQAYEE ANREAQPAAT PPVSTATAAG GVAEAPNVYV FNGGSTSPPV AITPTSGPTK PKTFYGSAEV PPATAKMRLV QIAEEIVSVL TSDPNATVRL VVEISAEFPD GAGDGLKRAV SENARSLGLK SADWD
|
| |