Gene Information |
Locus tag | Rleg_6039 |
Symbol | |
ID | 8016301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012852 |
Strand | - |
Start bp | 71389 |
End bp | 74928 |
Gene Length | 3540 bp |
Protein Length | 1179 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644827347 |
Product | aldehyde oxidase and xanthine dehydrogenase molybdopterin binding |
Protein accession | YP_002978547 |
Protein GI | 241258663 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs |
TIGRFAM ID | |
|
|
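The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship. The helper names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon, which is consistent with the values in this record but is not stated by the source.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Rleg_6039).
length = gene_length(71389, 74928)
print(length)                  # 3540
print(protein_length(length))  # 1179
```

Both results agree with the Gene Length (3540 bp) and Protein Length (1179 aa) fields listed for this locus.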
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0887172 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCACC GCGAAGCACC CAAGTCCGCC TACATGGCAG CCGCGAGCAC GCTTCTCATT GTCAATGACG CGTCGATGGG CGACGCCGCC GAGCTTTATA TCGCTGTAGG GGCCGATAGC CGGGTGACAG CGTTCAACGG CCACGTCGAT CTCGGGACAG GCATCCGCAC CTCGCTGGCG CAAATCGTCG CCGAAGAGCT CAGCGTGCCG TTCGAACAGG TGGACATGGT GCTCGGCACG ACGACCGCCG CGCCCAATCA GGGCGCGACC ATTGCCAGCG AAACCATCCA GATCACCGCG ATACCGCTGC GCCAGGCCGC CGCCACCGCC CGCCACCATC TGCTCGTCAA GGCCGCCGAG AAGGTCGGCG TGCCGATTGA GCGGCTGCGG CTGGAGGACG GCATCATCCG TGCCGACGGC GGCGAGAACT GGCAGCTGAA CTTCGGCGAT GTTGTCGTCG GCAGCCATGT CAGGCTGTCG ATCGACAGTA ATGCGGCGCT GAAGCCGGCG TCGGACTATA AACTGGTTGG CTCGTCGCGC CCGCGCGTCG ATATTCCCGA GAAGGCAACC GGGCGCTGGA CCTATGTCCA CGATGTGCGC GTGCCTGGCA TGCTGCACGG CCGCGTTATT CGCCCGCCAT ATGCCGGATT CGATCACGGC GAGCATGTCG GAAACAGCTT GATCTCGATC GACGAGACCT CGGTCGCGCA TATCGACGGG CTGGTCGGCG TCGTCGCCAT CGGTGATTTC GTCGGTGTTG TCGCGACCCG CGAGGAAATC GCCATCGAGG CGTCCGGGAG CCTGAAGGTC GTGTGGCGCG CGCCGCCCGA GTGGCCGGAT CTGAACATGC CGGAGAAGGC GCTGCGTGCC AATCCGTCGA CGCCACGCAA GCTTGCCGAT CGCGGCAATG TCGACATGGC ATTGGCCGGC AGCGCCCAGC CGATGAACCG CACCTATGTC TGGCCTTACC AGATGCACGG TTCGATCGGC CCGTCCTGCG CGGTGGCCGA TTACAATGAC GCCGGGCTGA CCGTCTGGTC GGGAACCCAG AACCCTTATC CCATGCGCCG CGATCTGGCG CTGCTGCTCG ACTTGCCGGA AGAGCAGATC AGGGTCGAGC GGCTGGAGGC GGCCGGCTGT TACGGCCGCA ACTGCGCCGA TGACGTAACA GCCGACGCGG CGCTGCTGTC GCGGGCCGTC AAGGCACCCG TCCGGGTCCA GCTGACGCGC GAACAGGAAC ATGCCTGGGA GCCGAAGGGC GCCGCCCAGA TCATGGATGT GCGCGGCGGT CTCGACCTGG AGGGCGGTCC GTCCGCCTAT GATTTCGAAA CGCGCTATCC CTCCAATCTG GCGCCAACCC TTCCCCTGAT CCTCACCGGC AAGCTGCCGC CCGTTTCCGA CGTCGTGCAG ATGGGCGATC GTACCGCCAT CCCACCCTAT GCCTACGGCA ACCTGCGCGT AACCGTGCAC GACATGCCGC CGATCGCACG CGCCTCCTGG TTCCGTGGCG TGTCGGCGAT GCCGAATACC TTCGCCCACG AATGCTACGT CGACGAGCTG GCCGCCGCCG CCGGCGTCGA TCCGGTCGAG TACCGGCTCC GTTATCTCCA CGACCCACGC GCCGTCGACC TGGTGAACGC GCTGGCCGAA CGGGCGAAGT GGGTGCCGCA TACGACATGG GGCACGCTCA GCGGCGAAGG CGATCTGCTC TACGGCCGTG GATTTGCCTA TGCCGTCTAC GTCCACGGGC CGTTTCCGGG CAAGGCCGCC GCCTGGGCGG CATGGGTTGC CGATGTCGCC 
GTCAACAAGA AGACCGGCGA GATCGCGGTT ACGAAGGTGA CATGCGCGCA GGACTCGGGA ATGATGATCA ATCCCGACGG CGTGCGTCAC CAGATCCACG GCAACATCAT TCAGTCCACC AGCCGGGTGC TGAAGGAAAA GGTCGAATTT TCCTCGACCG CGGTGCAGTC GAAGGAATGG GGCGGCTATC CGCTGATCAC CTTCCCCGAG GTACCTGATA TCGACGTGCT GATGGTGCCG CGGCAGGACG AGCCGCCGCT CGGTGTCGGA GAGTCCGCCT CCGTTCCCAG CGCCTCGGCC ATCGCCAATG CCGTCTACGA TGCCACCGGC ATTCGTTTCC GCGAACTGCC GCTGACGCCG GAACTGGTCC TTGCCGCGCT GAACGGCAAG ACGGGCGAAA GGCCGGCCAC CCCGGTCGCG AAAAAGCGGA GGTGGTGGAA CGTCGGCCTG TCGGTGATCG GTGCCGTCGC CGCTCTGTCC GGTATCGTGA CCATGGCATC GCCATGGCGC CCGGCGATCG GCACCATCCA GCAGCCGGAC GCCAATGTCT ACAGCGCCGC CACCATCGAG CGCGGCCGGC TGGCGGCGGC GGCCGGCGCC TGCAATGTCT GCCATGTCGG AAACGACGGA ACGCCCTTTG CCGGTGGCCG GCGCTTCGAC ACGCCATTCG GCGCAGTTTA TGCCACCAAC ATCACGCCCG ATGCCCAAAG CGGCATCGGC GCCTGGTCCT ACCCGGCCTT CGAACGTGCC ATGCGCGAGG GCATCAGCCG CGACGGCCAT CACCTCTATC CCGCGCACCC CTACACCTCG TTCGCGGGCG CCGAGGATGC CGATCTTCAG GCGCTCTACG CCTACATGAT GACGCAGGCG CCGGTCGCCG AAAGGGCGCC TGAGACGAAG CTCAAATTCC CCTACAGCAT CCGCGCGATG ATGGCGGGCT GGAACGCGCT GTTCCTGAAG GCGCAGCCGT TCAAATATGT CGAGACCCGC GATGCGCAGT GGAACAGGGG CGCCTATCTC GTCGAGACCT TGGGCCACTG TTCGGCCTGC CACACCGAGC GCAATGTGCT CGGCGCGGAA AAGAGCGGCA GCGCCCGGCT TTCCGGCGGC TTTGCCGATG GCTGGGAGGC GCCGGCCCTG AATGCGTTCG CCAAGGGCCC GGTCGGCTGG ACGGCAGACG CCTTCTATGA CTATCTGCGC ACCGGACATT CGCGCGATCA CGGCAGTGCC GCCGGCCCGA TGGCGCATGT CGTCGAAGTC ATGCAGCCGC TTCCGGATAG CGATATCCGC GCTATGGCCA CATATCTCGC AAGCCTCAAC GAGGCCCCCG CCGATAGCAA GGCGCAGAGC GAGGCGGCCA TTGCCGCGAG TGAAGCGGCC AAGGCTTCCG CCGCGCGGAT TTCGCCGAAG GGCGAGCGGC TGTTTAGTGG CGCCTGCGCC ACATGCCACA CTGGAAACAC GATCCTGTCG TCGCTGGCGC TCAACAGCAA CCTTCATGCG GCGACCCCTG ACAATCTCAT CCAGGCAATC CTGAACGGTG TCGAGGCGCC GGCAATCCTT GCGCAGACGA CCGGCCGCCA AGCCCCGGAG GTGATGTCGA TGCCGGCTTT CCGCCAGACG CTGAACGACG GCCAGATCAA GGATCTCGCC GATTATCTGA GAGCGCGCTT CGCCCCCGAC AAGCCGGCCT GGACGGAAAC GACCAAAGCC ATGCAACGCG TGACGGCAGC AAACCACTAG
|
Protein sequence | MNHREAPKSA YMAAASTLLI VNDASMGDAA ELYIAVGADS RVTAFNGHVD LGTGIRTSLA QIVAEELSVP FEQVDMVLGT TTAAPNQGAT IASETIQITA IPLRQAAATA RHHLLVKAAE KVGVPIERLR LEDGIIRADG GENWQLNFGD VVVGSHVRLS IDSNAALKPA SDYKLVGSSR PRVDIPEKAT GRWTYVHDVR VPGMLHGRVI RPPYAGFDHG EHVGNSLISI DETSVAHIDG LVGVVAIGDF VGVVATREEI AIEASGSLKV VWRAPPEWPD LNMPEKALRA NPSTPRKLAD RGNVDMALAG SAQPMNRTYV WPYQMHGSIG PSCAVADYND AGLTVWSGTQ NPYPMRRDLA LLLDLPEEQI RVERLEAAGC YGRNCADDVT ADAALLSRAV KAPVRVQLTR EQEHAWEPKG AAQIMDVRGG LDLEGGPSAY DFETRYPSNL APTLPLILTG KLPPVSDVVQ MGDRTAIPPY AYGNLRVTVH DMPPIARASW FRGVSAMPNT FAHECYVDEL AAAAGVDPVE YRLRYLHDPR AVDLVNALAE RAKWVPHTTW GTLSGEGDLL YGRGFAYAVY VHGPFPGKAA AWAAWVADVA VNKKTGEIAV TKVTCAQDSG MMINPDGVRH QIHGNIIQST SRVLKEKVEF SSTAVQSKEW GGYPLITFPE VPDIDVLMVP RQDEPPLGVG ESASVPSASA IANAVYDATG IRFRELPLTP ELVLAALNGK TGERPATPVA KKRRWWNVGL SVIGAVAALS GIVTMASPWR PAIGTIQQPD ANVYSAATIE RGRLAAAAGA CNVCHVGNDG TPFAGGRRFD TPFGAVYATN ITPDAQSGIG AWSYPAFERA MREGISRDGH HLYPAHPYTS FAGAEDADLQ ALYAYMMTQA PVAERAPETK LKFPYSIRAM MAGWNALFLK AQPFKYVETR DAQWNRGAYL VETLGHCSAC HTERNVLGAE KSGSARLSGG FADGWEAPAL NAFAKGPVGW TADAFYDYLR TGHSRDHGSA AGPMAHVVEV MQPLPDSDIR AMATYLASLN EAPADSKAQS EAAIAASEAA KASAARISPK GERLFSGACA TCHTGNTILS SLALNSNLHA ATPDNLIQAI LNGVEAPAIL AQTTGRQAPE VMSMPAFRQT LNDGQIKDLA DYLRARFAPD KPAWTETTKA MQRVTAANH
|
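The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is illustrative only: `translate` and `CODON_TABLE` are hypothetical names, and the codon table is the standard one, whose amino-acid assignments translation table 11 shares (table 11 differs in the start codons it permits, which does not matter here because this gene begins with ATG). Only the first 30 and the final 18 nucleotides of the record are used, to keep the example short.

```python
# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Codons with ambiguous bases (e.g. N) are not handled and raise KeyError.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nt and final 18 nt of the gene sequence shown in the record.
head = "ATGAACCACC GCGAAGCACC CAAGTCCGCC"
tail = "ACGGCAGC AAACCACTAG"
print(translate(head))  # MNHREAPKSA
print(translate(tail))  # TAANH
```

The two outputs match the first ten residues and the last five residues of the protein sequence listed above. Because the record's Strand is "-" and the displayed sequence begins with ATG, the sequence is taken here to be already in coding orientation, so no reverse complement is applied.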