Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5561 |
Symbol | |
ID | 6978655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 1208971 |
End bp | 1210320 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643394659 |
Product | putative nitrilotriacetate monooxygenase protein component A |
Protein accession | YP_002279477 |
Protein GI | 209547559 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.200364 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCAGA AACACGTCAC GTTCGGCATC ATGCTGCAGG GTCCCGGCGG TCACATGAAT GCCTGGAAAC ATCCAAGCGG ACCGGCTGAT GCCAGCGTCA ATTTCGACTT CTTCGTCAAC ACGGCGCGCA AGGCGGAGGC GGCCGGTATC GCCTTCGCTT TCGTCGCCGA CGGGCTCTAT ATCAACGAGC AGTCGATCCC GCATTTCCTC AACCGGTTCG AGCCGATCGC CATTCTCTCG GCGCTTGCCG CCTCGACTTC GAAAATCGGC CTCGTCGGCA CAGTCTCGAC CTCCTACAGC GACCCCTTCA CCATCGCGCG CCAGTTCGCT TCGGTCGATC TCATCAGCGG CGGCCGGGCA GGGTGGAATG CCGTGACCTC GCCGCTCGAA GGCTCGGGGC GCAATTACAG CCGCGAACAC CCCGAACACG AACTGCGCTA CGAGATCGCC GAGGACTACA TCGATGCGAT CAAAGGCCTC TGGGATTCCT GGGATGACGA CGCCTTCGTG CGCAATCGCG AAACCGGCGT CTATGCCGAC AAGACCAAGA TGCACCGCCT CGACCACAAG GGCCGCTTTT TCCGCATCGA AGGGCCGCTC AACATCGGCC GTTCGAAGCA GGGGCAACCG GTGGTCTTCC AGGCCGGCGC TTCGGACTCC GGCATCAGGC TTGCCGGCAA ACATGCCGAT GCCGTCTTTA CCAATGGCGG ACCGTTCGAG GAGGCGCAGG CCTTCTATCG GCAGCTGATG GATAGCGTCA TCGCTCATGG ACGGCCCGCG GCGGAAGTCG GCATCTATCC CGGCATCGGC CCGATCGTCG GCAAGACGGC CGAGGAAGCG GAAGCCAAAT ATCAGGCGAT CCGCAATCTC GTCACCATCG ACGAGGCGCT CCTCTATCTC GGCCGCTTCT TCGATCACCT TGATTTCAGC GTCTACCCGC TCGATGAGGC TTTCCCGGAT CTCGGCGATA TCGGCAAGAA CAGCTTCCGC GCGACCACCG ACCGCATCAA GAGGACAGCG CGCGAAAAAG GCCTGACACT GCGCGAAATC GCGCTCGATG TCGCCACGCC ACGCACCGCC TTCATCGGCA CGGCGGAGCA TATCGCCGAC GAGATCATTC GCTGGGTGGA CAACGGCGCC GCCGACGGCT TCATCCTCGG TTTCCCCGTC ATCGCCGAGG GCTTCGACGA TTTTGCCGAA CACGTCCTGC CGGTCCTGAC CGAGCGGGGG TATTTCGATC CCGTCCTGAA GGGCGAGACG CTGCGCGACC ACCTCGGCCT GCCCTTCCGC GAAAGCCGGT ATGCGGCCAG TGCCGATCAG CTCGAGCCCG GAAAGGCTGT CGGCGCCTGA
|
Protein sequence | MAQKHVTFGI MLQGPGGHMN AWKHPSGPAD ASVNFDFFVN TARKAEAAGI AFAFVADGLY INEQSIPHFL NRFEPIAILS ALAASTSKIG LVGTVSTSYS DPFTIARQFA SVDLISGGRA GWNAVTSPLE GSGRNYSREH PEHELRYEIA EDYIDAIKGL WDSWDDDAFV RNRETGVYAD KTKMHRLDHK GRFFRIEGPL NIGRSKQGQP VVFQAGASDS GIRLAGKHAD AVFTNGGPFE EAQAFYRQLM DSVIAHGRPA AEVGIYPGIG PIVGKTAEEA EAKYQAIRNL VTIDEALLYL GRFFDHLDFS VYPLDEAFPD LGDIGKNSFR ATTDRIKRTA REKGLTLREI ALDVATPRTA FIGTAEHIAD EIIRWVDNGA ADGFILGFPV IAEGFDDFAE HVLPVLTERG YFDPVLKGET LRDHLGLPFR ESRYAASADQ LEPGKAVGA
|
| |