Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4933 |
Symbol | |
ID | 8007528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 312617 |
End bp | 314059 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644821852 |
Product | nitrogenase MoFe cofactor biosynthesis protein NifE |
Protein accession | YP_002973112 |
Protein GI | 241113277 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.613224 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATCGA TCGAGACACA AATGCGAGAT GCCTCCGGCG AGGTATTCCG CACCACGGAA ACCAAGGAGA CATGTCACAA CGCGTCACAG GGATCAGCAG CGGGCGGTGG CTGCGCCTTC GACGGAGCTA AGGTCGTGCT GCAGCCAATC ACCGATGTCG CACATCTCGT CCACGCGCCG CTCGCATGCG AAGGCAATTC CTGGGACAAC CGAGGCGCCG CATCTTCAGG TCCAGTTCTT TGGCGCACAA GTTTCACCAC TGATCTTACC GAACTCGACA TAGTGACGGG AGATAGTGAG CGAAAGCTTC TCAAGGCTAT CCGGGAGATC AAGGAGGGGT ATGCGCCGGC CGCAATCTTC GTCTACGGAA CCTGTGTCAC GGAGCTGATC GGTGACGACA TCGATGCGGT CTGCAGGCAC GCAGCGCAGA GGTTCTCAAT ACCAGTGGTG CCGGTGAAGT CGCCAGGCTT CGGCGGTTCG AAGAACCTCG GTAACCGGCT CGCAGGCGAG GCTCTGCTCG AGCACGTCAT AGGCACGGTG GAGGCCGATG ATCCAGGTTT ATACGACATC AATATACTCG GCGAATTCAA CCTCTCAGGG GAATTCTGGT TGGTGAAGCC GCTTTTGGAC CGACTTGGGA TCCGTGTCCG CGCCTGCATT CCCGGCGACG CGCGCTACGC GCAGGTGGCT TCTGCCCATC GCTCACGTGC AGCTATGATG GTGTGCTCCA CTGCTCTCAT TAATGTTGCT CGCAAGATGG AAGAACGCTG GAACATTCCA TTTTTCGAAG GCTCCTTCTA CGGCATTTCC AGCACTTCTG AATCCCTGCG GCGGATCGCC CAACTGCTCG TAAAGAAGGG CGCTGGTTTC GCTTTGCTCC ATCATGTCGA GACTCTCTTA GCAGAAGAGG AGGAGGGGGC CTGGAGGAAG CTGGAAGTGT ATCGGCGTCG GCTTGAGGGG AAGCGCGTTC ATCTGAACAC CGGCGGGGTG AAATCCTGGT CCATCGTACA CGCACTGATC GAAATCGGCA TGGAGATTGT CGGTACATCC GTCAGGAAAT CGACTGCCAG GGATAAGGAG AGAATCAAGC AGATGCTGAA GGACGAGAAC CACCTTCACC AATCGATGGC AGCGAGCGAG CTCTATGCAA TGTTACGTGA ACACAAGCCT GATATCATGC TGTCGGGCGG ACGCACTCAG TTCGTCGCGC TTGAGGCGAA AATTCCTTGG CTCGACGTCA ACCAGGAACG CCAGCATGCT TACGCTGGCT ATGACGGCAT GGTGGAACTA GCACGCCAGA TTGATTTGGC AATCCGCAAC CCGGTTTGGG CGCAGTTGCG CGAACCGGCG CCGTGGAAGC AGTTCGTTAC GACCGTAAGA TCGGCGGAAC CAAGCAATAA TGAGATCTGC CGACGAGGCG ACAGGTGGTT TCCATTGAGG TGA
|
Protein sequence | MSSIETQMRD ASGEVFRTTE TKETCHNASQ GSAAGGGCAF DGAKVVLQPI TDVAHLVHAP LACEGNSWDN RGAASSGPVL WRTSFTTDLT ELDIVTGDSE RKLLKAIREI KEGYAPAAIF VYGTCVTELI GDDIDAVCRH AAQRFSIPVV PVKSPGFGGS KNLGNRLAGE ALLEHVIGTV EADDPGLYDI NILGEFNLSG EFWLVKPLLD RLGIRVRACI PGDARYAQVA SAHRSRAAMM VCSTALINVA RKMEERWNIP FFEGSFYGIS STSESLRRIA QLLVKKGAGF ALLHHVETLL AEEEEGAWRK LEVYRRRLEG KRVHLNTGGV KSWSIVHALI EIGMEIVGTS VRKSTARDKE RIKQMLKDEN HLHQSMAASE LYAMLREHKP DIMLSGGRTQ FVALEAKIPW LDVNQERQHA YAGYDGMVEL ARQIDLAIRN PVWAQLREPA PWKQFVTTVR SAEPSNNEIC RRGDRWFPLR
|
| |