Gene Rleg_4933 details

Gene Information

Locus tag: Rleg_4933
Symbol:
ID: 8007528
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 312617
End bp: 314059
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 57%
IMG OID: 644821852
Product: nitrogenase MoFe cofactor biosynthesis protein NifE
Protein accession: YP_002973112
Protein GI: 241113277
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE
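
The start, end, gene length and protein length fields above are mutually consistent, which a short sketch can confirm. It assumes 1-based inclusive coordinates and a gene length that includes the stop codon; neither convention is stated in this record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(312617, 314059)
print(length, protein_length(length))  # 1443 480
```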


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.613224
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCATCGA TCGAGACACA AATGCGAGAT GCCTCCGGCG AGGTATTCCG CACCACGGAA 
ACCAAGGAGA CATGTCACAA CGCGTCACAG GGATCAGCAG CGGGCGGTGG CTGCGCCTTC
GACGGAGCTA AGGTCGTGCT GCAGCCAATC ACCGATGTCG CACATCTCGT CCACGCGCCG
CTCGCATGCG AAGGCAATTC CTGGGACAAC CGAGGCGCCG CATCTTCAGG TCCAGTTCTT
TGGCGCACAA GTTTCACCAC TGATCTTACC GAACTCGACA TAGTGACGGG AGATAGTGAG
CGAAAGCTTC TCAAGGCTAT CCGGGAGATC AAGGAGGGGT ATGCGCCGGC CGCAATCTTC
GTCTACGGAA CCTGTGTCAC GGAGCTGATC GGTGACGACA TCGATGCGGT CTGCAGGCAC
GCAGCGCAGA GGTTCTCAAT ACCAGTGGTG CCGGTGAAGT CGCCAGGCTT CGGCGGTTCG
AAGAACCTCG GTAACCGGCT CGCAGGCGAG GCTCTGCTCG AGCACGTCAT AGGCACGGTG
GAGGCCGATG ATCCAGGTTT ATACGACATC AATATACTCG GCGAATTCAA CCTCTCAGGG
GAATTCTGGT TGGTGAAGCC GCTTTTGGAC CGACTTGGGA TCCGTGTCCG CGCCTGCATT
CCCGGCGACG CGCGCTACGC GCAGGTGGCT TCTGCCCATC GCTCACGTGC AGCTATGATG
GTGTGCTCCA CTGCTCTCAT TAATGTTGCT CGCAAGATGG AAGAACGCTG GAACATTCCA
TTTTTCGAAG GCTCCTTCTA CGGCATTTCC AGCACTTCTG AATCCCTGCG GCGGATCGCC
CAACTGCTCG TAAAGAAGGG CGCTGGTTTC GCTTTGCTCC ATCATGTCGA GACTCTCTTA
GCAGAAGAGG AGGAGGGGGC CTGGAGGAAG CTGGAAGTGT ATCGGCGTCG GCTTGAGGGG
AAGCGCGTTC ATCTGAACAC CGGCGGGGTG AAATCCTGGT CCATCGTACA CGCACTGATC
GAAATCGGCA TGGAGATTGT CGGTACATCC GTCAGGAAAT CGACTGCCAG GGATAAGGAG
AGAATCAAGC AGATGCTGAA GGACGAGAAC CACCTTCACC AATCGATGGC AGCGAGCGAG
CTCTATGCAA TGTTACGTGA ACACAAGCCT GATATCATGC TGTCGGGCGG ACGCACTCAG
TTCGTCGCGC TTGAGGCGAA AATTCCTTGG CTCGACGTCA ACCAGGAACG CCAGCATGCT
TACGCTGGCT ATGACGGCAT GGTGGAACTA GCACGCCAGA TTGATTTGGC AATCCGCAAC
CCGGTTTGGG CGCAGTTGCG CGAACCGGCG CCGTGGAAGC AGTTCGTTAC GACCGTAAGA
TCGGCGGAAC CAAGCAATAA TGAGATCTGC CGACGAGGCG ACAGGTGGTT TCCATTGAGG
TGA
 
Protein sequence
MSSIETQMRD ASGEVFRTTE TKETCHNASQ GSAAGGGCAF DGAKVVLQPI TDVAHLVHAP 
LACEGNSWDN RGAASSGPVL WRTSFTTDLT ELDIVTGDSE RKLLKAIREI KEGYAPAAIF
VYGTCVTELI GDDIDAVCRH AAQRFSIPVV PVKSPGFGGS KNLGNRLAGE ALLEHVIGTV
EADDPGLYDI NILGEFNLSG EFWLVKPLLD RLGIRVRACI PGDARYAQVA SAHRSRAAMM
VCSTALINVA RKMEERWNIP FFEGSFYGIS STSESLRRIA QLLVKKGAGF ALLHHVETLL
AEEEEGAWRK LEVYRRRLEG KRVHLNTGGV KSWSIVHALI EIGMEIVGTS VRKSTARDKE
RIKQMLKDEN HLHQSMAASE LYAMLREHKP DIMLSGGRTQ FVALEAKIPW LDVNQERQHA
YAGYDGMVEL ARQIDLAIRN PVWAQLREPA PWKQFVTTVR SAEPSNNEIC RRGDRWFPLR
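
The sequences above are printed in space-separated blocks of ten, so they need to be joined before analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation checked against the protein sequence. The helper names are hypothetical. The codon table is built by hand on the assumption that NCBI translation table 11 assigns the same amino acids as the standard code and differs only in the permitted start codons.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in TCAG order.
# Assumed to match translation table 11 for internal codons.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence as listed in this record.
first_line = "ATGTCATCGA TCGAGACACA AATGCGAGAT GCCTCCGGCG AGGTATTCCG CACCACGGAA"
dna = clean(first_line)
print(translate(dna))  # MSSIETQMRDASGEVFRTTE
print(round(gc_fraction(dna), 2))
```

Pasting the full gene sequence into `clean` should give a translation that can be compared directly with the protein sequence once its spaces are removed. The sketch does not handle alternative start codons such as GTG or TTG, which table 11 reads as methionine at the first position; that does not matter for this gene, which starts with ATG.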