Gene Rleg2_5055 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5055 
Symbol 
ID6978149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp703181 
End bp704545 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content56% 
IMG OID643394195 
Productnitrogenase MoFe cofactor biosynthesis protein NifE 
Protein accessionYP_002279013 
Protein GI209547095 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATCGA TCGAGGCTCA AACAGGAGAT ACTTCCAGCC AACGTACCGC CGCGATATAC 
CCAACCAAAG AAAACAAGGC ATGTCAGAAG GCGACGCAAG GATCGGCGGC CGGCGGCTGC
GCCTTCGACG GAGCTAAGGT GGTGCTCCAG CCAATAGTGG ATGTCGCGCA TTTAATTCAC
GCGCCGCCCG CATGCGAGGG CAATTCCTGG GACAACCGAG GCGCGGCGTC GTCAGGTCCG
GTTCTTTGGC GCACCAGCTT TACCACTGAC ATTACTGAAA TCGATATAGT GACGGGAGAT
ACTGATCAGA AGCTTCTTGA GGCGATCCGC GAGATCAAAA AGGGATATGC ACCGGCGGCA
ATCTTCGTCT ATGGAACATG CGTAAGCGAG CTGATTGGTG GCAACATCGA CGCGGTCTGC
AGGCACGCAG CGCAGAAGTT CGCGATACCA GTGGTGCCGG TTAAGTCGCC GGGCTTCCGC
GGTTCGAAGA GCGTGGGCAA CAGGATCGCC GGAGAGGCTC TGCTCGAGCA CGTGATAGGC
ACGGTGGAGG CCGATAATAC TAGCCCATAC GACATCAATA TCCTCGGCGA ATTCAACCTC
TCAGGAGAGT TTTGGCTGGT GAAGCCGCTG TTGGACCGGC TTGGCATCCG TGTTCGCGCC
TGTATTCCCG GAGATGCGCG CTTTGCGCAG GTTGGTTCCG CCCACCGCTC CCGTGCAGCT
ATGGTGGTGT GCTCCACTGC TCAGATCAAC CTTGCACGTA AGATGGAAGC ACGCTGGGAT
ATTCCATTTT TTGAGGGGTC CTTCTATGGC ATCTCCGGCA CCTCGGAATC GCTTCGGCGG
ATCGCTCAAT TGCTCGTAAA CAAGGGCGCT GGTCTAGCAT TCCTCCACCG TACTGAGGAG
CTCATTGCAG ATGAGGAGGA AAGAGTCTGG AAGAATTTGG AAGTGTACCG GCGTAGGCTC
GTGGGCAAGC GCGTTCATCT GAACACCGGC GGCGTGAAAT CCTGGTCCAT CGTGCATGCA
TTGATCGAGA TCGGCATGGA AATTATCGGC ACATCAGTCA AGAAGTCGAC CGTCAGGGAC
AAAGAGAAAA TCAAACAGAT GCTAAAGAAC GAGAGCCGCC TGCATCACAC GATGGCAGCA
AGCAAGCTAT ACGCGGTGTT ACGCGGACAG AAGCCTGATA TCATGCTGTC GGGCGGACGC
ACTCAATTCG TTGCACTTGA GGCAAAAATA CCATGGCTCG ACGTCAATCA GGAGCGCCAG
CATCCCTACG CTGGCTACGA AGGCATGGTG AAACTCGCGC AAGAGATTGA TCTGGCAATC
CACAGCCCCA TCTGGGCGCA ATTGCGCGAA CCGGAGCCGT GGTAG
 
Protein sequence
MSSIEAQTGD TSSQRTAAIY PTKENKACQK ATQGSAAGGC AFDGAKVVLQ PIVDVAHLIH 
APPACEGNSW DNRGAASSGP VLWRTSFTTD ITEIDIVTGD TDQKLLEAIR EIKKGYAPAA
IFVYGTCVSE LIGGNIDAVC RHAAQKFAIP VVPVKSPGFR GSKSVGNRIA GEALLEHVIG
TVEADNTSPY DINILGEFNL SGEFWLVKPL LDRLGIRVRA CIPGDARFAQ VGSAHRSRAA
MVVCSTAQIN LARKMEARWD IPFFEGSFYG ISGTSESLRR IAQLLVNKGA GLAFLHRTEE
LIADEEERVW KNLEVYRRRL VGKRVHLNTG GVKSWSIVHA LIEIGMEIIG TSVKKSTVRD
KEKIKQMLKN ESRLHHTMAA SKLYAVLRGQ KPDIMLSGGR TQFVALEAKI PWLDVNQERQ
HPYAGYEGMV KLAQEIDLAI HSPIWAQLRE PEPW