Gene Rleg2_5056 details


Gene Information

Locus tag: Rleg2_5056
Symbol:
ID: 6978150
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 704636
End bp: 705982
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 56%
IMG OID: 643394196
Product: nitrogenase molybdenum-iron cofactor biosynthesis protein NifN
Protein accession: YP_002279014
Protein GI: 209547096
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN
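
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 704636
end_bp = 705982
reported_gene_length = 1347     # bp
reported_protein_length = 448   # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(start_bp, end_bp)
print(length == reported_gene_length)                       # True
print(protein_length(length) == reported_protein_length)    # True
```

Under these assumptions the record is internally consistent: 705982 - 704636 + 1 = 1347 bp, and 1347 / 3 = 449 codons, of which one is the stop codon, leaving 448 residues.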


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCCCGCA TCCTAACCAA AACAAAAAGG GCAGCGATCA ACCCTCTAAA AGTGTCGCAA 
CCACTGGGCG CTGCACTGGC CTTTTTGGGT ATCGATGGTG CCCTGCCAAT ACTCCATGGT
AGCCAGGGAT GTACCAGTTT CGCGCTGGTG CTTATGGTGA GGCATTTCAA GCAAGCAGTT
CCACTGCAAA CGACGGCGAT GGACGCTATC ACCTCAGTGA TGGGCGCCGC CGATTCTCTC
GAGAGAGCAA TTGTCGAGCT GGCGGCTCGA ACGCGGCCGC GGCTAATCGG GATTTGCACA
ACAGCACTGG CAGAAACTCG CGACGAAGAT ATAGCAGGTG ACATCGTCAA CATCAAAAGC
GCACACTCGC AAGAACTCAA AGATTCAGAA GTGGTTCTCG CAAGGACGCC TGACTTCGCA
GGGGCAGTGG AGGAAGGGTG GTCGAAAGCA GTTACGGCAA TAATTGAGGC AATTACGCGC
CATGGGACGC GCGCTCGCGA TCCGAAGAAG ATCGTGATCT TTCCCGGATC GAACATGACG
GTCGCCGATA TAGAGCATTT GCGAGAGACA ATCGAGAGCT TCGGCTTAAC GCCTATGATC
CTGCCGGATG CCTCCGGCGG GTCTGACCTT ACCGCCAGCG GTCAGTGGGC GCCGATTGCG
CGAGGTGGCA CGACTGTGGA GCAGGTGCGA GATCTTGGCG CGGCATCACA GTGCATCGCC
GTCGGCGAGC AGATGAGGCG CCCGGCTGAA GCTTTGCAGG GCTTGACAGG CGTCCCATAC
GTCATGTTTG AGTCGCTGAC GGGTCTGAAT AACGCCGACC GGTTCGCTTG GCTCCTAAAG
TCAATTTCGC GCAACGATGT TCCAACGACT GTTCGTCGGG GCCGCCTGCA ATTGCAGGAG
GCCATGCTTG GTGGACATTT TTTGCTCGCA GGGAAGAAGA TCGCAATCGC TTCGGAGCCA
GACCAGCTGT TCCAGTTCGC GCAGTTCTTT ATCAGCATGG GGGCTCTACT TACAGCCGCG
GTCACGACCA CCTCGCATTC AACAGTATTG CAATTATTAC CAGCGGATAC AGTCCAGGTA
GGTGACCTAG CTGACCTCGA ACAGCTTGCC GCGGATACCG ATCTACTTGT AACGCACTCC
CATGGTCGAC AGGCCGCAGA ACGCCTCGCC GTGCCGCTAA TGCGAATAGG TTTTCCGATC
TTCGACCGAA TAGGCAGTCA GCATAAGTTG AACATTCTCT ATCGAGGAAC GCGCGACATG
ATCTACGAAG TAGGCAACTT AGTTCAGGCG GACCGCGATC GACAACCCCC TTTGAACCGG
TCGATTCCGG CTGTTCAAGT ATCCTGA
 
Protein sequence
MARILTKTKR AAINPLKVSQ PLGAALAFLG IDGALPILHG SQGCTSFALV LMVRHFKQAV 
PLQTTAMDAI TSVMGAADSL ERAIVELAAR TRPRLIGICT TALAETRDED IAGDIVNIKS
AHSQELKDSE VVLARTPDFA GAVEEGWSKA VTAIIEAITR HGTRARDPKK IVIFPGSNMT
VADIEHLRET IESFGLTPMI LPDASGGSDL TASGQWAPIA RGGTTVEQVR DLGAASQCIA
VGEQMRRPAE ALQGLTGVPY VMFESLTGLN NADRFAWLLK SISRNDVPTT VRRGRLQLQE
AMLGGHFLLA GKKIAIASEP DQLFQFAQFF ISMGALLTAA VTTTSHSTVL QLLPADTVQV
GDLADLEQLA ADTDLLVTHS HGRQAAERLA VPLMRIGFPI FDRIGSQHKL NILYRGTRDM
IYEVGNLVQA DRDRQPPLNR SIPAVQVS
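
The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, in which TTG is one of the alternative initiation codons and is read as methionine at the start of a CDS. The sketch below is a minimal pure-Python illustration of that rule, not the pipeline used to produce this record; the `translate` helper and the start-codon set are assumptions written for this example.

```python
from itertools import product

# Codon table in the usual TCAG ordering; translation table 11 shares
# its amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons assumed for table 11 in this sketch.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str, is_cds_start: bool = True) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    If is_cds_start is True and the first codon is an initiation codon,
    it is translated as M regardless of its usual meaning.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("TTGGCCCGCA TCCTAACCAA AACAAAAAGG"))   # MARILTKTKR
```

The output matches the first ten residues of the protein sequence. Translating the in-frame final line of the gene sequence with `is_cds_start=False` gives SIPAVQVS, which matches the end of the protein, followed by the TGA stop codon.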