Gene Rleg_4934 details


Gene Information

Locus tag: Rleg_4934
Symbol:
ID: 8007529
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 314065
End bp: 315396
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 59%
IMG OID: 644821853
Product: nitrogenase molybdenum-iron cofactor biosynthesis protein NifN
Protein accession: YP_002973113
Protein GI: 241113278
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN
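
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length_bp(314065, 315396)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch reproduces the listed gene length (1332 bp) and protein length (443 aa).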


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCGCA TCCTGACCCA AACAAAAACG GCAGCGATCA ACCCTCTAAA GTCGTCTCAA 
CCATTGGGCG CTGCCTTGGC GTTCCTGGGC GTCGATGGTG CGCTGCCAAT ACTGCATGGC
AGCCAGGGAT GTAGCAGTTT TGCACTGGTG CTGCTCGTGA GGCATTTCAA GCATATGGTC
CCGTTGCAGA CTACTGCAAT GGACCAGATC GCGACTGTGG TGGGCGGCGC CGAATTTCTC
GAGAAAGCTC TCGTCAAACT AAAAGCTCGC ACGTGGCCCC GGCTGATCGG GATCTGCACC
ACCGCTGTGG CAGAAACTCG CGACGAAGAT ATCGCACCTG ATATCTTTAA CGCCACAGGG
GCGGGCCTGA GAGGACGCAT TGATACGGAA GTGGTACTCG CACGCACTCC CGACTTCGCA
GGGGCGGTCG AGGAGGGATG GTCGAAGGCT GTTACGGCTA TTATCGAGGC AATCACGCGG
CCAGGGACTC AGGACCGCGA TGCGGGGAGG GTCGTGATTC TGCCCGGCTC GAATATGACA
GTTGCCGATG TAGAGCATCT GCGGGAAATG GTTGAGAGCT TCGGCTTAAT ACCCCTCATT
TTGCCTGACG TGTCCGGCGG GTCTGACGAA GCTGTCCGCG ATCGATGGAT TCAAATCTCA
CGCGGTGGCG CGAAGGTGGA GCATATCCGT GATCTCGGGG CGGCAACACA GTGCATCGCG
GTCGGCGAAC AGATGCGCCG ACCGGCTGAA GCTTTGCAGG GTCTGACTGG ATTGCCATAC
GTGATGTTCA GATCACTCAC GGGCCTGATT AACGCCGACC GTTTCGCCTG GCTCCTTGCA
GCGATTTCGC GGGACAGCGC GCCGGCAGCT GTCCGTCGTG GCCGCATGCA GTTGCAGGAG
GCGATGCTCA GCGGGCATTT TCATTTTGCA GGAAAGAAGG TCGCCATTGC GTGCGAGCCG
GACCAGCTCT TGCAGTTCGC CCAATTCTTT ATCGGCATGG GGGCGGTCAT TACAGCCGCG
GTCACCACTA CCGGGCACCC AAAGGTGCTG CAGACAGTAT CAGCGGATAC TGTCCAGGTG
GGTGATCTTG GTGACCTGGA ACAGCTCGCA GCGGATGCTG ATCTGCTCGT CACGCACTCC
CATGGCCGAC AGGCCGCAGA ACGCCTCAGC GTTCCACTGA TGCGAGTCGG TTTCCCTGTT
TTCGACCGCA TTGGAGGCCA GCATAAATTG AAAATCCTTT ATCGAGGAAC GCGCGACATG
ATCTTCGAAG TTGCCAATAT AATTCAGGCG AGCCAAGGTC TGCCGCCCGC TCGCGCACCA
GCGGACCCGT AG
 
Protein sequence
MARILTQTKT AAINPLKSSQ PLGAALAFLG VDGALPILHG SQGCSSFALV LLVRHFKHMV 
PLQTTAMDQI ATVVGGAEFL EKALVKLKAR TWPRLIGICT TAVAETRDED IAPDIFNATG
AGLRGRIDTE VVLARTPDFA GAVEEGWSKA VTAIIEAITR PGTQDRDAGR VVILPGSNMT
VADVEHLREM VESFGLIPLI LPDVSGGSDE AVRDRWIQIS RGGAKVEHIR DLGAATQCIA
VGEQMRRPAE ALQGLTGLPY VMFRSLTGLI NADRFAWLLA AISRDSAPAA VRRGRMQLQE
AMLSGHFHFA GKKVAIACEP DQLLQFAQFF IGMGAVITAA VTTTGHPKVL QTVSADTVQV
GDLGDLEQLA ADADLLVTHS HGRQAAERLS VPLMRVGFPV FDRIGGQHKL KILYRGTRDM
IFEVANIIQA SQGLPPARAP ADP
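
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is one way to reproduce that, assuming the sequence is pasted with its whitespace-separated 10-base blocks. It relies on the fact that translation table 11 assigns the same amino acid to every codon as the standard code (the two differ only in permitted start codons); the helper names `clean`, `translate` and `gc_fraction` are hypothetical and chosen for illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and the final line of the gene sequence above.
print(translate("ATGGCTCGCA TCCTGACCCA AACAAAAACG"))  # MARILTQTKT
print(translate("GCGGACCCGT AG"))                    # ADP
```

The two printed fragments match the first ten and last three residues of the protein sequence in this record. Passing the complete gene sequence to `translate` and `gc_fraction` would allow the full protein and the GC content field to be checked in the same way.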