Gene Rleg_4931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4931 
Symbol 
ID8007526 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp309442 
End bp310953 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content54% 
IMG OID644821850 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_002973110 
Protein GI241113275 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTCG ACTACGAGAA CGACGGTGAT TTCAACTCCA GGCTTATAGA TGCGGTACTC 
TCCCAGTATC CGGATAAGAC GGCTAAGCGC CGCAAAAAGC ACCTTGGCGT CGCGAAGGGC
CGAGAGGCAG CCGAGCAGAG CTCGGATGCG CTTTGTGAGA CCGGGGTGAA ATCCAACATC
AAGTCCATTC CGGGCGTGAT GACTGTTCGC GGCTGCGCTT ATGCCGGCTC GAAGGGTGTC
GTGTGGGGCC CGATCAAGGA TATGGTCCAT ATATCACATG GGCCTGTCGG TTGTGGGCAC
TATTCCTGGT CGCAACGCCG CAACTATTAC GTCGGTCTGA CGGGTGTCGA AGCCTTTGTC
ACCATGCAAT TCACGTCTGA CTTTCAAGAA AAGGATATTG TTTTTGGTGG CGACAAAAAG
CTCGAGAAGC TCATCGATGA AGTTGAGCAA CTGTTTCCAC TGAACAACGG TGTCAGCTTG
CAGTCAGAGT GTCCAATCGG ATTGATCGGC GACGATATTG AAGCTGTGGC GCGCAAGAAG
GCCAAGGAGC ACAACAAAAC GATCGTGCCG GTGCGATGCG AAGGGTTTCG TGGAGTGTCG
CAATCGCTTG GCCATCATAT CGCCAATGAC GCGATACGCG ACTGGGTTTT CGATAAGAAA
GACACCCACT ACGAGGCCAG CTTTTTCGAC GTTAACGTAA TAGGTGACTA CAATATCGGC
GGCGATGCGT GGGCTTCCCG CATTCTGCTG GAGGACATGG GGTTGCGGGT GGTCGGCAAC
TGGTCGGGAG ATGCCACACT CGCGGAGGTG GAGCGTGCGC CAAAAGCGAC GCTCAACCTT
ATTCACTGCT ACCGGTCCAT GAACTACATC GCTCGGCATA TGGAGGAAAA GTACGGCATT
CCCTGGATGG AGTACAACTT TTTCGGTCCT TCCCAGATCG AAGTTTCTTT GCGCAATATC
GCCGCATTTT TCGGGCCGGA GACCCAAGAT AGGGCCGAAG CACTCATCAC CAGATACCAA
CCCCTCGTCC AGGCGGTGAC GGAGAAATAC CGTCCGCGCC TCGATGGCAA AACTGTGATG
CTCTACGTTG GCGGATTGCG TCCCCGCCAT GTCATCACCG CCTATGAGGA TCTCGGAATG
GAGATCGTTG GCACGGGCTA CGAATTTGGC CATGGCGACG ATTACGAGCG CACCAGCCAC
TATGTCAAAA AAGGTACGCT TATCTACGAT GATGTGACCG GCTACGAGCT CGAGAACTTC
GTCGAGGCCA TTCGCCCGGA CCTAGTAGGC TCGGGCATCA AGGAAAAATA TCCGGTTCAA
AAAATGGGCA TACCGTTTCG CCAGATGCAT TCTTGGGACT ATTCGGGTCC GTATCATGGT
TATGACGGCT TCGCCATCTT TGCCAGAGAC ATGGATCTTG CCATCAACAA TCCGATCTGG
GGCCTCTACG ACGCGCCATG GAAAGAAGCG CACTGCAGCC ATGCCTGCAG TTGCGGCGGA
CAAGACGAAT AG
 
Protein sequence
MSLDYENDGD FNSRLIDAVL SQYPDKTAKR RKKHLGVAKG REAAEQSSDA LCETGVKSNI 
KSIPGVMTVR GCAYAGSKGV VWGPIKDMVH ISHGPVGCGH YSWSQRRNYY VGLTGVEAFV
TMQFTSDFQE KDIVFGGDKK LEKLIDEVEQ LFPLNNGVSL QSECPIGLIG DDIEAVARKK
AKEHNKTIVP VRCEGFRGVS QSLGHHIAND AIRDWVFDKK DTHYEASFFD VNVIGDYNIG
GDAWASRILL EDMGLRVVGN WSGDATLAEV ERAPKATLNL IHCYRSMNYI ARHMEEKYGI
PWMEYNFFGP SQIEVSLRNI AAFFGPETQD RAEALITRYQ PLVQAVTEKY RPRLDGKTVM
LYVGGLRPRH VITAYEDLGM EIVGTGYEFG HGDDYERTSH YVKKGTLIYD DVTGYELENF
VEAIRPDLVG SGIKEKYPVQ KMGIPFRQMH SWDYSGPYHG YDGFAIFARD MDLAINNPIW
GLYDAPWKEA HCSHACSCGG QDE