Gene Rleg2_5053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5053 
Symbol 
ID6978147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp699993 
End bp701522 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content53% 
IMG OID643394193 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_002279011 
Protein GI209547093 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTCG ATTACGAGAA CGACAGTGCT TTTCACACGA GGATTATCGA CGATGTCCTC 
TCGCAATATC CTGAAAAGGC GGCTAAGCGC CGCAAAAAGC ACCTTGGGGT CGCGAAGGGC
AAAGATGAGG CCGAGAAGGG ACCAGATGCG TTCTGCCAAC CCGAGGTGAA ATCAAACATC
AAGTCTAATC CCGGAGTGAT GACAGTTCGT GGCTGCGCAT ATGCCGGCTC TAAAGGTGTG
GTGTGGGGTC CAATCAAAGA CATGGTCCAC ATATCGCATG GGCCTGTCGG TTGCGGGCAA
TATTCCTGGT CGCAACGCCG CAATTATTAC GTCGGCCTGA CGGGTGTCGA CGCATTCGTC
ACCATGCAGT TCACGTCAGA CTTCCAGGAG AGGGATATCG TTTTCGGTGG CGACAAAAAG
CTCGAGAAGC TCATCGATGA AGTTGAGAAA CTTTTTCCGC TGAACAACGG TATCAGCTTG
CAATCCGAGT GTCCAATCGG ATTGATTGGG GATGACATAG AAGCTGTAGC TCGAAAGAAA
GCCAAGGAAT ACGATAAAAC AATCGTACCG GTGCGGTGCG AGGGCTTTCG TGGCGTGTCG
CAATCGCTCG GCCATCACAT CGCCAATGAT GCGATACGGG ACTGGGTCTT CGATAAGAGA
GACATTCACT TCGAGCAGGG GCCGAATGAC GTTAACGTCA TTGGTGACTA TAATATCGGC
GGCGATGCGT GGGCTTCGCG CATTCTTCTG CAGGACATCG GGTTGCGGGT GGTCGGCAAC
TGGTCGGGCG ATGCCACACT CGCGGAGTTG GAGCGCGCAC CAAAAGCGCG GCTCAATCTC
ATTCACTGCT ACCGTTCTAT GAACTACATC TCACGGCATA TGGAAGAAAA GTACGGCATT
CCCTGGATGG AGTACAACTT CTTCGGTCCT TCCCAGATTG AAGACTCTTT GCGCAATATT
GCCGCTTTTT TCGGGCCGGA GACCCAAGAA AAGGCCGAAG CGCTCATCCA AAGGTATCAA
CCCCTCGTCC AGGCGGTGAC GAAGAAGTAC CTCCCGCGCC TTTATGGCAA AAGAGTGATG
CTTTATGTGG GAGGATTGCG ACCTCGTCAC GTCATAACGG CCTACGAGGA TCTTGGAATG
GAGATCGTCG GTACCGGCTA CGAATTCGGT CACGGGGACG ACTACCAGCG CACGGGCCAA
CATGTCAAAA AAGGTACACT CATCTACGAT GATGTGACCG GCTACGAGCT CGAAAAATTC
ATCGAGGCAA TTCGGCCAGA TCTCGTCGGC TCGGGCATCA AGGAAAAGTA TCCTGTACAA
AAAATGGGCA TACCATTTCG TCAGATGCAT TCCTGGGATT ATTCTGGTCC GTATCACGGC
TACGACGGCT TCGCCATCTT TGCGAGAGAT ATGGATCTCG CCATCAACAA CCCAGTCTGG
GGGCTCTATG GCGCGCCATG GAAAAAAAGC ACCGCGCCGA TGACTGGACT TCCTGCAACT
GCAGACAAGA AGGCAAATCA TCTTTGCTGA
 
Protein sequence
MSLDYENDSA FHTRIIDDVL SQYPEKAAKR RKKHLGVAKG KDEAEKGPDA FCQPEVKSNI 
KSNPGVMTVR GCAYAGSKGV VWGPIKDMVH ISHGPVGCGQ YSWSQRRNYY VGLTGVDAFV
TMQFTSDFQE RDIVFGGDKK LEKLIDEVEK LFPLNNGISL QSECPIGLIG DDIEAVARKK
AKEYDKTIVP VRCEGFRGVS QSLGHHIAND AIRDWVFDKR DIHFEQGPND VNVIGDYNIG
GDAWASRILL QDIGLRVVGN WSGDATLAEL ERAPKARLNL IHCYRSMNYI SRHMEEKYGI
PWMEYNFFGP SQIEDSLRNI AAFFGPETQE KAEALIQRYQ PLVQAVTKKY LPRLYGKRVM
LYVGGLRPRH VITAYEDLGM EIVGTGYEFG HGDDYQRTGQ HVKKGTLIYD DVTGYELEKF
IEAIRPDLVG SGIKEKYPVQ KMGIPFRQMH SWDYSGPYHG YDGFAIFARD MDLAINNPVW
GLYGAPWKKS TAPMTGLPAT ADKKANHLC