Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5053 |
Symbol | |
ID | 6978147 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 699993 |
End bp | 701522 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643394193 |
Product | nitrogenase molybdenum-iron protein alpha chain |
Protein accession | YP_002279011 |
Protein GI | 209547093 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01282] nitrogenase molybdenum-iron protein alpha chain [TIGR01862] nitrogenase component I, alpha chain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTCG ATTACGAGAA CGACAGTGCT TTTCACACGA GGATTATCGA CGATGTCCTC TCGCAATATC CTGAAAAGGC GGCTAAGCGC CGCAAAAAGC ACCTTGGGGT CGCGAAGGGC AAAGATGAGG CCGAGAAGGG ACCAGATGCG TTCTGCCAAC CCGAGGTGAA ATCAAACATC AAGTCTAATC CCGGAGTGAT GACAGTTCGT GGCTGCGCAT ATGCCGGCTC TAAAGGTGTG GTGTGGGGTC CAATCAAAGA CATGGTCCAC ATATCGCATG GGCCTGTCGG TTGCGGGCAA TATTCCTGGT CGCAACGCCG CAATTATTAC GTCGGCCTGA CGGGTGTCGA CGCATTCGTC ACCATGCAGT TCACGTCAGA CTTCCAGGAG AGGGATATCG TTTTCGGTGG CGACAAAAAG CTCGAGAAGC TCATCGATGA AGTTGAGAAA CTTTTTCCGC TGAACAACGG TATCAGCTTG CAATCCGAGT GTCCAATCGG ATTGATTGGG GATGACATAG AAGCTGTAGC TCGAAAGAAA GCCAAGGAAT ACGATAAAAC AATCGTACCG GTGCGGTGCG AGGGCTTTCG TGGCGTGTCG CAATCGCTCG GCCATCACAT CGCCAATGAT GCGATACGGG ACTGGGTCTT CGATAAGAGA GACATTCACT TCGAGCAGGG GCCGAATGAC GTTAACGTCA TTGGTGACTA TAATATCGGC GGCGATGCGT GGGCTTCGCG CATTCTTCTG CAGGACATCG GGTTGCGGGT GGTCGGCAAC TGGTCGGGCG ATGCCACACT CGCGGAGTTG GAGCGCGCAC CAAAAGCGCG GCTCAATCTC ATTCACTGCT ACCGTTCTAT GAACTACATC TCACGGCATA TGGAAGAAAA GTACGGCATT CCCTGGATGG AGTACAACTT CTTCGGTCCT TCCCAGATTG AAGACTCTTT GCGCAATATT GCCGCTTTTT TCGGGCCGGA GACCCAAGAA AAGGCCGAAG CGCTCATCCA AAGGTATCAA CCCCTCGTCC AGGCGGTGAC GAAGAAGTAC CTCCCGCGCC TTTATGGCAA AAGAGTGATG CTTTATGTGG GAGGATTGCG ACCTCGTCAC GTCATAACGG CCTACGAGGA TCTTGGAATG GAGATCGTCG GTACCGGCTA CGAATTCGGT CACGGGGACG ACTACCAGCG CACGGGCCAA CATGTCAAAA AAGGTACACT CATCTACGAT GATGTGACCG GCTACGAGCT CGAAAAATTC ATCGAGGCAA TTCGGCCAGA TCTCGTCGGC TCGGGCATCA AGGAAAAGTA TCCTGTACAA AAAATGGGCA TACCATTTCG TCAGATGCAT TCCTGGGATT ATTCTGGTCC GTATCACGGC TACGACGGCT TCGCCATCTT TGCGAGAGAT ATGGATCTCG CCATCAACAA CCCAGTCTGG GGGCTCTATG GCGCGCCATG GAAAAAAAGC ACCGCGCCGA TGACTGGACT TCCTGCAACT GCAGACAAGA AGGCAAATCA TCTTTGCTGA
|
Protein sequence | MSLDYENDSA FHTRIIDDVL SQYPEKAAKR RKKHLGVAKG KDEAEKGPDA FCQPEVKSNI KSNPGVMTVR GCAYAGSKGV VWGPIKDMVH ISHGPVGCGQ YSWSQRRNYY VGLTGVDAFV TMQFTSDFQE RDIVFGGDKK LEKLIDEVEK LFPLNNGISL QSECPIGLIG DDIEAVARKK AKEYDKTIVP VRCEGFRGVS QSLGHHIAND AIRDWVFDKR DIHFEQGPND VNVIGDYNIG GDAWASRILL QDIGLRVVGN WSGDATLAEL ERAPKARLNL IHCYRSMNYI SRHMEEKYGI PWMEYNFFGP SQIEDSLRNI AAFFGPETQE KAEALIQRYQ PLVQAVTKKY LPRLYGKRVM LYVGGLRPRH VITAYEDLGM EIVGTGYEFG HGDDYQRTGQ HVKKGTLIYD DVTGYELEKF IEAIRPDLVG SGIKEKYPVQ KMGIPFRQMH SWDYSGPYHG YDGFAIFARD MDLAINNPVW GLYGAPWKKS TAPMTGLPAT ADKKANHLC
|
| |