Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A1011 |
Symbol | |
ID | 3833472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 1198007 |
End bp | 1199470 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637825100 |
Product | nitrogenase molybdenum-iron protein alpha chain |
Protein accession | YP_426099 |
Protein GI | 83592347 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01282] nitrogenase molybdenum-iron protein alpha chain [TIGR01862] nitrogenase component I, alpha chain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.77042 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCTCG AATACACCAA CGATGCGGAA TTGAGCCAGA AGGCGATCGA GGAGGTCCTG GAGGCCTATC CCGAAAAGGC CGCGAAGAAG CGCAAAAAGC ACCTTGGCAC CATCGTTGCC GAGGGCGAGG GCAGCTCTTG CGGGGTGAAG TCCAACGTCA AGGCCATTCC GGGCGTCATG ACCATCCGCG GCTGCGCCTA TGCCGGCTCG AAGGGCGTGG TCTGGGGTCC GGTCAAGGAC ATGGTCCACA TCAGTCACGG CCCGGTCGGC TGCGGTCAGT ACTCCTGGTC CCAGCGCCGC AATTACTTCA CCGGTCAGGT GGGCGTCGAT TCTTTCGTCA CCATGCAGTT CACCTCGGAT TTCCAGGAAA AAGACATCGT CTTTGGCGGT GACAAGAAGC TGGAAAAGGT GATCGACGAG ATCAAGGGGC TGTTTCCGCT GGTTCGCGGC ATCAGCATCC AGTCCGAATG CCCGATCGGC CTGATCGGCG ACGATATCGA AGCCGTCGCC CGCAAGAAGG CCAAGGATGT CGGCTTGCCG ATCATCCCGG TGCGCTGCGA AGGCTTCCGC GGCGTGTCGC AGTCGCTTGG TCACCATATC GCCAATGACG CCATCCGCGA CTGGGTCTTC TCGCGCGACA GCGAAAGCGC CTTCGAGACC ACGCCCTATG ACGTCAACAT CATCGGCGAT TACAACATCG GTGGCGACGC CTGGGCCTCG CGCATTCTGT TGGAGGAAAT GGGCCTGCGG GTGATCGCCC AATGGTCGGG CGATGCCACC ATCGCCGAAA TGGAACGCGC CCCCAAGGCC AAGCTGAACC TCATCCATTG CTACCGGTCG ATGAATTACA TCTGCCGCCA CATGGAAGAG AAGCACGGCG TGCCCTGGAT GGAATACAAC TTCTTCGGTC CCTCGCAGAT CGAGAAGTCG TTGCGCGCCA TCGCCGCCAA TTTCGACGAG ACCATCCAGA AGAAGGCCGA GGAGGTGATC GCCGCCCATC GCCCGACGGT CGACGCGGTG ATCAACAAGT ACAAGGCCCG CCTCGAAGGC AAGCGCGTCA TGCTGTATGT CGGCGGCCTG CGCCCCCGTC ACGTGATGAC CGCTTATGAA GACCTCGGCA TGCAGATCTG CGGCGCCGGT TATGAATTCG CCCATAGCGA CGATTACCAG CGCACCACCG AATACGCCAA GGAAGGCACG CTGATCTATG ACGACCTGAC CGGCTACGAG CTGGAGCGGT TCATCGAGAA GCTGCGCCCC GATCTGGTGG GCTCGGGCAT CAAGGAAAAA TACGCCGTTC AGAAGATGGG CGTGCCTTTC CGCCAGATGC ACTCCTGGGA TTACTCGGGT CCTTACCACG GCTATGACGG CTTCGCCATC TTCGCCCGTG ACATGGACAT GGCCATCAAC AATCCGGTCT GGGCCTTGCT GAAAGCCCCG TGGACCAAGG CCGCCGCCGA GTAA
|
Protein sequence | MSLEYTNDAE LSQKAIEEVL EAYPEKAAKK RKKHLGTIVA EGEGSSCGVK SNVKAIPGVM TIRGCAYAGS KGVVWGPVKD MVHISHGPVG CGQYSWSQRR NYFTGQVGVD SFVTMQFTSD FQEKDIVFGG DKKLEKVIDE IKGLFPLVRG ISIQSECPIG LIGDDIEAVA RKKAKDVGLP IIPVRCEGFR GVSQSLGHHI ANDAIRDWVF SRDSESAFET TPYDVNIIGD YNIGGDAWAS RILLEEMGLR VIAQWSGDAT IAEMERAPKA KLNLIHCYRS MNYICRHMEE KHGVPWMEYN FFGPSQIEKS LRAIAANFDE TIQKKAEEVI AAHRPTVDAV INKYKARLEG KRVMLYVGGL RPRHVMTAYE DLGMQICGAG YEFAHSDDYQ RTTEYAKEGT LIYDDLTGYE LERFIEKLRP DLVGSGIKEK YAVQKMGVPF RQMHSWDYSG PYHGYDGFAI FARDMDMAIN NPVWALLKAP WTKAAAE
|
| |