Gene Rru_A1011 details

Gene Information

Locus tag: Rru_A1011
Symbol:
ID: 3833472
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 1198007
End bp: 1199470
Gene Length: 1464 bp
Protein Length: 487 aa
Translation table: 11
GC content: 61%
IMG OID: 637825100
Product: nitrogenase molybdenum-iron protein alpha chain
Protein accession: YP_426099
Protein GI: 83592347
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01282] nitrogenase molybdenum-iron protein alpha chain
            [TIGR01862] nitrogenase component I, alpha chain
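
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the lengths listed) and that the CDS ends in a single stop codon. The function and variable names are illustrative only and do not come from the source database.

```python
# Values copied from the record; the names are hypothetical.
START_BP = 1198007
END_BP = 1199470
GENE_LENGTH_BP = 1464
PROTEIN_LENGTH_AA = 487


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Number of codons in the CDS, minus one for the terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both comparisons hold for the values listed: 1199470 - 1198007 + 1 = 1464 bp, and 1464 / 3 - 1 = 487 aa.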


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.77042
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCTCG AATACACCAA CGATGCGGAA TTGAGCCAGA AGGCGATCGA GGAGGTCCTG 
GAGGCCTATC CCGAAAAGGC CGCGAAGAAG CGCAAAAAGC ACCTTGGCAC CATCGTTGCC
GAGGGCGAGG GCAGCTCTTG CGGGGTGAAG TCCAACGTCA AGGCCATTCC GGGCGTCATG
ACCATCCGCG GCTGCGCCTA TGCCGGCTCG AAGGGCGTGG TCTGGGGTCC GGTCAAGGAC
ATGGTCCACA TCAGTCACGG CCCGGTCGGC TGCGGTCAGT ACTCCTGGTC CCAGCGCCGC
AATTACTTCA CCGGTCAGGT GGGCGTCGAT TCTTTCGTCA CCATGCAGTT CACCTCGGAT
TTCCAGGAAA AAGACATCGT CTTTGGCGGT GACAAGAAGC TGGAAAAGGT GATCGACGAG
ATCAAGGGGC TGTTTCCGCT GGTTCGCGGC ATCAGCATCC AGTCCGAATG CCCGATCGGC
CTGATCGGCG ACGATATCGA AGCCGTCGCC CGCAAGAAGG CCAAGGATGT CGGCTTGCCG
ATCATCCCGG TGCGCTGCGA AGGCTTCCGC GGCGTGTCGC AGTCGCTTGG TCACCATATC
GCCAATGACG CCATCCGCGA CTGGGTCTTC TCGCGCGACA GCGAAAGCGC CTTCGAGACC
ACGCCCTATG ACGTCAACAT CATCGGCGAT TACAACATCG GTGGCGACGC CTGGGCCTCG
CGCATTCTGT TGGAGGAAAT GGGCCTGCGG GTGATCGCCC AATGGTCGGG CGATGCCACC
ATCGCCGAAA TGGAACGCGC CCCCAAGGCC AAGCTGAACC TCATCCATTG CTACCGGTCG
ATGAATTACA TCTGCCGCCA CATGGAAGAG AAGCACGGCG TGCCCTGGAT GGAATACAAC
TTCTTCGGTC CCTCGCAGAT CGAGAAGTCG TTGCGCGCCA TCGCCGCCAA TTTCGACGAG
ACCATCCAGA AGAAGGCCGA GGAGGTGATC GCCGCCCATC GCCCGACGGT CGACGCGGTG
ATCAACAAGT ACAAGGCCCG CCTCGAAGGC AAGCGCGTCA TGCTGTATGT CGGCGGCCTG
CGCCCCCGTC ACGTGATGAC CGCTTATGAA GACCTCGGCA TGCAGATCTG CGGCGCCGGT
TATGAATTCG CCCATAGCGA CGATTACCAG CGCACCACCG AATACGCCAA GGAAGGCACG
CTGATCTATG ACGACCTGAC CGGCTACGAG CTGGAGCGGT TCATCGAGAA GCTGCGCCCC
GATCTGGTGG GCTCGGGCAT CAAGGAAAAA TACGCCGTTC AGAAGATGGG CGTGCCTTTC
CGCCAGATGC ACTCCTGGGA TTACTCGGGT CCTTACCACG GCTATGACGG CTTCGCCATC
TTCGCCCGTG ACATGGACAT GGCCATCAAC AATCCGGTCT GGGCCTTGCT GAAAGCCCCG
TGGACCAAGG CCGCCGCCGA GTAA
 
Protein sequence
MSLEYTNDAE LSQKAIEEVL EAYPEKAAKK RKKHLGTIVA EGEGSSCGVK SNVKAIPGVM 
TIRGCAYAGS KGVVWGPVKD MVHISHGPVG CGQYSWSQRR NYFTGQVGVD SFVTMQFTSD
FQEKDIVFGG DKKLEKVIDE IKGLFPLVRG ISIQSECPIG LIGDDIEAVA RKKAKDVGLP
IIPVRCEGFR GVSQSLGHHI ANDAIRDWVF SRDSESAFET TPYDVNIIGD YNIGGDAWAS
RILLEEMGLR VIAQWSGDAT IAEMERAPKA KLNLIHCYRS MNYICRHMEE KHGVPWMEYN
FFGPSQIEKS LRAIAANFDE TIQKKAEEVI AAHRPTVDAV INKYKARLEG KRVMLYVGGL
RPRHVMTAYE DLGMQICGAG YEFAHSDDYQ RTTEYAKEGT LIYDDLTGYE LERFIEKLRP
DLVGSGIKEK YAVQKMGVPF RQMHSWDYSG PYHGYDGFAI FARDMDMAIN NPVWALLKAP
WTKAAAE
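
The gene and protein sequences above are related by translation table 11, and the GC content field is a property of the gene sequence. The following is a minimal sketch of how both could be recomputed from the space-separated blocks as displayed. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs in which start codons are permitted. Alternative start codons are not handled here, which is an acceptable simplification for this gene because it begins with ATG. All names (`CODON_TABLE`, `clean`, `gc_percent`, `translate`) are illustrative, not part of any library.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First displayed line of the gene sequence (60 bases, 20 codons).
    first_line = "ATGAGCCTCG AATACACCAA CGATGCGGAA TTGAGCCAGA AGGCGATCGA GGAGGTCCTG"
    print(translate(first_line))  # MSLEYTNDAELSQKAIEEVL
```

The printed peptide matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.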