Gene Rru_A1394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A1394 
Symbol 
ID3834809 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp1645052 
End bp1646611 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content60% 
IMG OID637825484 
Productnitrogenase iron-iron protein, alpha chain 
Protein accessionYP_426482 
Protein GI83592730 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01284] nitrogenase alpha chain
[TIGR01861] nitrogenase iron-iron protein, alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCACC ATACCTTCAA ATGCAGCGAA TGCATCCCCG AGCGCACCAA GCACGCGGTG 
ATCAAGGGCG CCGACGAGGA CCTGACCTCG TGCCTGCCGC TGGGTTATCT CAATACCATT
CCGGGATCGA TCTCCGAGCG CGGCTGCGCC TATTGCGGCG CCAAGCACGT GATCGGCCAG
CCGATGAAGG ACGTCATCCA TATCAGTCAC GGACCGATCG GATGCACCTA CGACACCTGG
CAGACCAAGC GTTATATCAG CGATAACAAC AACTTCCAGC TCAAGTACAC CTATGCCACC
GATGTTCGGG AAAAGCACAT CGTCTTTGGC GCCGAGAAGC TTCTCAAGCA GAACATCCTC
GAAGCCTTCA AGGCGTTTCC CGACATCAAG CGCATGACCA TCTACCAGAC CTGCGCCACG
GCGCTGATCG GCGATGACAT CGACGCCATC GCCTCGGAAG TGATGGACGA GTTGCCCGAT
GTCGACATCT TCGTCTGCAA CTCGCCGGGC TTCGCCGGGC CCAGCCAGTC GGGCGGCCAC
CACAAGATCA ATATCGCCTG GGTCAACCAG AAGGTCGGCA CCGTCGAGCC CGAGATCACC
AGCGATTACG TCATCAACTA TGTTGGTGAG TATAATATCC AGGGCGATCA GGAAGTTATG
CAGGATTACT TCAATCGCAT GGGCATCCAG ATCCTGTCGA CCTTCACCGG CAACGGATCC
TATGACGGCC TGCGGGCGAT GCACCGCGCC CATCTCAATG TGCTCGAATG CGCCCGTTCG
GCCGAATACA TCTGCAATGA ATTGCGCGTG CGCTATGGCA TCCCGCGCCT TGATATCGAC
GGCTTCGGCT TCGAACCGCT GTCGGACTCG CTGCGCAAGA TCGGCCTGTT CTTCGGCATC
GAAGACCGCG CCCAGGCGAT CATCGACGAA GAAACCGCCA AGTGGAAACC CCAGCTTGAT
TGGTACAAGG AACGCCTGCG CGGCAAGAAG GTCTGCCTGT GGCCGGGCGG CTCCAAGCTT
TGGCATTGGG CCCATGTCAT CCAGGAGGAA ATGGGCCTCA ACGTCGTGTC GCTCTACACC
AAATTCGGCC ATCAGGGCGA TATGGAAAAG GGCATCGCGC GCTGCGGCGA AGGCGCCCTG
GCCATCGACG ATCCCAACGA GCTTGAAGGC CTGGAAGCCC TGGAGATGCT CAAGCCCGAC
ATCATCTTGA CGGGCAAGCG CCCGGGCGAG GTCGCCAAGA AAGTGCGCGT TCCCTATCTC
AACGCCCACG CCTATCACAA CGGCCCCTAT AAGGGCTACG AGGGCTGGGT GCGCTTCGCC
CGCGATATCT ACAACGCCAT CTATTCGCCG ATCTTCCAGC TGTCGGCCCT CGATATCAGC
AAGGACCCGA TCCCGACCGA CCAGGGCTTC CTGACGCCGC AGATGATCTC CGATCCGGCC
CTGCCCGCCG AGGTGCGGTC TTCGACCGTG CTGACCCCCT ATCGCGGCGC TTACGACACC
ATTTCCGCCC TGCGCGAGAA GACCTATCCG CGCTTCGATG CCGTTCCTGT CGCCCAATAA
 
Protein sequence
MPHHTFKCSE CIPERTKHAV IKGADEDLTS CLPLGYLNTI PGSISERGCA YCGAKHVIGQ 
PMKDVIHISH GPIGCTYDTW QTKRYISDNN NFQLKYTYAT DVREKHIVFG AEKLLKQNIL
EAFKAFPDIK RMTIYQTCAT ALIGDDIDAI ASEVMDELPD VDIFVCNSPG FAGPSQSGGH
HKINIAWVNQ KVGTVEPEIT SDYVINYVGE YNIQGDQEVM QDYFNRMGIQ ILSTFTGNGS
YDGLRAMHRA HLNVLECARS AEYICNELRV RYGIPRLDID GFGFEPLSDS LRKIGLFFGI
EDRAQAIIDE ETAKWKPQLD WYKERLRGKK VCLWPGGSKL WHWAHVIQEE MGLNVVSLYT
KFGHQGDMEK GIARCGEGAL AIDDPNELEG LEALEMLKPD IILTGKRPGE VAKKVRVPYL
NAHAYHNGPY KGYEGWVRFA RDIYNAIYSP IFQLSALDIS KDPIPTDQGF LTPQMISDPA
LPAEVRSSTV LTPYRGAYDT ISALREKTYP RFDAVPVAQ