Gene Information |
Locus tag | Rru_A0795 |
Symbol | |
ID | 3834428 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | - |
Start bp | 949402 |
End bp | 950277 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637824886 |
Product | nitrogenase iron protein |
Protein accession | YP_425886 |
Protein GI | 83592134 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1348] Nitrogenase subunit NifH (ATPase) |
TIGRFAM ID | [TIGR01287] nitrogenase iron protein |
|
|
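The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS is complete and ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 949402
END_BP = 950277
GENE_LENGTH_BP = 876
PROTEIN_LENGTH_AA = 291


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the CDS ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # compare with Gene Length
print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions both computed values agree with the table: the span is 876 bp, and 876 / 3 = 292 codons, one of which is the stop codon, leaving 291 residues.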
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.835028 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCAAAA GTCCCAAACA AATCGCCATC TATGGCAAAG GTGGCATCGG CAAATCGACC ACCACCTCGA ATATCAGCGC CGCCCTGGCC GAGGCCGGCT ACAAGGTGAT GCAGTTCGGC TGCGACCCCA AAAGCGATTC GACCAATACC CTGCGCGGCG GCGATTACAT CCCCTCGGTG CTCGACCTGC TGCGCGAGAA CGCCCGCGTC GATGCCCATG AGGCGATCTT CCAGGGCTTT GGCGGCATCT ATTGCGTTGA AGCCGGTGGT CCGGCGCCAG GCGTCGGCTG CGCCGGTCGC GGCATCATCA CCGCCGTCGA ACTGCTCAAG CAGCAGAACG TCTTCGAAGA GCTCGATCTT GATTACGTGA TCTTCGACGT GCTGGGCGAC GTGGTCTGCG GCGGCTTCGC CGTGCCGATC CGTGAAGGCA TCGCCGAACA TGTCTTCACC GTGTCGTCGT CGGATTTCAT GGCGATCTAT GCCGCGAACA ATCTGTTCAA GGGCATTCAG AAGTACTCCA ACGCCGGGGG CGCCCTGCTT GGCGGGGTGA TCGCCAATTC GATCAACACC GATTTCCACC GGGACATCAT CGACGATTTC GTCGCCCGCA CCCAGACCCA GGTCGTCCAA TACGTGCCGC GCTCGCTGAC CGTCACCCAG GCCGAACTGC AGGGCCGCAC GACGATCGAG GCGGCGCCCG AGTCCGCCCA GGCCGAGATC TATCGGACCC TGGCGCGCAG CATCGCCGAC CATACGGACT CGAAGGTGCC GACCCCGCTT AACGCCCAAG AGCTGCGCGA CTGGTCGGCA TCCTGGGCCA ACCAATTGAT CGAGATCGAA CGGGCGAGCC AGCCGATTCC CGCCCTGGCC TCATAA
|
Protein sequence | MAKSPKQIAI YGKGGIGKST TTSNISAALA EAGYKVMQFG CDPKSDSTNT LRGGDYIPSV LDLLRENARV DAHEAIFQGF GGIYCVEAGG PAPGVGCAGR GIITAVELLK QQNVFEELDL DYVIFDVLGD VVCGGFAVPI REGIAEHVFT VSSSDFMAIY AANNLFKGIQ KYSNAGGALL GGVIANSINT DFHRDIIDDF VARTQTQVVQ YVPRSLTVTQ AELQGRTTIE AAPESAQAEI YRTLARSIAD HTDSKVPTPL NAQELRDWSA SWANQLIEIE RASQPIPALA S
|
| |
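The gene and protein sequences above can be related with a short translation sketch. This is an illustration only, not the pipeline that produced the record: it assumes the gene sequence is shown in coding orientation (although the gene lies on the minus strand), uses only the first 30 bases of the record as an excerpt, and builds the codon table from the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code. Alternative start codons, which table 11 permits, are not handled because this gene begins with ATG. All function and variable names are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# First 30 bases of the gene sequence, copied with the record's 10-base spacing.
GENE_PREFIX = "ATGGCCAAAA GTCCCAAACA AATCGCCATC"


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence and uppercase it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


print(translate(clean(GENE_PREFIX)))  # compare with the start of the protein sequence
```

Pasting the full gene sequence into `clean` and passing the result to `translate` and `gc_content` allows a comparison with the Protein sequence and GC content fields of the record.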