Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_4683 |
Symbol | |
ID | 3972389 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 5241486 |
End bp | 5243060 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637927795 |
Product | nitrogenase alpha chain |
Protein accession | YP_534524 |
Protein GI | 90426154 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01284] nitrogenase alpha chain [TIGR01861] nitrogenase iron-iron protein, alpha chain [TIGR01862] nitrogenase component I, alpha chain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTATC ATGAGTTCGA AGTCAGCAAA TGCATCCCCG AGCGCAAGCA GCACGCGGTC GTCAAGGGCC CGGGGGAGGA TCTGACCTCC TGCCTGCCGA AGGGATATCT CAACACCATC CCTGGCTCGA TCTCCGAACG CGGCTGCGCC TATTGCGGCG CCAAGCACGT CATCGGCACG CCGATGAAGG ACGTGATCCA TCTCAGTCAC GGCCCGGTCG GCTGCACCTA CGACACCTGG CAGACCAAAC GCTATATCAG CGATAATAAC GACTATCAGC TGAAATACAC CTTCGCTTCC GACGTGAAGG AGAAGCACAT CGTATTCGGC GCCGAGAAGC TGTTGAAGCA GAACATCCTC GAGGCGTTCA AAGCGTTTCC GACTATGAAG CGCATGACCA TCTACCAGAC CTGCGCCACC GCCTTGATCG GCGACGACGT CAACGCCATC GCCGCCGAGG TGATGGAGGA ACTGCCCGAC GTCGATATCT TCGTCTGCAA CTCGCCGGGT TTCGCAGGCC CCAGCCAATC CGGCGGCCAT CACAAGATCA ACATCGCTTG GCTGAACCAA AAGGTCGGCA CCGTCGAGCC GAAGATCACC GGCGACTACG TCATCAATTA CGTCGGCGAG TACAACATCC AAGGTGACCA GGAAGTCATG ATCGACTTCT TCAAGCGTAT GGGCATCCAG GTGCTGTCGA CCTTCACCGG CAACGGCTCC TACGACGATC TGCGCAGCAT GCACGGCGCC CATCTCAACG TGCTGGAATG CGCCCGCTCG GCCGAATACA TCTGCGACGA ATTGCGGATG CGCTACGGCA TTCCGCGGCT CGACATCGAC GGCTTCGGCC ACAAGGCGCT CGGCGACAGC TTGCGCAAGG TCGGCCTGTT CTTCGGGATC GAAGACCGCG CCGAGGCGAT CATCGCCGAG GAGACCGCCA AATGGGGTCC CGAACTCGCC TGGTACAAGG AGCGCCTGCA AGGCAAGAAG GTGTGCCTGT GGCCGGGCGG CTCCAAGCTC TGGCACTGGG CCCATGCCAT CCAGGAAGAA ATGGGCGTCC AGGTGGTCTC GGTCTACACC AAGTTCGGCC ATCAGGGCGA CATGGAAAAG GGCGTCTCGC GTTGCGGCGA AGGCGCGCTG GCGATCGATG ACCCCAACGA ACTCGAAAAT CAGGAAGCGC TGAAGACCCT GAAGCCGGAC GTGATCTTCA CCGGCAAGCG GCCGGGCGAA GTCGCCAAGA AGATGCGGGT GCCCTATCTC AACGCCCATG CTTACCACAA CGGCCCCTAC AAGGGCTGGG AGGGCTGGGT GCGCTTTGCC CGCGACATCT ACAATGCGAT CTATTCGCCG ATGCATCAGC TGTCGGCGAT CGACATCTCC AAGGACGACT ACGCGACCGA CAAGGGCTTC ACCACCCGGC GCATGCTGTC CGATGCCAAT CTCTCCGACG AGTCAAAGGC GTCGCCGATG ACCGGCTATT CCGGCAAGTT CGACCCGATC GCCGCCATCC GCGCCAAGAC CGCCGCCGAC TATCCGGTGT TTCCGCGTCG TAGCGTCACC GAAGCCGCCG AGTAG
|
Protein sequence | MPYHEFEVSK CIPERKQHAV VKGPGEDLTS CLPKGYLNTI PGSISERGCA YCGAKHVIGT PMKDVIHLSH GPVGCTYDTW QTKRYISDNN DYQLKYTFAS DVKEKHIVFG AEKLLKQNIL EAFKAFPTMK RMTIYQTCAT ALIGDDVNAI AAEVMEELPD VDIFVCNSPG FAGPSQSGGH HKINIAWLNQ KVGTVEPKIT GDYVINYVGE YNIQGDQEVM IDFFKRMGIQ VLSTFTGNGS YDDLRSMHGA HLNVLECARS AEYICDELRM RYGIPRLDID GFGHKALGDS LRKVGLFFGI EDRAEAIIAE ETAKWGPELA WYKERLQGKK VCLWPGGSKL WHWAHAIQEE MGVQVVSVYT KFGHQGDMEK GVSRCGEGAL AIDDPNELEN QEALKTLKPD VIFTGKRPGE VAKKMRVPYL NAHAYHNGPY KGWEGWVRFA RDIYNAIYSP MHQLSAIDIS KDDYATDKGF TTRRMLSDAN LSDESKASPM TGYSGKFDPI AAIRAKTAAD YPVFPRRSVT EAAE
|
| |