Gene Information |
Locus tag | Rpal_1622 |
Symbol | |
ID | 6409279 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 1735753 |
End bp | 1737321 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 642711511 |
Product | nitrogenase alpha chain |
Protein accession | YP_001990626 |
Protein GI | 192290021 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01284] nitrogenase alpha chain; [TIGR01861] nitrogenase iron-iron protein, alpha chain; [TIGR01862] nitrogenase component I, alpha chain |
|
|
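The Start bp, End bp, Gene Length, and Protein Length fields above agree with one another, and a little arithmetic confirms it. The sketch below assumes that the coordinates are 1-based and inclusive (the record does not state its convention) and that the CDS ends with a single stop codon. The function names are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(1735753, 1737321)
print(length_bp)                  # 1569
print(protein_length(length_bp))  # 522
```

Under these assumptions, both results match the Gene Length (1569 bp) and Protein Length (522 aa) rows.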
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATATC ACGAGTTCGA CTGTTCGAAA TGTCTGCCCG AACGCCAGAA GCACGCCGTC ACCAAGGGCG CCGGCGATAA TCTCGCCACC GCGCTGCCGC TCGGCTATCT CAACACGATC CCGGGATCGA TCTCGGAGCG CGGCTGCGCC TATTGCGGCG CAAAGCATGT GATCGGCACG CCGATGAAGG ACGTGATTCA CATGAGTCAC GGCCCCGTCG GCTGCACCTA CGACACCTGG CAGACCAAGC GCTATATCTC CGACAACAAC AACTTCCAGC TCAAATACAC CTTCGCCACC GACGTCCGCG AGAAGCACAT CGTATTCGGC GCCGAAGGCC TTCTGAAGCA GAACATCATC GAGGCTTTCA AAGCCTGCCC CGACATCAAG CGGATGACGA TCTACCAGAC CTGCGCCACC GCGCTGATCG GCGACGACAT CAATGCTGTC GCCGCCGAGG TGATGGAGGA GATGCCGGAC GTCGACATCT TCACCTGCAA CTCGCCAGGC TTCGGCGGCC CCAGCCAGTC CGGCGGCCAC CACAAGATCA ACATCGCCTG GATCAACGAC AAGGTCGGCA CCGTCGAGCC CGAGATCACC TCGGATTACG TCATCAACTA TGTGGGCGAA TACAACATCC AGGGCGACCA GGAGGTGATG CTCGACTATT TCACCCGGAT GGGTATCCAG GTGCTGTCGA CCTTCACCGG CAATGGCACC TATGACGGCC TGCGGGCGAT GCACCGCGCG CATCTCAACG TGCTCGAATG CGCCCGTTCG GCCGAATACA TCTGCAACGA GCTGCGCGTC CGCTACGGGA TTCCGCGGCT CGACATCGAC GGCTTCGGCT TCGAGCCGTT GTCGCAGTCG CTGCGCAAGA TCGGGATGTT CTTCGGCATC GAAGACCGCG CCGAAGCGAT CATCGCCGAA GAGACCGCGC GCTGGAAGCC GGAGCTCGAC TGGTACAAGG AACGGCTGAA GGGCAAAAAG GTCTGCCTGT GGCCAGGCGG CTCCAAGCTG TGGCACTGGG CGCACGCCAT CGAAGAGGAG ATGGGCGTCA AGGTCGTCTC GGTCTACACT AAGTTCGGCC ACCAGGGCGA CATGGAAAAG GGCATCGCGC GCTGCGGCGA GGATGCGCTG GCGATCGACG ATCCCAACGA ACTCGAGGGC CTCGAGGCGC TGGAGAAGCT GCAGCCGGAC ATCATCTTCA CCGGCAAGCG TCCCGGCGAA GTCGCCAAGA AGGTCCGCGT TCCGTACCTC AACGCCCACG CCTATCACAA CGGCCCATAC AAGGGCTTCG AAGGCTGGGT GCGGTTCGCC CGCGACATCT ACAACGGCAT CTACTCGCCG ATGCACCAGC TCTCCGGGCT GGACATCAGC AAGGACGAGA TTCCGGCCGA TCGCGGTTTC GTCACGCAGC GCATGCTGTC CGACGCGAAG CTGCCGGAAG AGATCGCCAA GTCGGAGACG CTGCGGCGCT ACACCGGCAA GGACGACATC ATCTCCGACC TGCGCAAGAA GAACGCGCCC TACTTCACCC CGATCGTCAA AGCCGAAGCG GCCGAGTGA
|
Protein sequence | MPYHEFDCSK CLPERQKHAV TKGAGDNLAT ALPLGYLNTI PGSISERGCA YCGAKHVIGT PMKDVIHMSH GPVGCTYDTW QTKRYISDNN NFQLKYTFAT DVREKHIVFG AEGLLKQNII EAFKACPDIK RMTIYQTCAT ALIGDDINAV AAEVMEEMPD VDIFTCNSPG FGGPSQSGGH HKINIAWIND KVGTVEPEIT SDYVINYVGE YNIQGDQEVM LDYFTRMGIQ VLSTFTGNGT YDGLRAMHRA HLNVLECARS AEYICNELRV RYGIPRLDID GFGFEPLSQS LRKIGMFFGI EDRAEAIIAE ETARWKPELD WYKERLKGKK VCLWPGGSKL WHWAHAIEEE MGVKVVSVYT KFGHQGDMEK GIARCGEDAL AIDDPNELEG LEALEKLQPD IIFTGKRPGE VAKKVRVPYL NAHAYHNGPY KGFEGWVRFA RDIYNGIYSP MHQLSGLDIS KDEIPADRGF VTQRMLSDAK LPEEIAKSET LRRYTGKDDI ISDLRKKNAP YFTPIVKAEA AE
|
| |
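The protein sequence above can be reproduced from the gene sequence by translating it codon by codon. The gene lies on the minus strand, but the sequence as listed starts with ATG and ends with TGA, so it is already in coding orientation and needs no reverse complement. The sketch below is a minimal pure-Python translator, not the database's own method. It uses the standard amino acid assignments, which translation table 11 shares for all internal codons. Table 11 differs only in which codons may act as starts, and the start here is ATG. The names `translate` and `CODON_TABLE` are illustrative, and alternative start codons are not handled.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk complete codons only; any trailing 1-2 bases are ignored.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGCCATATC ACGAGTTCGA CTGTTCGAAA"))  # MPYHEFDCSK
# Last 9 bases of the gene sequence, ending in the TGA stop codon.
print(translate("GCCGAGTGA"))  # AE
```

The two outputs match the first ten and last two residues of the protein sequence in the record. Passing the full gene sequence to `translate` in the same way would give the complete 522-residue comparison.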