Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_2592 |
Symbol | |
ID | 6410254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 2801976 |
End bp | 2803511 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642712470 |
Product | Nitrogenase |
Protein accession | YP_001991580 |
Protein GI | 192290975 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.181318 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCATTA ATCTTCGTAG TCCGTCCGCC GAGACCCGCG AACAGCGTCT TGGTACCATC ACCTCATGGG ATGGCCCCGC CTCCCGCCTT GCCAACGAGT CCGCCTTCTC CCGCGGCGAT TGCGGCCCCG GATGCGGCGG CAAGGGCCAG CGCATCTGCG AATTGGAGTC GCCGTTCTCG CAGGGCTCGA CCTGCTCCGA GCAGATCGCC GAGAGCCAGG CCAGCAACGT CCGCGACGCA GTACTGGTGC AGCATTCGCC GATCGGCTGC GCAGCCGGTC ATATCAACAG CAACGCGTTC TTCCGCAACG GCCTGATGCG CCGCGGCCTG GCTCCGCGCA ACGTCACCGG CCTGTCGAGC AATCTGATCG AACGCGACAT GATCTATGGC GGCGCCGAGA AGCTGCGCGC CACGATCAAG GCCGCGGTCG AGCGACACCA GCCCAAGGCG GTGTTCGTCG CCACCTCCTG TGCCACCGGC ATCATCGGCG ACGACGTCGA GAGCGTGGTG CGCGACTGCG AGGACGAGCT CGGTGTCCCG GTGGTGGCGA TGTATTGCGA GGGCTTCAAG TCCAAGCATT GGAGCTCAGG CTTCGACGCG ATCCAGCACG GCGTGCTGCG CCATGTCGTC AAGCCCAAGC CGGGCCCGCG GCAGGACAAC CTCGTCAATA TCATCGTGCT GTGGGGCAGC GACGTGTTCA CGCCGATGCT GGCGGAACTG GGCCTCGAAG CCAACAACAT CCTGACCGTC GCCTCGGTCG ACGAGATCGC GCGCGCCTCG GAAGCCGCCG CATCGGCGAC CTTCTGCTAC TCGGTCGGTG GCTATCTCGG CGCCGCGCTG GAGGAGCAAT ACGGCGTGCA GGAGATCAAG GCGCCGCAGC CTTACGGCTT CGACGGCACC GACGCATGGC TGCGCGCGCT GGCCAAGGCC ACCGGCCGGG AAGAACGCGC CGAAGCCTAT ATCGCCCGCG AGCACGCCCG CGTCCGACCG CAAATCGAGC GGTTGCGCGA GCGCCTCAAG GGCGTGCGCG GCTACGTCGC GATGGGAGCG GCCTATGCCC ACGGCGTCAT CGGGGTGCTG CGCGAACTCG GCGTCGAGGT GCCCGGCTCG CTCGTGTTCC ATCACGACCC GGTCTATGAC AGCCACGACG TCCGCCAGGA TACGCTCGGC CACATGATCG ACGCCTATGG CGATGTCGAG CACTTCAGCA TCTCCAATCG CCAGCCGTAT CAGTTCTACA ATCTGCTGAA GCGCTCCGAT CCGGATTTCA TCATCATCCG CCACGGGGGC CTCGCCGGCC TCGCCTCGCG CCTCGGCGTG CCGGCGATCG CGCTCGGCGA TGCGTCGACC GCGATCGGCT ATCAGGGCAT GATCGATCTC GGCGAGGAAA TCGTCGATGC GCTGGCGCAG CGCAAATTCC ATCAGGACCT CGCCGGCCAC ACCAGCCTGC CCTACAAGCA ATGGTGGCTG GATCAAGCCG ATCCGTATCT GCTCGCCCGG CAGAACGCCC CGCAAGCCGC CGGAAGTCGC GTCTGA
|
Protein sequence | MAINLRSPSA ETREQRLGTI TSWDGPASRL ANESAFSRGD CGPGCGGKGQ RICELESPFS QGSTCSEQIA ESQASNVRDA VLVQHSPIGC AAGHINSNAF FRNGLMRRGL APRNVTGLSS NLIERDMIYG GAEKLRATIK AAVERHQPKA VFVATSCATG IIGDDVESVV RDCEDELGVP VVAMYCEGFK SKHWSSGFDA IQHGVLRHVV KPKPGPRQDN LVNIIVLWGS DVFTPMLAEL GLEANNILTV ASVDEIARAS EAAASATFCY SVGGYLGAAL EEQYGVQEIK APQPYGFDGT DAWLRALAKA TGREERAEAY IAREHARVRP QIERLRERLK GVRGYVAMGA AYAHGVIGVL RELGVEVPGS LVFHHDPVYD SHDVRQDTLG HMIDAYGDVE HFSISNRQPY QFYNLLKRSD PDFIIIRHGG LAGLASRLGV PAIALGDAST AIGYQGMIDL GEEIVDALAQ RKFHQDLAGH TSLPYKQWWL DQADPYLLAR QNAPQAAGSR V
|
| |