Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0972 |
Symbol | |
ID | 3909327 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1119361 |
End bp | 1120827 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637882865 |
Product | nitrogenase molybdenum-cofactor biosynthesis protein NifE |
Protein accession | YP_484593 |
Protein GI | 86748097 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGTC TCGCAGACAA GATCCAAGAT GTCTTCAACG AGCCCGGCTG TGCGGACAAT CAGGCCAAGT CCGAGAAGCA ACGCAAGAAG GGCTGCAGCA AACCGCTGCA GCCCGGCGGC GCGGCCGGCG GCTGCGCCTT CGACGGCGCC AAGATCGCGC TGCAGCCGAT CGTCGACGTC GCCCATCTGG TGCACGGCCC GATCGCCTGC GAAGGCTCGT CCTGGGACAA TCGCGGCACC AAATCGTCGG GCTCGAAGCT GTATCGCACC GGCTTCACCA CCGACATGAG CGAGAACGAC GTGGTGTTCG GCGGCGAGAA GCGGCTGTTC CGGTCGATCA GGGAAATCAT CGAGAAGTAC GACCCGCCCG CGGTGTTCGT GTATCAGACC TGCGTGCCGG CGATGATGGG CGACGACATC GTCGCGGTCT GTAAGGTGGC GGCCGAGAAA TTCGGCAAGC CCTGCATCCC GATCATCGCG CCGGGCTTCG TCGGCCCGAA GAATCTCGGC AACAAGCTCG CCGGCGAGGC GATGCTCGAC TACGTGATCG GCACGCAGGA GCCGGAGGTC ACCACCCCCT ACGACATCAA CATCATCGGC GAATACAACG TCGCCGGCGA ATTGTGGCAG GTCAAGCCGC TGCTCGACGA GCTCGGCATC CGCATCCTGT CCTGCCTGTC GGGCGACGCG CGCTATCACG AGGTCGCGCA GTCGCATCGC GCCCGCGCCG CCATGATGGT GTGCTCGACC GCGATGATCA ACGTCGCCCG CAAGATGCAG GAGCGCTACG GCATTCCGTA TTTCGAGGGC TCGTTCTACG GCATCACCGA CACCTCGGAT TCGCTGCGGC AGATCGCACG GCTGCTGATC GCGCGCGGCG CCGATGCCGA GCTGATGGAC CGCGTCGAGG CGCTGATCGC GCGCGAGGAG GCCAAGGCCT GGGCCGCCAT CAAGGCCTAT ACGCCGCGGC TCGCAGGCAA GAAAGTGCTG CTGATCACCG GCGGCGTGAA GTCGTGGTCG GTGGTGCTGG CGCTGCAGGA AGCCGGGCTC ACCATCGTCG GCACCAGCGT CAAGAAATCG ACCAAGGAGG ACAAGGAGCG GCTCAAGGAG ATGAGTCCCG ACGTCCATCT GATCGACGAT CTGCGGCCGC GCGAAATGTA CAAGATGCTG AAAGAGGCGC AGGCCGACAT AATGTTGTCC GGCGGCCGCT CGCAATTCGT CGCGCTGAAG GCGCGGATGC CCTGGATGGA TATCAACCAG GAACGCTCTT ACGCGTATTG CGGCTATGTC GGCATTGTCG AGATGGTGCG GCAGATCGAC AAGGCGCTGT CGAACCCGAT CTGGCAGCAG GTCCGCTCGG CGCCGCCGTG GGACGAAGTG AGCTGGGAGA CCCGCGCCGA CGCCGCCAAT GCCGCCGACG ACGCACAGCG CGCCGCCGAG GCCGCACCCG CCGCAAAGGT CGCGTGA
|
Protein sequence | MSRLADKIQD VFNEPGCADN QAKSEKQRKK GCSKPLQPGG AAGGCAFDGA KIALQPIVDV AHLVHGPIAC EGSSWDNRGT KSSGSKLYRT GFTTDMSEND VVFGGEKRLF RSIREIIEKY DPPAVFVYQT CVPAMMGDDI VAVCKVAAEK FGKPCIPIIA PGFVGPKNLG NKLAGEAMLD YVIGTQEPEV TTPYDINIIG EYNVAGELWQ VKPLLDELGI RILSCLSGDA RYHEVAQSHR ARAAMMVCST AMINVARKMQ ERYGIPYFEG SFYGITDTSD SLRQIARLLI ARGADAELMD RVEALIAREE AKAWAAIKAY TPRLAGKKVL LITGGVKSWS VVLALQEAGL TIVGTSVKKS TKEDKERLKE MSPDVHLIDD LRPREMYKML KEAQADIMLS GGRSQFVALK ARMPWMDINQ ERSYAYCGYV GIVEMVRQID KALSNPIWQQ VRSAPPWDEV SWETRADAAN AADDAQRAAE AAPAAKVA
|
| |