Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1076 |
Symbol | |
ID | 4021552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 1228173 |
End bp | 1229639 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637961268 |
Product | nitrogenase molybdenum-cofactor biosynthesis protein NifE |
Protein accession | YP_568215 |
Protein GI | 91975556 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.57688 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGCC TCGCAGACAA GATTCAAGAT GTCTTCAACG AGCCCGGCTG CGCGGACAAT CAGGCCAAGT CCGAGAAACA GCGCAAGAAG GGCTGCAGCA AACCGCTGCA GCCCGGCGGC GCGGCCGGTG GCTGCGCTTT CGACGGCGCC AAGATCGCGC TGCAGCCGAT CGTCGACGTC GCCCATCTGG TGCACGGCCC GATCGCTTGC GAAGGCTCGT CCTGGGACAA TCGCGGCACC AAGTCGTCGG GCTCGAAGCT GTATCGCACC GGCTTCACCA CCGACATGGG CGAGAACGAC GTGGTGTTCG GCGGCGAGAA GCGGCTGTTC CGGTCGATCA GGGAGATCAT CGAAAAGTAT CATCCGCCCG CGGTGTTCGT GTATCAGACC TGCGTGCCGG CGATGATGGG CGATGACATC GTCGCGGTCT GCAAGGTGGC GACCGAGAAG CTCGGCACGC CCTGCGTTCC GATCATCGCG CCGGGCTTCG TCGGCCCGAA GAATCTCGGC AACAAGCTCG CCGGCGAAGC AATGCTCGAC TACGTGATCG GCACCCAGGA GCCCGAGGTC ACGACGCCCT ACGACATCAA CATCATCGGC GAATACAACG TCGCCGGCGA ATTGTGGCAG GTCAAGCCGC TGCTCGACGA GCTCGGCATC CGCATCCTGT CGTGCCTGTC GGGCGACGCG CGCTATCATG AGGTGGCGCA GTCGCATCGC GCCCGCGCCG CCATGATGGT GTGCTCGACC GCGATGATCA ACGTCGCCCG CAAGATGGAA GAGCGCTACG GCATTCCGTA TTTCGAAGGC TCGTTCTACG GCATCACCGA CACCTCGGAT TCGCTGCGGC AGATCGCGCG GTTGCTGATC CAGCGCGGCG CCGATGCCGA GCTGATGGAC CGCGTCGAGG CGCTGATCGC GCGCGAGGAG GCGAAGGCCT GGGCCGCCAT CAAGGCCTAT ACGCCGCGGC TCGAAGGCAA GAAAGTGCTG CTGATCACCG GCGGCGTGAA GTCGTGGTCG GTGGTGCTGG CGCTGCAGGA GGCCGGGCTC ACCATCGTCG GCACCAGCGT CAAGAAGTCG ACCAAGGAGG ACAAGGAGCG GCTCAAGGAG ATGAGCCCCG ACGTCCATCT GATCGACGAT CTGCGGCCGC GCGAAATGTA CAAGATGCTG AAAGAGGCGC AGGCCGACAT CATGCTGTCC GGCGGGCGCT CGCAATTCGT CGCGCTGAAG GCGCGGATGC CCTGGATGGA TATCAACCAG GAGCGTACTT ACGCCTATTG CGGCTATGTC GGCATTGTCG AAATGGTGCG GCAGATCGAC AAGGCGCTGT TCAACCCGAT CTGGGAGCAG GTCCGCTCCG CCCCGCCGTG GGAGGAGATC AGCTGGGAAA CCCGCGCCGA CGCCGCCAAT GCGGCCGACG ACGCGCAGCG CGCCGCCGAG GCGACGCCTG CCGCAAAGGT TGCGTGA
|
Protein sequence | MSRLADKIQD VFNEPGCADN QAKSEKQRKK GCSKPLQPGG AAGGCAFDGA KIALQPIVDV AHLVHGPIAC EGSSWDNRGT KSSGSKLYRT GFTTDMGEND VVFGGEKRLF RSIREIIEKY HPPAVFVYQT CVPAMMGDDI VAVCKVATEK LGTPCVPIIA PGFVGPKNLG NKLAGEAMLD YVIGTQEPEV TTPYDINIIG EYNVAGELWQ VKPLLDELGI RILSCLSGDA RYHEVAQSHR ARAAMMVCST AMINVARKME ERYGIPYFEG SFYGITDTSD SLRQIARLLI QRGADAELMD RVEALIAREE AKAWAAIKAY TPRLEGKKVL LITGGVKSWS VVLALQEAGL TIVGTSVKKS TKEDKERLKE MSPDVHLIDD LRPREMYKML KEAQADIMLS GGRSQFVALK ARMPWMDINQ ERTYAYCGYV GIVEMVRQID KALFNPIWEQ VRSAPPWEEI SWETRADAAN AADDAQRAAE ATPAAKVA
|
| |