Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_5098 |
Symbol | |
ID | 6412792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5482067 |
End bp | 5483530 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642714983 |
Product | nitrogenase molybdenum-cofactor biosynthesis protein NifE |
Protein accession | YP_001994062 |
Protein GI | 192293457 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCGCC TCGCAGACAA GATTCAAGAC GTCTTCAACG AGCCCGGCTG TGCGGCCAAT CAGGCCAAGT CCGACAAGCA ACGCAAGAAG GGCTGCAGCA AACCGCTGCA GCCGGGGGGC GCAGCCGGTG GCTGCGCCTT CGACGGCGCC AAGATCGCGC TGCAGCCGAT CGTCGACGTC GCGCACCTGG TGCACGGCCC GATCGCCTGC GAAGGCTCGT CCTGGGACAA TCGCGGCACC AAGTCGTCCG GCTCGAAGCT GTATCGCACC GGCTTCACCA CCGACATGGG CGAGAACGAT GTGATCTTCG GCGGCGAGAA GCGGCTGTTT AGGTCGATTC GCGAGATCAT CGAGAAGTAC GATCCGCCGG CCGTGTTCGT GTATCAGACC TGTGTTCCGG CGATGATGGG CGACGACATC GTCGCCGTCT GCAAGGTCGC CTCCGAGAAA TTCGGCAAGC CCTGCGTTCC GATCATCTCT CCCGGCTTCG TCGGCCCGAA GAATCTCGGC AACAAGCTCG CCGGCGAGGC GATGCTCGAT TACGTGATCG GCACTCAGGA GCCGGAGTTC ACGACCCCCT ACGACATCAA CATCATCGGC GAATACAACG TCGCGGGCGA ATTGTGGCAG GTGAAGCCGC TGCTCGACGA ACTCGGGATC CGCATTCTGT CGTGCCTGTC GGGCGATGCG CGCTATCACG AAGTGGCGCA ATCGCACCGC GCCCGCGCCG CCATGATGGT GTGCTCGACC GCGATGATCA ATGTCGCGCG CAAGATGGAA GAGCGCTACG GCATCCCGTA TTTCGAAGGC TCGTTCTACG GCATCAGCGA CACCTCCGAG TCGCTTCGCC AGATTGCGCG GTTGCTGATC GCCCGCGGCG CGCCGGACGA GCTGATGGCC CGCACCGAGG CGCTGATCGC CCGCGAAGAG GCCAAGGCCT GGGCGGCGAT CAAGGCCTAC ACCCCGCGGC TGGAAGGCAA GAAGGTGCTG CTGATCACCG GCGGCGTAAA GTCGTGGTCG GTGGTGGCGG CGCTGCAGGA AGCGGGGCTG TCCATCGTCG GCACCAGCGT CAAGAAGTCG ACCAAGGAAG ACAAGCTGCG GCTCAAGGAG ATGAGCCCGG ACGTCCACCA GATCGACGAT CTGCGCCCAC GCGAAATGTA CAAGATGCTC AAAGATGCGC AGGCCGACAT CATGCTGTCG GGCGGCCGCT CGCAGTTCGT CGCCTTGAAG GCGCGGATGC CCTGGATGGA TATCAACCAG GAGCGCACCT ACGCGTATTG CGGCTATGTC GGCATCGTCG AGATGGTTCG GCAGATCGAC AAATCGCTGT CCAATCCGAT CTGGGCTCAG GTGCGCAGCG CCCCGCCGTG GGACGAGGTC ACCTGGGAGC AGCGCGCGGA CGCCGCCAAC GCCGCCGACG ATCGCCAACG CGCGATCTTC GGGCGTTCGG CTCGAGTGGC GTGA
|
Protein sequence | MSRLADKIQD VFNEPGCAAN QAKSDKQRKK GCSKPLQPGG AAGGCAFDGA KIALQPIVDV AHLVHGPIAC EGSSWDNRGT KSSGSKLYRT GFTTDMGEND VIFGGEKRLF RSIREIIEKY DPPAVFVYQT CVPAMMGDDI VAVCKVASEK FGKPCVPIIS PGFVGPKNLG NKLAGEAMLD YVIGTQEPEF TTPYDINIIG EYNVAGELWQ VKPLLDELGI RILSCLSGDA RYHEVAQSHR ARAAMMVCST AMINVARKME ERYGIPYFEG SFYGISDTSE SLRQIARLLI ARGAPDELMA RTEALIAREE AKAWAAIKAY TPRLEGKKVL LITGGVKSWS VVAALQEAGL SIVGTSVKKS TKEDKLRLKE MSPDVHQIDD LRPREMYKML KDAQADIMLS GGRSQFVALK ARMPWMDINQ ERTYAYCGYV GIVEMVRQID KSLSNPIWAQ VRSAPPWDEV TWEQRADAAN AADDRQRAIF GRSARVA
|
| |