Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_0538 |
Symbol | nifE |
ID | 3718047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | - |
Start bp | 2273554 |
End bp | 2275017 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640071747 |
Product | nifE, nitrogenase molybdenum-cofactor synthesis protein |
Protein accession | YP_353611 |
Protein GI | 77464107 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.510346 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGAAG CCTTGAAGCA GAAGATCCAG GACGCCTTTC ACGAGCCGGG CTGCGCCACG AACACCGCCA AGTCCGAGGG CGAGCGCCGG AAGGGCTGCG CGAAACAGCT GACGCCCGGC GCGGCGGCCG GGGGCTGCGC CTTCGACGGG GCGATGATCG CGCTGCAGCC CATCACCGAC GTGGCCCATC TCGTCCATGC CCCGCTCGCC TGCTGGGGCA ACGGCTGGGA CAACCGCGGC TCGGCCTCGT CGGGCTCCGA CCTCTACCGT CGCGGCTTCA CCACCGACCT CTCCGAGCTC GACATCGTGA TGGGCCGCGG CGAGGCCAGG CTCTTCCGTG CCATCCGCGA AGTGATCGCG CAGGAGAACC CGGCCGCAGT CTTCGTCTAT GCCACCTGCG TGACGGCGCT CATCGGCGAC GACATCGGCG CCGTCTGCAA GGCCGCCGCC GAACGGTTCG GCCGCCCGGT GATCCCGATC AACGTGCCGG GCTATGTGGG CTCGAAGAAC CTCGGCAACA AGCTGGGGGT GGACGCGCTG GTCGAACATG TCGTGGGGAC GATGGAGCCC GCGACGGCGA CCGATTGCGA CATCAACATC CTCGGCGACT TCAACCTGTC GGGCGAACTC TGGCAGGTGA AGCCGCTGCT CGACCGCCTC GGCATCCGCA TCCTCGGCTC GGTCTCGGGG GATGCGCGCT ATGCGCAGGT GGCCATGATG CACCGGGCGC GGGTGACGAT GCTCGTCTGC TCGCACGCCT TCCTGGGCAT CGCCCGCAAG CTCGAGGACC GCTACGGCAT CCCGTGGTTC GAGGGCAGCT TCTACGGCAT CTCCGACACG TCCGACGCGC TGCGGACCCT GTGCCGGATG CTGGTCGAGC GCGGCGCGCC CGCGGACCTC GTGACCCGCT GCGAGGCGCT GATCGCCGAG GAGGAGGCCC GCACCTGGGC CGCGCTGGAA CCGCTCCGCC CCGCCGTCGC CGGCCGGCGC GTGCTCCTTT ACACCGGCGG GCACAAGACC TGGTCGGTGG TCTCGGCGCT GCAGGAACTC GGCATGGAGG TGGTCGGCAC CTCGATGCGC AAGGCCACGC CCGGCGACCG CGCGCGCGTC ACCGAGATCA TGGGCACCGA GGCCCACATG TACGAGAACA TGGCGCCGAA GGAGATGTAT CGGATGCTGC GGGACGCGCG GGCCGATGTG CTTATGTCGG GGGGGCGGTC GCAGTTCGTG GCGCTGAAGG CCCGCGTGCC CTGGATCGAC GTGAATCAGG AAAAGCACGA GCCCTACGCA GGCTACATGG GCATGGTCGA TCTCGTGCGC GCCATCGACC GGTCGATCAA CAACCCGATG TGGGCCGAGC TGCGCGACCC CGCGCCTTGG GACGTGCCGG CCGAAGAAGC CGCCGTGACG CCCTTCAGCC TCGCGGCCGT TCCCGGCTCG AAAGCCGATT TCGAGGATTG CTGA
|
Protein sequence | MSEALKQKIQ DAFHEPGCAT NTAKSEGERR KGCAKQLTPG AAAGGCAFDG AMIALQPITD VAHLVHAPLA CWGNGWDNRG SASSGSDLYR RGFTTDLSEL DIVMGRGEAR LFRAIREVIA QENPAAVFVY ATCVTALIGD DIGAVCKAAA ERFGRPVIPI NVPGYVGSKN LGNKLGVDAL VEHVVGTMEP ATATDCDINI LGDFNLSGEL WQVKPLLDRL GIRILGSVSG DARYAQVAMM HRARVTMLVC SHAFLGIARK LEDRYGIPWF EGSFYGISDT SDALRTLCRM LVERGAPADL VTRCEALIAE EEARTWAALE PLRPAVAGRR VLLYTGGHKT WSVVSALQEL GMEVVGTSMR KATPGDRARV TEIMGTEAHM YENMAPKEMY RMLRDARADV LMSGGRSQFV ALKARVPWID VNQEKHEPYA GYMGMVDLVR AIDRSINNPM WAELRDPAPW DVPAEEAAVT PFSLAAVPGS KADFEDC
|
| |