Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_2189 |
Symbol | |
ID | 4895724 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 2317055 |
End bp | 2319886 |
Gene Length | 2832 bp |
Protein Length | 943 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640112783 |
Product | bifunctional nitrogenase molybdenum-cofactor biosynthesis protein NifE/NifN |
Protein accession | YP_001044064 |
Protein GI | 126462950 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE [TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.235442 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGAAG CCTTGAAGCA GAAGATCCAG GACGCCTTTC ACGAGCCGGG CTGCGCCACC AACACCGCCA AGTCCGAGGG CGAGCGCCGG AAGGGATGCG CGAAGCAGCT CACGCCCGGC GCGGCGGCCG GGGGCTGCGC CTTCGACGGG GCGATGATCG CGCTGCAGCC CATCACCGAC GTGGCCCATC TCGTCCATGC CCCGCTCGCC TGCTGGGGCA ACGGCTGGGA CAACCGCGGC TCGGCCTCGT CGGGCTCCGA CCTCTACCGT CGCGGCTTCA CCACCGACCT TTCCGAGCTC GACATCGTGA TGGGCCGCGG CGAGGCCAGG CTCTTCCGCG CCATCCGCGA AGTGATCGCG CAGGAGAACC CGGCCGCAGT CTTCGTCTAT GCCACCTGCG TGACCGCACT CATCGGCGAC GACATCGGCG CCGTCTGCAA GGCCGCCGCC GAACGGTTCG GCCGCCCGGT GATCCCGATC AACGTGCCGG GCTATGTCGG CTCGAAGAAC CTCGGCAACA AGCTGGGGGT GGACGCGCTG GTCGAACATG TCGTGGGGGC GATGGAGCCC GAGACGACCA CCGATTGCGA CATCAACATC ATTGGTGATT TCAACCTGTC GGGCGAGATC TGGCAGGTGA AGCCGCTCCT GGACCGGCTG GGCATCCGTA TCCTGGGCAG CGTTTCGGGC GATGCGCGCT ACGCACAGGT GGCGATGATG CACCGCGCCC GGGTGACGAT GCTCGTCTGC TCACACGCCT TCATGGGCAT CGCCCGCAAG CTCGAGGACC GCTACGGCAT CCCGTGGTTC GAGGGCAGCT TCTACGGCAT CTCTGACACG TCCGACGCGC TGCGGACCAT GTGCCGGATG CTGGTCGAGC GCGGCGCGCC CGCCGACCTC CTGACCCGCT GCGAGGCGCT GATCGCCGAG GAGGAGGCCC GCACCTGGGC CGCGCTCGAG CCGCTGCGCC CCGCCGTCGC CGGCCGGCGC GTGCTCCTCT ACACCGGCGG GCACAAGACC TGGTCGGTGG TCTCGGCCCT TCAGGAGCTG GGCATGGAGG TGGTCGGCAC CTCGATGCGC AAGGCCACGC CCGGCGACCG CGCGCGCGTT GCCGAGATCA TGGGCACCGA GGCCCACATG TACGAGAACA TGGCGCCGAA GGAGATGTAT CGGATGCTGC GGGACGCGCG GGCCGATGTG CTCATGTCGG GGGGGGCGGT CGCAGTTCGT GGCGCTGAAG GCCCGCGTGC CCTGGATCGA CGTGAATCAG GAAAAGCACG AGCCCTACGC GGGTTACATG GGCATGGTCG ATCTCGTGCG CGCCATCGAC CGGTCGATCA ACAACCCGAT GTGGGCCGAG CTGCGCGACC CCGCGCCGTG GGACGTGCCG GCCGAAGAGG CCGCCGTGAC GCCCTTCAGC CTCGCGGCCG TTCCCGGCTC GAAAGCCGAT TTCGAGGATT GCTGATGGCC CGCCTCATCC ACCCCGACCG CGCGCTCTCG ACCAATCCGC TGAAGGTCTC GGCCCCGCTC GGCGCCGCCA TGGCCTATCT CGGCATCGAG GGCGCGATCC CGCTCTTTCA CGGCGCGCAG GGCTGCACCG CCTTCGCGAT GGTCCATATG GTGCGCCATT TCAAGGAGGC GATCCCGCTT CAGACCACGG CGATGAACGA GGTCTCGGCG ATCCTCGGCG GCGGCGAACA GATCGAGGAG GCGATCGAGA ACCTCCGGAA GCGTGCCTCC CCGAAGTTCA TCGGCATCGC CTCGACCGCG CTCGTCGAGA CGCGCGGCGA GGATATCGCA GGCGAACTGC GCGAGATGCT GGCCCGGCGC CGCGACTTTG CGGATACGGC GGTGGTCTAT GCGGCCACAC CGGATTTCGC GGGCGGGCTC GAGGAGGGCT GGGCCCGCGC GGTCGAAGCC ATCATCGAGG CGCTGGTGAC CGAGGGGCCG CGGCGGCTCC GGCAGGTGAA CCTCCTGCCC GGCGCCAACA TGACCGCCGC CGACATCGAG GAGATCGCAG GCCTGATCCG CGCCTTCGGC CTCCATCCGG TGATCCTGCC CGATCTCTCG CTCTCGCTCG ACGGCCATCT GGCCGAGGAC TGGCGCGGCC ATTCGCTGGG CGGCACGCGG CTCGCCGACA TCGCGGCGAT GGGCGGCTCC ATCGCGACGC TGGCGCTGGG CGAGGCGATG CGCCCGGCGG CCGAGAAGCT GGCCGCTCTG GGCGTGCCCG CGCATGTCTT TCCCTCGGTG ACGGGGCTCA AGGCGGTGGA TGCCTTCGTT GCGACCCTCA TGCGGCTCTC GGGGGCGGAG GTGCCCGCAG CCGTCCGGCG CGACCGGGCG CGGCTGGCGG ACGCGATGCT CGATGCGCAT TTCCACATCG GCGGCCTGAA GGTCGCCATG GGTCTCGACC CGGACCTCGG CCTCGCGCTC GGCTCCACGC TGGCCGCCAT GGGCGCAAAG CTGACCGTCG TCGCCAGCAC GGCGAGCCCC GCCGTGGAAC GCCTGCCGGT CGAGGAGGTG CTGATCGGCG ATCTCGGCGA TCTCGAGCGG CTGGCCGAGG CCTCCGGCGC GCGGCTTCTG CTGACCCATG CCCACGGCCG GATGATGGCC GAGCGGCTGC ATCTGCCCCA TGTCCGGGCG GGCTTCCCGA TCTTCGACCG GCTGGGCACG ATGGATGCCT GCCGCACCGG ATACCGCGGC ACGCGCGCCT TCCTCTTCGA GATCGCCAAT GCCTTGCTCG CGCACCCGCA CCGGCCGCGT CCGGAGGATT TCGGCGCCGC CCGTCTCTCC CCGGAGTTCG ACCATGCCCC CCCGCCGCCT CAGACTCATT GA
|
Protein sequence | MSEALKQKIQ DAFHEPGCAT NTAKSEGERR KGCAKQLTPG AAAGGCAFDG AMIALQPITD VAHLVHAPLA CWGNGWDNRG SASSGSDLYR RGFTTDLSEL DIVMGRGEAR LFRAIREVIA QENPAAVFVY ATCVTALIGD DIGAVCKAAA ERFGRPVIPI NVPGYVGSKN LGNKLGVDAL VEHVVGAMEP ETTTDCDINI IGDFNLSGEI WQVKPLLDRL GIRILGSVSG DARYAQVAMM HRARVTMLVC SHAFMGIARK LEDRYGIPWF EGSFYGISDT SDALRTMCRM LVERGAPADL LTRCEALIAE EEARTWAALE PLRPAVAGRR VLLYTGGHKT WSVVSALQEL GMEVVGTSMR KATPGDRARV AEIMGTEAHM YENMAPKEMY RMLRDARADV LMSGGAVAVR GAEGPRALDR RESGKARALR GLHGHGRSRA RHRPVDQQPD VGRAARPRAV GRAGRRGRRD ALQPRGRSRL ESRFRGLLMA RLIHPDRALS TNPLKVSAPL GAAMAYLGIE GAIPLFHGAQ GCTAFAMVHM VRHFKEAIPL QTTAMNEVSA ILGGGEQIEE AIENLRKRAS PKFIGIASTA LVETRGEDIA GELREMLARR RDFADTAVVY AATPDFAGGL EEGWARAVEA IIEALVTEGP RRLRQVNLLP GANMTAADIE EIAGLIRAFG LHPVILPDLS LSLDGHLAED WRGHSLGGTR LADIAAMGGS IATLALGEAM RPAAEKLAAL GVPAHVFPSV TGLKAVDAFV ATLMRLSGAE VPAAVRRDRA RLADAMLDAH FHIGGLKVAM GLDPDLGLAL GSTLAAMGAK LTVVASTASP AVERLPVEEV LIGDLGDLER LAEASGARLL LTHAHGRMMA ERLHLPHVRA GFPIFDRLGT MDACRTGYRG TRAFLFEIAN ALLAHPHRPR PEDFGAARLS PEFDHAPPPP QTH
|
| |