Gene Rsph17029_2189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2189 
Symbol 
ID4895724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2317055 
End bp2319886 
Gene Length2832 bp 
Protein Length943 aa 
Translation table11 
GC content70% 
IMG OID640112783 
Productbifunctional nitrogenase molybdenum-cofactor biosynthesis protein NifE/NifN 
Protein accessionYP_001044064 
Protein GI126462950 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE
[TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.235442 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGAAG CCTTGAAGCA GAAGATCCAG GACGCCTTTC ACGAGCCGGG CTGCGCCACC 
AACACCGCCA AGTCCGAGGG CGAGCGCCGG AAGGGATGCG CGAAGCAGCT CACGCCCGGC
GCGGCGGCCG GGGGCTGCGC CTTCGACGGG GCGATGATCG CGCTGCAGCC CATCACCGAC
GTGGCCCATC TCGTCCATGC CCCGCTCGCC TGCTGGGGCA ACGGCTGGGA CAACCGCGGC
TCGGCCTCGT CGGGCTCCGA CCTCTACCGT CGCGGCTTCA CCACCGACCT TTCCGAGCTC
GACATCGTGA TGGGCCGCGG CGAGGCCAGG CTCTTCCGCG CCATCCGCGA AGTGATCGCG
CAGGAGAACC CGGCCGCAGT CTTCGTCTAT GCCACCTGCG TGACCGCACT CATCGGCGAC
GACATCGGCG CCGTCTGCAA GGCCGCCGCC GAACGGTTCG GCCGCCCGGT GATCCCGATC
AACGTGCCGG GCTATGTCGG CTCGAAGAAC CTCGGCAACA AGCTGGGGGT GGACGCGCTG
GTCGAACATG TCGTGGGGGC GATGGAGCCC GAGACGACCA CCGATTGCGA CATCAACATC
ATTGGTGATT TCAACCTGTC GGGCGAGATC TGGCAGGTGA AGCCGCTCCT GGACCGGCTG
GGCATCCGTA TCCTGGGCAG CGTTTCGGGC GATGCGCGCT ACGCACAGGT GGCGATGATG
CACCGCGCCC GGGTGACGAT GCTCGTCTGC TCACACGCCT TCATGGGCAT CGCCCGCAAG
CTCGAGGACC GCTACGGCAT CCCGTGGTTC GAGGGCAGCT TCTACGGCAT CTCTGACACG
TCCGACGCGC TGCGGACCAT GTGCCGGATG CTGGTCGAGC GCGGCGCGCC CGCCGACCTC
CTGACCCGCT GCGAGGCGCT GATCGCCGAG GAGGAGGCCC GCACCTGGGC CGCGCTCGAG
CCGCTGCGCC CCGCCGTCGC CGGCCGGCGC GTGCTCCTCT ACACCGGCGG GCACAAGACC
TGGTCGGTGG TCTCGGCCCT TCAGGAGCTG GGCATGGAGG TGGTCGGCAC CTCGATGCGC
AAGGCCACGC CCGGCGACCG CGCGCGCGTT GCCGAGATCA TGGGCACCGA GGCCCACATG
TACGAGAACA TGGCGCCGAA GGAGATGTAT CGGATGCTGC GGGACGCGCG GGCCGATGTG
CTCATGTCGG GGGGGGCGGT CGCAGTTCGT GGCGCTGAAG GCCCGCGTGC CCTGGATCGA
CGTGAATCAG GAAAAGCACG AGCCCTACGC GGGTTACATG GGCATGGTCG ATCTCGTGCG
CGCCATCGAC CGGTCGATCA ACAACCCGAT GTGGGCCGAG CTGCGCGACC CCGCGCCGTG
GGACGTGCCG GCCGAAGAGG CCGCCGTGAC GCCCTTCAGC CTCGCGGCCG TTCCCGGCTC
GAAAGCCGAT TTCGAGGATT GCTGATGGCC CGCCTCATCC ACCCCGACCG CGCGCTCTCG
ACCAATCCGC TGAAGGTCTC GGCCCCGCTC GGCGCCGCCA TGGCCTATCT CGGCATCGAG
GGCGCGATCC CGCTCTTTCA CGGCGCGCAG GGCTGCACCG CCTTCGCGAT GGTCCATATG
GTGCGCCATT TCAAGGAGGC GATCCCGCTT CAGACCACGG CGATGAACGA GGTCTCGGCG
ATCCTCGGCG GCGGCGAACA GATCGAGGAG GCGATCGAGA ACCTCCGGAA GCGTGCCTCC
CCGAAGTTCA TCGGCATCGC CTCGACCGCG CTCGTCGAGA CGCGCGGCGA GGATATCGCA
GGCGAACTGC GCGAGATGCT GGCCCGGCGC CGCGACTTTG CGGATACGGC GGTGGTCTAT
GCGGCCACAC CGGATTTCGC GGGCGGGCTC GAGGAGGGCT GGGCCCGCGC GGTCGAAGCC
ATCATCGAGG CGCTGGTGAC CGAGGGGCCG CGGCGGCTCC GGCAGGTGAA CCTCCTGCCC
GGCGCCAACA TGACCGCCGC CGACATCGAG GAGATCGCAG GCCTGATCCG CGCCTTCGGC
CTCCATCCGG TGATCCTGCC CGATCTCTCG CTCTCGCTCG ACGGCCATCT GGCCGAGGAC
TGGCGCGGCC ATTCGCTGGG CGGCACGCGG CTCGCCGACA TCGCGGCGAT GGGCGGCTCC
ATCGCGACGC TGGCGCTGGG CGAGGCGATG CGCCCGGCGG CCGAGAAGCT GGCCGCTCTG
GGCGTGCCCG CGCATGTCTT TCCCTCGGTG ACGGGGCTCA AGGCGGTGGA TGCCTTCGTT
GCGACCCTCA TGCGGCTCTC GGGGGCGGAG GTGCCCGCAG CCGTCCGGCG CGACCGGGCG
CGGCTGGCGG ACGCGATGCT CGATGCGCAT TTCCACATCG GCGGCCTGAA GGTCGCCATG
GGTCTCGACC CGGACCTCGG CCTCGCGCTC GGCTCCACGC TGGCCGCCAT GGGCGCAAAG
CTGACCGTCG TCGCCAGCAC GGCGAGCCCC GCCGTGGAAC GCCTGCCGGT CGAGGAGGTG
CTGATCGGCG ATCTCGGCGA TCTCGAGCGG CTGGCCGAGG CCTCCGGCGC GCGGCTTCTG
CTGACCCATG CCCACGGCCG GATGATGGCC GAGCGGCTGC ATCTGCCCCA TGTCCGGGCG
GGCTTCCCGA TCTTCGACCG GCTGGGCACG ATGGATGCCT GCCGCACCGG ATACCGCGGC
ACGCGCGCCT TCCTCTTCGA GATCGCCAAT GCCTTGCTCG CGCACCCGCA CCGGCCGCGT
CCGGAGGATT TCGGCGCCGC CCGTCTCTCC CCGGAGTTCG ACCATGCCCC CCCGCCGCCT
CAGACTCATT GA
 
Protein sequence
MSEALKQKIQ DAFHEPGCAT NTAKSEGERR KGCAKQLTPG AAAGGCAFDG AMIALQPITD 
VAHLVHAPLA CWGNGWDNRG SASSGSDLYR RGFTTDLSEL DIVMGRGEAR LFRAIREVIA
QENPAAVFVY ATCVTALIGD DIGAVCKAAA ERFGRPVIPI NVPGYVGSKN LGNKLGVDAL
VEHVVGAMEP ETTTDCDINI IGDFNLSGEI WQVKPLLDRL GIRILGSVSG DARYAQVAMM
HRARVTMLVC SHAFMGIARK LEDRYGIPWF EGSFYGISDT SDALRTMCRM LVERGAPADL
LTRCEALIAE EEARTWAALE PLRPAVAGRR VLLYTGGHKT WSVVSALQEL GMEVVGTSMR
KATPGDRARV AEIMGTEAHM YENMAPKEMY RMLRDARADV LMSGGAVAVR GAEGPRALDR
RESGKARALR GLHGHGRSRA RHRPVDQQPD VGRAARPRAV GRAGRRGRRD ALQPRGRSRL
ESRFRGLLMA RLIHPDRALS TNPLKVSAPL GAAMAYLGIE GAIPLFHGAQ GCTAFAMVHM
VRHFKEAIPL QTTAMNEVSA ILGGGEQIEE AIENLRKRAS PKFIGIASTA LVETRGEDIA
GELREMLARR RDFADTAVVY AATPDFAGGL EEGWARAVEA IIEALVTEGP RRLRQVNLLP
GANMTAADIE EIAGLIRAFG LHPVILPDLS LSLDGHLAED WRGHSLGGTR LADIAAMGGS
IATLALGEAM RPAAEKLAAL GVPAHVFPSV TGLKAVDAFV ATLMRLSGAE VPAAVRRDRA
RLADAMLDAH FHIGGLKVAM GLDPDLGLAL GSTLAAMGAK LTVVASTASP AVERLPVEEV
LIGDLGDLER LAEASGARLL LTHAHGRMMA ERLHLPHVRA GFPIFDRLGT MDACRTGYRG
TRAFLFEIAN ALLAHPHRPR PEDFGAARLS PEFDHAPPPP QTH