Gene RPB_0972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0972 
Symbol 
ID3909327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1119361 
End bp1120827 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content65% 
IMG OID637882865 
Productnitrogenase molybdenum-cofactor biosynthesis protein NifE 
Protein accessionYP_484593 
Protein GI86748097 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGTC TCGCAGACAA GATCCAAGAT GTCTTCAACG AGCCCGGCTG TGCGGACAAT 
CAGGCCAAGT CCGAGAAGCA ACGCAAGAAG GGCTGCAGCA AACCGCTGCA GCCCGGCGGC
GCGGCCGGCG GCTGCGCCTT CGACGGCGCC AAGATCGCGC TGCAGCCGAT CGTCGACGTC
GCCCATCTGG TGCACGGCCC GATCGCCTGC GAAGGCTCGT CCTGGGACAA TCGCGGCACC
AAATCGTCGG GCTCGAAGCT GTATCGCACC GGCTTCACCA CCGACATGAG CGAGAACGAC
GTGGTGTTCG GCGGCGAGAA GCGGCTGTTC CGGTCGATCA GGGAAATCAT CGAGAAGTAC
GACCCGCCCG CGGTGTTCGT GTATCAGACC TGCGTGCCGG CGATGATGGG CGACGACATC
GTCGCGGTCT GTAAGGTGGC GGCCGAGAAA TTCGGCAAGC CCTGCATCCC GATCATCGCG
CCGGGCTTCG TCGGCCCGAA GAATCTCGGC AACAAGCTCG CCGGCGAGGC GATGCTCGAC
TACGTGATCG GCACGCAGGA GCCGGAGGTC ACCACCCCCT ACGACATCAA CATCATCGGC
GAATACAACG TCGCCGGCGA ATTGTGGCAG GTCAAGCCGC TGCTCGACGA GCTCGGCATC
CGCATCCTGT CCTGCCTGTC GGGCGACGCG CGCTATCACG AGGTCGCGCA GTCGCATCGC
GCCCGCGCCG CCATGATGGT GTGCTCGACC GCGATGATCA ACGTCGCCCG CAAGATGCAG
GAGCGCTACG GCATTCCGTA TTTCGAGGGC TCGTTCTACG GCATCACCGA CACCTCGGAT
TCGCTGCGGC AGATCGCACG GCTGCTGATC GCGCGCGGCG CCGATGCCGA GCTGATGGAC
CGCGTCGAGG CGCTGATCGC GCGCGAGGAG GCCAAGGCCT GGGCCGCCAT CAAGGCCTAT
ACGCCGCGGC TCGCAGGCAA GAAAGTGCTG CTGATCACCG GCGGCGTGAA GTCGTGGTCG
GTGGTGCTGG CGCTGCAGGA AGCCGGGCTC ACCATCGTCG GCACCAGCGT CAAGAAATCG
ACCAAGGAGG ACAAGGAGCG GCTCAAGGAG ATGAGTCCCG ACGTCCATCT GATCGACGAT
CTGCGGCCGC GCGAAATGTA CAAGATGCTG AAAGAGGCGC AGGCCGACAT AATGTTGTCC
GGCGGCCGCT CGCAATTCGT CGCGCTGAAG GCGCGGATGC CCTGGATGGA TATCAACCAG
GAACGCTCTT ACGCGTATTG CGGCTATGTC GGCATTGTCG AGATGGTGCG GCAGATCGAC
AAGGCGCTGT CGAACCCGAT CTGGCAGCAG GTCCGCTCGG CGCCGCCGTG GGACGAAGTG
AGCTGGGAGA CCCGCGCCGA CGCCGCCAAT GCCGCCGACG ACGCACAGCG CGCCGCCGAG
GCCGCACCCG CCGCAAAGGT CGCGTGA
 
Protein sequence
MSRLADKIQD VFNEPGCADN QAKSEKQRKK GCSKPLQPGG AAGGCAFDGA KIALQPIVDV 
AHLVHGPIAC EGSSWDNRGT KSSGSKLYRT GFTTDMSEND VVFGGEKRLF RSIREIIEKY
DPPAVFVYQT CVPAMMGDDI VAVCKVAAEK FGKPCIPIIA PGFVGPKNLG NKLAGEAMLD
YVIGTQEPEV TTPYDINIIG EYNVAGELWQ VKPLLDELGI RILSCLSGDA RYHEVAQSHR
ARAAMMVCST AMINVARKMQ ERYGIPYFEG SFYGITDTSD SLRQIARLLI ARGADAELMD
RVEALIAREE AKAWAAIKAY TPRLAGKKVL LITGGVKSWS VVLALQEAGL TIVGTSVKKS
TKEDKERLKE MSPDVHLIDD LRPREMYKML KEAQADIMLS GGRSQFVALK ARMPWMDINQ
ERSYAYCGYV GIVEMVRQID KALSNPIWQQ VRSAPPWDEV SWETRADAAN AADDAQRAAE
AAPAAKVA