Gene RPD_1076 details

Gene Information

Locus tag: RPD_1076
Symbol:
ID: 4021552
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 1228173
End bp: 1229639
Gene Length: 1467 bp
Protein Length: 488 aa
Translation table: 11
GC content: 65%
IMG OID: 637961268
Product: nitrogenase molybdenum-cofactor biosynthesis protein NifE
Protein accession: YP_568215
Protein GI: 91975556
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE
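
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the gene record.
START_BP = 1228173
END_BP = 1229639
GENE_LENGTH_BP = 1467
PROTEIN_LENGTH_AA = 488


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp):
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 1229639 - 1228173 + 1 gives 1467 bp, and 1467 / 3 - 1 gives 488 codons, matching the listed protein length.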


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.57688
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGCCGCC TCGCAGACAA GATTCAAGAT GTCTTCAACG AGCCCGGCTG CGCGGACAAT 
CAGGCCAAGT CCGAGAAACA GCGCAAGAAG GGCTGCAGCA AACCGCTGCA GCCCGGCGGC
GCGGCCGGTG GCTGCGCTTT CGACGGCGCC AAGATCGCGC TGCAGCCGAT CGTCGACGTC
GCCCATCTGG TGCACGGCCC GATCGCTTGC GAAGGCTCGT CCTGGGACAA TCGCGGCACC
AAGTCGTCGG GCTCGAAGCT GTATCGCACC GGCTTCACCA CCGACATGGG CGAGAACGAC
GTGGTGTTCG GCGGCGAGAA GCGGCTGTTC CGGTCGATCA GGGAGATCAT CGAAAAGTAT
CATCCGCCCG CGGTGTTCGT GTATCAGACC TGCGTGCCGG CGATGATGGG CGATGACATC
GTCGCGGTCT GCAAGGTGGC GACCGAGAAG CTCGGCACGC CCTGCGTTCC GATCATCGCG
CCGGGCTTCG TCGGCCCGAA GAATCTCGGC AACAAGCTCG CCGGCGAAGC AATGCTCGAC
TACGTGATCG GCACCCAGGA GCCCGAGGTC ACGACGCCCT ACGACATCAA CATCATCGGC
GAATACAACG TCGCCGGCGA ATTGTGGCAG GTCAAGCCGC TGCTCGACGA GCTCGGCATC
CGCATCCTGT CGTGCCTGTC GGGCGACGCG CGCTATCATG AGGTGGCGCA GTCGCATCGC
GCCCGCGCCG CCATGATGGT GTGCTCGACC GCGATGATCA ACGTCGCCCG CAAGATGGAA
GAGCGCTACG GCATTCCGTA TTTCGAAGGC TCGTTCTACG GCATCACCGA CACCTCGGAT
TCGCTGCGGC AGATCGCGCG GTTGCTGATC CAGCGCGGCG CCGATGCCGA GCTGATGGAC
CGCGTCGAGG CGCTGATCGC GCGCGAGGAG GCGAAGGCCT GGGCCGCCAT CAAGGCCTAT
ACGCCGCGGC TCGAAGGCAA GAAAGTGCTG CTGATCACCG GCGGCGTGAA GTCGTGGTCG
GTGGTGCTGG CGCTGCAGGA GGCCGGGCTC ACCATCGTCG GCACCAGCGT CAAGAAGTCG
ACCAAGGAGG ACAAGGAGCG GCTCAAGGAG ATGAGCCCCG ACGTCCATCT GATCGACGAT
CTGCGGCCGC GCGAAATGTA CAAGATGCTG AAAGAGGCGC AGGCCGACAT CATGCTGTCC
GGCGGGCGCT CGCAATTCGT CGCGCTGAAG GCGCGGATGC CCTGGATGGA TATCAACCAG
GAGCGTACTT ACGCCTATTG CGGCTATGTC GGCATTGTCG AAATGGTGCG GCAGATCGAC
AAGGCGCTGT TCAACCCGAT CTGGGAGCAG GTCCGCTCCG CCCCGCCGTG GGAGGAGATC
AGCTGGGAAA CCCGCGCCGA CGCCGCCAAT GCGGCCGACG ACGCGCAGCG CGCCGCCGAG
GCGACGCCTG CCGCAAAGGT TGCGTGA
 
Protein sequence
MSRLADKIQD VFNEPGCADN QAKSEKQRKK GCSKPLQPGG AAGGCAFDGA KIALQPIVDV 
AHLVHGPIAC EGSSWDNRGT KSSGSKLYRT GFTTDMGEND VVFGGEKRLF RSIREIIEKY
HPPAVFVYQT CVPAMMGDDI VAVCKVATEK LGTPCVPIIA PGFVGPKNLG NKLAGEAMLD
YVIGTQEPEV TTPYDINIIG EYNVAGELWQ VKPLLDELGI RILSCLSGDA RYHEVAQSHR
ARAAMMVCST AMINVARKME ERYGIPYFEG SFYGITDTSD SLRQIARLLI QRGADAELMD
RVEALIAREE AKAWAAIKAY TPRLEGKKVL LITGGVKSWS VVLALQEAGL TIVGTSVKKS
TKEDKERLKE MSPDVHLIDD LRPREMYKML KEAQADIMLS GGRSQFVALK ARMPWMDINQ
ERTYAYCGYV GIVEMVRQID KALFNPIWEQ VRSAPPWEEI SWETRADAAN AADDAQRAAE
ATPAAKVA
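
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first and last lines of the gene sequence are used here rather than the full record.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate an in-frame DNA string; stop codons are shown as '*'."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and final 27 bases of the gene sequence in this record.
first_bases = "ATGAGCCGCC TCGCAGACAA GATTCAAGAT"
last_bases = "GCGACGCCTG CCGCAAAGGT TGCGTGA"

print(translate(first_bases))  # MSRLADKIQD
print(translate(last_bases))   # ATPAAKVA*
```

The two outputs correspond to the first ten and last eight residues of the protein sequence, with the trailing `*` marking the TGA stop codon. Calling `gc_percent` on the full gene sequence would give the value to compare against the GC content listed in the record.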