Gene RPB_1754 details


Gene Information

Locus tag: RPB_1754
Symbol: nirA
ID: 3909741
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2007058
End bp: 2008818
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 68%
IMG OID: 637883648
Product: ferredoxin-nitrite reductase
Protein accession: YP_485373
Protein GI: 86748877
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0155] Sulfite reductase, beta subunit (hemoprotein)
TIGRFAM ID: [TIGR02435] precorrin-3B synthase
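
The listed gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon that is not counted in the protein length; both assumptions are consistent with the values in this record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2007058, 2008818)
print(length_bp)                  # 1761
print(protein_length(length_bp))  # 586
```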


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.188302
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00588237
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGGAAGCG AATTCACGAT CGACCAGAAG CGATACCTGG AAGGCTTCGC CACCGGCATC 
AACGCCGCGC GGGTGCAGCG CGGCGCCCTG CCCGCCGCCG GCAGCGCGCA GCCCTCCGGC
CCCGACGCGA TCCACATCGC CGCGCAGGAT CGCTACACTG CCGCGGGCAA GAAACTCGCC
GAAGCGGAGA AGTGGAAGCG CGAGGAGCAT CCGTTCGACG CCTATGCGCG GCTGAAGCAG
CAGGCCAAGA CAAATACGCC GCCGAAGCCG GCCGACAATT TCCGCTGGCG CTTTTACGGG
CTGTTCTACG TCGCCCCGAC CCAGAGCTCC TATATGTGCC GGTTGCGGAT TCCCAATGGC
GTGCTGACGT CGTGGCAGAT GCAGGGCCTC GCCGACATCG CCGACAATTG TGCCGGCGGC
TATTCCCACG TGACGACGCG CGCCAATCTG CAGATGCGCG AGATCGCGCC GAAGAACGCC
GTGACTCTGA TCGAAGGCAT CGAGTCGCTC GGCCTGTGGG CGCGCGGCGC CGGCGCCGAC
AACATCCGCA ACGTCACCGG TTCGGCCACC GCCGGCATCG ATCCGCAGGA ATTGCTCGAC
ACCCGCGCTT ACGCGCGCGA GTGGCACTTC CACATCCTCA ACGAGCGCGC GCTGTACGGC
CTGCCGCGCA AATTCAACGT CGCCTTCGAC GGTGGCGGGC TGATCCCGAC GCTGGAGGAC
ACCAACGACA TCGCGTTCCA GGCGGTGACG ATCGGCGACG GCCACGGCGT CGAACCGGGC
GTGTGGTTCC GGCTCTCGCT CGGCGGCATC ACCGGCCACA AGGATTTTGC CCGCGACACC
GGCGTGATCG TGCCGCCGGA TGAAGCGACC GAAGTCTCGG ACGCGATCGT GCGGGTGTTC
ATCGAGCACG GCGACCGCAC CGACCGGGCC AGGTCGCGGC TGAAATACGT GCTCGACCGT
TTCGGCTTCG ATGAGTTTCT CAGGCGGGTC GAGGGGCGGC TCGGACGCAA GCTGGTGCGC
GTGCCCGCGG AGGCCGCGCA GCCACGGCCG GCGCAGGATC GCGCCGCGCA TATCGGCGTC
CATCGCCAGA AGCAGGCCGG GCTGAACTGG ATCGGCGTGC GGCTGCCGCT CGGCAAGCTG
ACGAGCGCGC AGATGCGCGG CCTCGCCGAG ATTGCGCACA ATTTCGGCGA CGGCGATATC
CGGCTGACCG TGTGGCAGAA CCTGTTGATC TCCGGCGTTC CGGACGCGCG TGTCGCGGAC
GCAAGTGCGG CGATCGTCGC GCTCGGCCTC GCGATCGACG CCAGTCCGAT CCGCGCCGGA
CTGATCGCCT GCACTGGCGC CACCGGCTGC CGGTTCGGCG CCGCCAAGAC CAAGGAAACC
GCCGAAGCCA TCGCGGGCTA TTGCGAGCCG CGGGTGCCAC TCGACACGCC GGTGAACATC
CACCTCACCG GCTGCCATCA TTCCTGCGCG CAACATTACA TCAGCGACAT CGGCCTGATC
GGCGCCAAGG TGGCGATCTC CGACGAAGAC ACCGTCGAGG GCTTCCACAT CCATGTCGGC
GGTGCGTTCG GCGAAGGCGC CGCGATCGGC GCCGAAGTGC TGCGCGACGT CAGACAGGAC
GACGCGCCGC GTGTCATCGC GCAAATGCTG AGCACGTATC TCGCGCAACG CGCTTCGCCG
GACGAAACCT TCCTCGCCTT CGCGCGCCGC CACGACACCC CGACGCTGCA ACGTCTGTTC
GCTCTGGAGA CCGGTGCATG A
 
Protein sequence
MGSEFTIDQK RYLEGFATGI NAARVQRGAL PAAGSAQPSG PDAIHIAAQD RYTAAGKKLA 
EAEKWKREEH PFDAYARLKQ QAKTNTPPKP ADNFRWRFYG LFYVAPTQSS YMCRLRIPNG
VLTSWQMQGL ADIADNCAGG YSHVTTRANL QMREIAPKNA VTLIEGIESL GLWARGAGAD
NIRNVTGSAT AGIDPQELLD TRAYAREWHF HILNERALYG LPRKFNVAFD GGGLIPTLED
TNDIAFQAVT IGDGHGVEPG VWFRLSLGGI TGHKDFARDT GVIVPPDEAT EVSDAIVRVF
IEHGDRTDRA RSRLKYVLDR FGFDEFLRRV EGRLGRKLVR VPAEAAQPRP AQDRAAHIGV
HRQKQAGLNW IGVRLPLGKL TSAQMRGLAE IAHNFGDGDI RLTVWQNLLI SGVPDARVAD
ASAAIVALGL AIDASPIRAG LIACTGATGC RFGAAKTKET AEAIAGYCEP RVPLDTPVNI
HLTGCHHSCA QHYISDIGLI GAKVAISDED TVEGFHIHVG GAFGEGAAIG AEVLRDVRQD
DAPRVIAQML STYLAQRASP DETFLAFARR HDTPTLQRLF ALETGA
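
Both sequences are displayed in space-separated blocks of 10 characters, so they need to be joined before use. The following is a minimal sketch of how the blocks could be cleaned up and how a GC fraction such as the GC content listed in the gene information might be computed. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the listing
first_line = "ATGGGAAGCG AATTCACGAT CGACCAGAAG CGATACCTGG AAGGCTTCGC CACCGGCATC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the full gene sequence through `clean_sequence` should give a string of 1761 bases, matching the listed gene length.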