Gene RPD_1061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1061 
Symbol 
ID4021537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1215686 
End bp1217434 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content63% 
IMG OID637961253 
Producttranscriptional regulator NifA 
Protein accessionYP_568200 
Protein GI91975541 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains 
TIGRFAM ID[TIGR01817] Nif-specific regulatory protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.798583 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCAGC GCGAAGTACG TCTTGTCGAG AGCGAGCAAT CGCGGCAGCC GATGAACCAG 
AACCCGATAC CACTGAGTGA GATTGCGCTC ACCGGCATCT TCGAAATCTC GAAGATCCTC
ACCGCGCCGG CGCGCCTCGA AGTCACGCTC GCCAATGTGG TCAATTTGCT GCAGTCCTTT
CTGCAGATGC GCAACGGTGT CGTTTCACTA CTGGCTGACG ACAGCGTTCC GGACATCACG
GTCGGGGGCG GCTGGAACGA GGGCAGCGAC AACCGCTATC GCGCAAGGCT GCCGCAAAAA
GCGATCGACC AGATCGTCGC GACCTCGGTT CCGCTTGTCG CCGACAATGT GTCCACACAT
CCGATGTTCT CTGCAGCGGA CGCTCTTGCG CTCGGCGCGA CCGACGAGAC CCGCGTGTCG
TTCATCGGCG TGCCGATCCG GATCGACTCG CGCGTGGTCG GGACGCTGAC GATCGACCGG
GTTCGCGACG GGCAGTCGAA CTTCAGGATG GACGCCGATG TGCGTTTCCT GACCATGGTT
GCCAATCTGA TTGGCCAAAC CGTGAAACTG CACCGCGTAG TGGCGCGGGA TCGCGAACGG
CTGATGGCGG AAAGTCATCG CCTGCAGAAA GAATTGTCCG AGTTGAAGCC TCAGCGCGAG
CGCAAGCGCG TTCGCGTCGA TGGCATCGTC GGCGAGAGTC CGGCGATTCG TACGTTGCTT
GCCAAAGTCA GCATCATCGC CAAATCGCAA TCGCCGGTGC TGCTGCGCGG CGAGTCGGGC
ACCGGCAAGG AACTGATCGC AAAGGCGATC CATGAATTGT CAGCGCGTAG CAACGGGCCG
TTCATCAAGA TCAACTGCGC GGCGCTTCCC GAATCGGTGC TCGAATCGGA ATTGTTCGGG
CACGAGAAGG GCGCGTTCAC CGGTGCGATC GCCTCGCGCA AGGGCCGGTT CGAACTCGCC
GACAAGGGCA CGCTGTTCCT CGACGAGATC GGCGAGATCT CCCCCGCATT TCAGGCCAAG
CTGCTGCGCG TTCTGCAGGA GCAGGAGTTC GAGCGGGTCG GCGGCAACCA GACCATCAAG
GTCAACGTCC GCATCGTTGC TGCGACCAAC CGCAACCTCG AGGAAGCGGT GGCGCGCAAG
GAGTTTCGCG CCGATTTGTA TTATCGCATC AACGTCGTGC CGATGATCCT GCCGCCACTG
CGCGACAGGC CGAGCGACAT TCCGTTGCTG GCGACCGAGT TTCTGAAGAA CTTCAACAAG
GAGAACGATC GCGAGCTGGC GTTCGAGCCG CACGCGCTGG AATTGTTGAA GGCCTGCTCG
TTCCCCGGAA ACGTCCGCGA ACTCGAGAAC TGCGTGCGGC GAACGGCGAC GCTGGCGGCG
GGACCGGCCA TCCACGACAG TGATTTCGCC TGTCACCAGG ACGAGTGCCT GTCCGCGATC
CTTTGGAAAG GCCACGCCGA GCCGCCGCCG GAGCGGCCGC GGCCACAAAT CCCGCTGCAG
GTGTTGCCGC GCAAGGTCCC GGTTGAGGTC GTCACGCCGC GCGAGGCATT CACTGCGCCT
ACGGAGCCGG ACCAAACCGC GGTGCGGGCC GCGTCGAACG ACGCAGCGAT GCCGGAGCGT
GAGCGCCTGA TCAATGCGAT GGAACGATCC GGCTGGGTGC AGGCGAAGGC CGCACGCCTC
CTCGGACTGA CGCCGCGTCA GATCGGTTAC GCGCTCAAGA AGCACGATAT CGAACTCAAG
CATTTCTAG
 
Protein sequence
MAQREVRLVE SEQSRQPMNQ NPIPLSEIAL TGIFEISKIL TAPARLEVTL ANVVNLLQSF 
LQMRNGVVSL LADDSVPDIT VGGGWNEGSD NRYRARLPQK AIDQIVATSV PLVADNVSTH
PMFSAADALA LGATDETRVS FIGVPIRIDS RVVGTLTIDR VRDGQSNFRM DADVRFLTMV
ANLIGQTVKL HRVVARDRER LMAESHRLQK ELSELKPQRE RKRVRVDGIV GESPAIRTLL
AKVSIIAKSQ SPVLLRGESG TGKELIAKAI HELSARSNGP FIKINCAALP ESVLESELFG
HEKGAFTGAI ASRKGRFELA DKGTLFLDEI GEISPAFQAK LLRVLQEQEF ERVGGNQTIK
VNVRIVAATN RNLEEAVARK EFRADLYYRI NVVPMILPPL RDRPSDIPLL ATEFLKNFNK
ENDRELAFEP HALELLKACS FPGNVRELEN CVRRTATLAA GPAIHDSDFA CHQDECLSAI
LWKGHAEPPP ERPRPQIPLQ VLPRKVPVEV VTPREAFTAP TEPDQTAVRA ASNDAAMPER
ERLINAMERS GWVQAKAARL LGLTPRQIGY ALKKHDIELK HF