Gene Information |
Locus tag | RPD_1061 |
Symbol | |
ID | 4021537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 1215686 |
End bp | 1217434 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637961253 |
Product | transcriptional regulator NifA |
Protein accession | YP_568200 |
Protein GI | 91975541 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains |
TIGRFAM ID | [TIGR01817] Nif-specific regulatory protein |
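The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the listed gene length of 1749 bp agrees with that reading) and that the final codon is a stop codon that contributes no residue. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1215686
end_bp = 1217434

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1749 582
```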
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.798583 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCTCAGC GCGAAGTACG TCTTGTCGAG AGCGAGCAAT CGCGGCAGCC GATGAACCAG AACCCGATAC CACTGAGTGA GATTGCGCTC ACCGGCATCT TCGAAATCTC GAAGATCCTC ACCGCGCCGG CGCGCCTCGA AGTCACGCTC GCCAATGTGG TCAATTTGCT GCAGTCCTTT CTGCAGATGC GCAACGGTGT CGTTTCACTA CTGGCTGACG ACAGCGTTCC GGACATCACG GTCGGGGGCG GCTGGAACGA GGGCAGCGAC AACCGCTATC GCGCAAGGCT GCCGCAAAAA GCGATCGACC AGATCGTCGC GACCTCGGTT CCGCTTGTCG CCGACAATGT GTCCACACAT CCGATGTTCT CTGCAGCGGA CGCTCTTGCG CTCGGCGCGA CCGACGAGAC CCGCGTGTCG TTCATCGGCG TGCCGATCCG GATCGACTCG CGCGTGGTCG GGACGCTGAC GATCGACCGG GTTCGCGACG GGCAGTCGAA CTTCAGGATG GACGCCGATG TGCGTTTCCT GACCATGGTT GCCAATCTGA TTGGCCAAAC CGTGAAACTG CACCGCGTAG TGGCGCGGGA TCGCGAACGG CTGATGGCGG AAAGTCATCG CCTGCAGAAA GAATTGTCCG AGTTGAAGCC TCAGCGCGAG CGCAAGCGCG TTCGCGTCGA TGGCATCGTC GGCGAGAGTC CGGCGATTCG TACGTTGCTT GCCAAAGTCA GCATCATCGC CAAATCGCAA TCGCCGGTGC TGCTGCGCGG CGAGTCGGGC ACCGGCAAGG AACTGATCGC AAAGGCGATC CATGAATTGT CAGCGCGTAG CAACGGGCCG TTCATCAAGA TCAACTGCGC GGCGCTTCCC GAATCGGTGC TCGAATCGGA ATTGTTCGGG CACGAGAAGG GCGCGTTCAC CGGTGCGATC GCCTCGCGCA AGGGCCGGTT CGAACTCGCC GACAAGGGCA CGCTGTTCCT CGACGAGATC GGCGAGATCT CCCCCGCATT TCAGGCCAAG CTGCTGCGCG TTCTGCAGGA GCAGGAGTTC GAGCGGGTCG GCGGCAACCA GACCATCAAG GTCAACGTCC GCATCGTTGC TGCGACCAAC CGCAACCTCG AGGAAGCGGT GGCGCGCAAG GAGTTTCGCG CCGATTTGTA TTATCGCATC AACGTCGTGC CGATGATCCT GCCGCCACTG CGCGACAGGC CGAGCGACAT TCCGTTGCTG GCGACCGAGT TTCTGAAGAA CTTCAACAAG GAGAACGATC GCGAGCTGGC GTTCGAGCCG CACGCGCTGG AATTGTTGAA GGCCTGCTCG TTCCCCGGAA ACGTCCGCGA ACTCGAGAAC TGCGTGCGGC GAACGGCGAC GCTGGCGGCG GGACCGGCCA TCCACGACAG TGATTTCGCC TGTCACCAGG ACGAGTGCCT GTCCGCGATC CTTTGGAAAG GCCACGCCGA GCCGCCGCCG GAGCGGCCGC GGCCACAAAT CCCGCTGCAG GTGTTGCCGC GCAAGGTCCC GGTTGAGGTC GTCACGCCGC GCGAGGCATT CACTGCGCCT ACGGAGCCGG ACCAAACCGC GGTGCGGGCC GCGTCGAACG ACGCAGCGAT GCCGGAGCGT GAGCGCCTGA TCAATGCGAT GGAACGATCC GGCTGGGTGC AGGCGAAGGC CGCACGCCTC CTCGGACTGA CGCCGCGTCA GATCGGTTAC GCGCTCAAGA AGCACGATAT CGAACTCAAG CATTTCTAG
Protein sequence | MAQREVRLVE SEQSRQPMNQ NPIPLSEIAL TGIFEISKIL TAPARLEVTL ANVVNLLQSF LQMRNGVVSL LADDSVPDIT VGGGWNEGSD NRYRARLPQK AIDQIVATSV PLVADNVSTH PMFSAADALA LGATDETRVS FIGVPIRIDS RVVGTLTIDR VRDGQSNFRM DADVRFLTMV ANLIGQTVKL HRVVARDRER LMAESHRLQK ELSELKPQRE RKRVRVDGIV GESPAIRTLL AKVSIIAKSQ SPVLLRGESG TGKELIAKAI HELSARSNGP FIKINCAALP ESVLESELFG HEKGAFTGAI ASRKGRFELA DKGTLFLDEI GEISPAFQAK LLRVLQEQEF ERVGGNQTIK VNVRIVAATN RNLEEAVARK EFRADLYYRI NVVPMILPPL RDRPSDIPLL ATEFLKNFNK ENDRELAFEP HALELLKACS FPGNVRELEN CVRRTATLAA GPAIHDSDFA CHQDECLSAI LWKGHAEPPP ERPRPQIPLQ VLPRKVPVEV VTPREAFTAP TEPDQTAVRA ASNDAAMPER ERLINAMERS GWVQAKAARL LGLTPRQIGY ALKKHDIELK HF
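The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built from the standard NCBI amino-acid string, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which this sketch does not model). Only the first 30 bases of the gene are used as input here.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) paired with their amino acids.
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the
# standard code; alternative start codons are not handled in this sketch.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
fragment = "ATGGCTCAGC GCGAAGTACG TCTTGTCGAG"
print(translate(fragment))  # MAQREVRLVE
print(gc_fraction(fragment))
```

Passing the full gene sequence to the same functions should reproduce the listed protein sequence and a GC fraction close to the 63% reported in the record.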