Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0957 |
Symbol | |
ID | 3909312 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1106632 |
End bp | 1108383 |
Gene Length | 1752 bp |
Protein Length | 583 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637882850 |
Product | transcriptional regulator NifA |
Protein accession | YP_484578 |
Protein GI | 86748082 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains |
TIGRFAM ID | [TIGR01817] Nif-specific regulatory protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCAGC GCGAAGTACG TCTTGTCGAG AGCGAGCAAT CGCGGCAGCC GATGAACCAG AACCCGATAC CGCTGAGTGA GATTGCGCTC ACCGGCATTT TTGAAATCTC CAAGATCCTC ACAGCGCCGG CGCGGCTCGA AGTCACGCTC GCGAATGTCG TCAATCTGCT GCAGTCCTTT CTGCAGATGC GAAATGGCGT CGTGTCGCTG CTGGCCGACG ACAGTGTTCC CGACATCACG GTCGGCGTGG GCTGGAACGA AGGCAGCGAC AATCGCTATC GCGCGCGACT CCCGCAGAAA GCCATCGACC AGATCGTCGC CACCTCGGTG CCGCTGGTCG CCGACAACGT GGCCGCGCAT CCGATGTTCT CCGCCGCCGA CGCGCTCGCG CTGGGCGCGA CCGACGAGAC CCGTGTGTCG TTCATCGGCG TGCCGATCCG GATCGATTCG CGGGTCGTGG GCACCCTGAC GATCGACCGG GTTCGCGACG GGCAGTCGAT CTTCCGGATG GACGCCGATG TCCGGTTCCT GACTATGGTC GCCAATCTGA TCGGGCAGAC CGTGAAGCTG CATCGCGTGG TGGCGCGTGA TCGCGAGCGG CTGATGGCGG AAAGCCATCG CCTGCAGAAA GAGCTGTACG AGTTGAAGCC GCAGCGCGAG CGCAAGCGCG TCCGGGTCGA CGGCATCGTC GGCGAGAGCC CGGCGATCCG CACGTTGCTC GCCAAGGTCA GCATCATCGC CAAATCGCAG TCGCCGGTGC TTTTGCGCGG CGAGTCCGGC ACCGGCAAGG AACTGATCGC CAAGGCGATC CACGAATTGT CGGCGCGCGC CAACGGGCCC TTCATCAAGA TCAACTGCGC CGCGCTGCCG GAGTCGGTGT TGGAATCCGA ACTGTTCGGC CACGAGAAGG GGGCTTTCAC CGGCGCGATC GCTTCACGCA AGGGCCGGTT CGAACTCGCC GACAAGGGCA CGCTGTTTCT CGACGAGATC GGCGAGATCT CGGCCTCGTT CCAGGCCAAG CTGCTGCGCG TGCTGCAGGA GCAGGAGTTC GAGCGGGTCG GCGGCAACCA GACCATCAAG GTCAACGTCC GAATCGTCGC GGCGACCAAC CGCAATCTCG AAGAGGCCGT CGCCCGCAAG GAGTTCCGCG CCGATCTGTA CTACCGCATC AACGTCGTGC CGATGATCCT GCCGCCGCTG CGCGATAGGC CGACCGATAT CCCATTGCTG GCGAGCGAGT TCCTGAAGAA CTTCAACAAG GAGAACGATC GCGAACTGCA ATTCGAGCCG CATGCGCTGG AATTGCTGAA GGCGTGCTCG TTCCCGGGCA ACGTTCGCGA ACTCGAGAAC TGCGTGCGGC GCACGGCGAC GTTGGCGATC GGGCCGGAAA TTACCGACAG CGATTTCGCC TGCCATCAGG ACGAATGCCT GTCGGCGATC TTGTGGAAGG GCCACGCCGA ACCGGCACCG GTACGGCCGC GGCCGCAGAT TCCGCTGCAG GTGATGCCGC GCAAGGCGCC GCTCGAAGTC GTGGCGCCGC GCGAGGCCGT GAGTGTGTCG CCCGATCCGG TGTCGACACC CATGTCCGCC GAATCGGCCA ACGGCGGGCC GATGTCGGAG CGCGAGCGCT TGGTCAACGC GATGGAGCGA TCCGGCTGGG TCCAGGCAAA GGCCGCCCGG CTGCTCGGCC TGACGCCGCG GCAGATCGGC TACGCGCTGA AGAAGTACGA TATCGAGCTC AAACACTTCT GA
|
Protein sequence | MAQREVRLVE SEQSRQPMNQ NPIPLSEIAL TGIFEISKIL TAPARLEVTL ANVVNLLQSF LQMRNGVVSL LADDSVPDIT VGVGWNEGSD NRYRARLPQK AIDQIVATSV PLVADNVAAH PMFSAADALA LGATDETRVS FIGVPIRIDS RVVGTLTIDR VRDGQSIFRM DADVRFLTMV ANLIGQTVKL HRVVARDRER LMAESHRLQK ELYELKPQRE RKRVRVDGIV GESPAIRTLL AKVSIIAKSQ SPVLLRGESG TGKELIAKAI HELSARANGP FIKINCAALP ESVLESELFG HEKGAFTGAI ASRKGRFELA DKGTLFLDEI GEISASFQAK LLRVLQEQEF ERVGGNQTIK VNVRIVAATN RNLEEAVARK EFRADLYYRI NVVPMILPPL RDRPTDIPLL ASEFLKNFNK ENDRELQFEP HALELLKACS FPGNVRELEN CVRRTATLAI GPEITDSDFA CHQDECLSAI LWKGHAEPAP VRPRPQIPLQ VMPRKAPLEV VAPREAVSVS PDPVSTPMSA ESANGGPMSE RERLVNAMER SGWVQAKAAR LLGLTPRQIG YALKKYDIEL KHF
|
| |