Gene Information |
Locus tag | RPC_4475 |
Symbol | |
ID | 3972487 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 4976682 |
End bp | 4978424 |
Gene Length | 1743 bp |
Protein Length | 580 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637927586 |
Product | transcriptional regulator NifA |
Protein accession | YP_534317 |
Protein GI | 90425947 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains |
TIGRFAM ID | [TIGR01817] Nif-specific regulatory protein |
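The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The sketch below assumes 1-based inclusive coordinates (which the reported 1743 bp implies) and that the gene length includes the stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(4976682, 4978424)
print(length)                  # 1743
print(protein_length(length))  # 580
```

Both values match the Gene Length and Protein Length fields in the record.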
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.391339 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCTATC GCGAAGCGCA CGTCGCCGAT GCCGAGGAAT CGCGCCCGAC CACTCTGATA CCTCTGAGCG AAATTGCTCT GACTGGTATC TTTGAGATCT CGAAGATCCT CACCGCGCCG GCGCGTCTCG AAACCACGCT CGCCAATGTC GTCAATCTGC TGCAGTCGTT CATGCAGATG CGGCATGGCA CGGTGTCGCT GCTGGCCGAC GATGGGGTGC CTGATATTAC CGTCGGCGCC GGCTGGAACG AAGGCACCGA CGACCGCTAC CGCGCCCGCC TGCCGGCCAA GGCGATCGAC CAGATCGTCG CGACCTCGGT GCCGCTGGTG GTCGAGAACG TTTCTTCGCA TCCGATGTTT TCGCGCGCCG ATGCCGACGC GCTCGGGGCC TCGCCCGAGG TCCGCGTTTC GTTCATCGGC GTGCCGATCC GGATCGATTC CCGGGTGGTC GGTACGCTGA CCATCGACCG CGTCCGCGAC GGCCGCTCGA TCTTCCGGCT CGACGCCGAC GTCCGCTTCC TCACCATGAT CGCCAATCTG ATCGGCCAGA CCGTCAAGCT GCACCGCGTG GTGGCGCGCG ACCGCGAGCG GCTGATGGCC GAGAGCCACC GGCTGCAGAA GCAATTGTCC GAGCTGAAAC CGCCGCGCGA GCGCAAGAAG GTCCGCGTCG ACGGCATCAT CGGCGAAAGC CAGGCGATCC GCGGGCTGCT CGCCAAGGTC GGCATCATCG CCAAATCGCA TTCGCCGGTG CTGCTGCGCG GCGAGTCCGG CACCGGCAAG GAGCTGATCG CCAAGGCGAT CCACGAATTG TCGTCGCGCG CCAACGGCCC GTTCATCAAG ATCAACTGCG CGGCGTTGCC CGAATCGGTG CTGGAATCCG AATTGTTCGG CCACGAGAAG GGCGCCTTCA CCGGCGCCAT CGCGTCGCGC AAGGGACGCT TCGAACTCGC CGACAAGGGC ACGCTGTTCC TCGACGAGAT CGGCGAAATC TCGCCGTCGT TCCAGGCCAA GCTGTTGCGG GTGTTGCAAG AGCAGGAGTT CGAGCGGGTC GGCGGCAACC ACACCATCAA GGTCAATGTC CGCGTGGTGG CGGCCACCAA CCGCAATCTC GAGGAGGCGG TGGCGCGCAA CGAATTCCGC GCCGACCTGT ACTATCGCAT CAATGTGGTG CCGATGATGC TGCCGCCGCT GCGTGATCGC GCCAGCGACA TTCCGCTGTT GGCCAGCGAG TTCCTGAAGA ACTTCAACAG GGAAAACGAG CGCGACCTGG AGTTCGATCC GGCCTCGATG GAGCTGCTGC AGGGGTGTTC GTTCCCCGGC AACGTGCGCG AGCTGGAAAA CTGCGTGCGC CGCACCGCGA CGCTGGCGCC CGGTCCGGCG ATTCACCAGG ACGACTTCGC CTGCCATCAT GACGAGTGCC TGTCGTCGAT TCTTTGGAAG AGCCATTCGG AGCGCACCGC GCAGCGTCCG CCGCCGGAAA TTCCGCTTGC AGTCGCACCG ATCGGCCGGG CCGACGGCCC CCGCGGCAAC GTTGCAGCAC CGGCGCCCAC CGTGCCGACG CCGCAGCCGC CCGCCCGCGT CGAAGCGGCC TCCGACGCGC AGATGTCCGA GCGCGAGCGG CTGGTCGACG CCATGGAACG CTCGGGCTGG GTGCAGGCCA AGGCGGCGCG CATCCTCGGG CTGACGCCGC GGCAGATCGG CTACGCGCTG AAGAAGTACG ACATCGAGGT CAAGCACTTC TGA
|
Protein sequence | MVYREAHVAD AEESRPTTLI PLSEIALTGI FEISKILTAP ARLETTLANV VNLLQSFMQM RHGTVSLLAD DGVPDITVGA GWNEGTDDRY RARLPAKAID QIVATSVPLV VENVSSHPMF SRADADALGA SPEVRVSFIG VPIRIDSRVV GTLTIDRVRD GRSIFRLDAD VRFLTMIANL IGQTVKLHRV VARDRERLMA ESHRLQKQLS ELKPPRERKK VRVDGIIGES QAIRGLLAKV GIIAKSHSPV LLRGESGTGK ELIAKAIHEL SSRANGPFIK INCAALPESV LESELFGHEK GAFTGAIASR KGRFELADKG TLFLDEIGEI SPSFQAKLLR VLQEQEFERV GGNHTIKVNV RVVAATNRNL EEAVARNEFR ADLYYRINVV PMMLPPLRDR ASDIPLLASE FLKNFNRENE RDLEFDPASM ELLQGCSFPG NVRELENCVR RTATLAPGPA IHQDDFACHH DECLSSILWK SHSERTAQRP PPEIPLAVAP IGRADGPRGN VAAPAPTVPT PQPPARVEAA SDAQMSERER LVDAMERSGW VQAKAARILG LTPRQIGYAL KKYDIEVKHF
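The sequences above are displayed in 10-character blocks, and the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand). A minimal sketch of how the blocks might be joined, the GC fraction computed, and the CDS translated is given below. It uses only the Python standard library; the helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares for internal codons (table 11 differs in the alternative start codons it allows, which does not matter here because this gene starts with ATG).

```python
from itertools import product

# Codon table in NCBI base order (T, C, A, G); amino-acid assignments are
# identical for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(cds: str) -> str:
    """Translate a CDS given in coding orientation, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence in this record.
prefix = clean("ATGGTCTATC GCGAAGCGCA CGTCGCCGAT")
print(translate(prefix))  # MVYREAHVAD
```

The translated prefix matches the first block of the protein sequence. Applying `clean`, `gc_content`, and `translate` to the full gene sequence would allow the GC content and Protein Length fields to be checked in the same way.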