Gene Information |
Locus tag | Rpal_1968 |
Symbol | |
ID | 6409628 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 2124979 |
End bp | 2127780 |
Gene Length | 2802 bp |
Protein Length | 933 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642711854 |
Product | signal transduction histidine kinase, nitrogen specific, NtrB |
Protein accession | YP_001990966 |
Protein GI | 192290361 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3852] Signal transduction histidine kinase, nitrogen specific |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
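The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon; both are assumptions, since the record does not state them. The variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 2124979
end_bp = 2127780
reported_gene_length = 2802      # bp
reported_protein_length = 933    # aa

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, gene_length == reported_gene_length)
print(protein_length, protein_length == reported_protein_length)
```

The strand field does not affect this check: a gene on the minus strand spans the same number of bases between the two coordinates.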
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCGCAAGG GGCTGACGCT TTCGACCAGG CTGGCGATCT TGGTGATGTC GACGGCAATG CTGACCGCAA GCGGGGTCGG CTACCTCGGC TATCGCAACA TCGCGCCGGT GGCGATCGAG CGGACTTTAG CCGGGCTCGA CGCCAATGCG AGCTGGCAGG CGCGCGAGCT GTCCCACCTT GTCAACGGCG CCACCGCCGA TCTGATGGGC TTCCGCCAGA TCATCGGCAT CGACGAGCTG ATCGAACTCA GCCTCGACCC GTCAAAGACC GTCGCCGGCG GGGGGAGCCT GCCGCAATGG CGCGAACGGA TCGGCCACCG CCTGGCCCTG GAACTGGAAA ACAAGCCGGA CTTCGCACGC TACCGGCTGA TCGGCGTCGC CAACCAGGGC CGCGACATCG TCCGGGTCGA ACGTCAGACA ACCGGCAAAA TCCACGTGGT TCCAGATGAA CAGTTAGCGT TCGCGGGTGG ACGCGAGATC ATCGAGCTCG GCATGTCGGC CAAGGACGGC GAGGTTCTGA TCTCGAACGT CGAATTCGAG CCGATCGAGC AGCCCTCAGC CAAAACCCCA CAGCACGACG CAGCGGCGCT GCGTCCCATC ATTCGGGTCG TCACCCCGGT GTTCAGCGAC GAAGGCGCAC GGTTCGGCGT TCTGGCAGCG ACGATCGACC TGAGAAGGCC GTTCGAGCGG CTGAGAGATC CGGTGCGGGA GTCGTCGAAA ATCTATGTGG TGGACGACAG GGGCCACTAC CTGTTCCACC CGGACGCCTC CCGAGGCGGC TTGGCGATCG GCTTTCCCTC GACCCTGGAG CAGGATTTTC CCGCGCTTGC CGAGGCGCTG GCGAACAACC GCTGGACTCC GGCGGTGATC GAAAGCCGCA ACGGCGACCG GTTCGGCGTC GCCTATCAGC CGATGAACGC CGGCACCGAG ACGCCCCTGG CGCTGGTCGA AGCGATCGCC GAACACGACA TGATCCGCGG CCCGATGCTG GCATGGGGAA AGTCGACGCT GGTCGGCGGC AGCTTCGCGG TGCTGGTGGC AATAGCGCTG GCGGTGGCAT TCGCCCGCAG CCTGGCGCGG CCACTCTCGG AGATGACCCG CGCAGTCGAA AGCGTCCGTG GCGGCGGGCC GCTGACGCTG CCGCGTAACG CCGGCGGCGA GATCGGCGTC CTGGCCGAGG CCTTCTCGTC GACGATGCAG GAGTCGCGCG AGAAGACCGC GGCGCTCCGC CGCGAGAAGG AGATCTTCGA GTCGATCATG AACGCGATGG CCGAGGCCGT GCTGCTGGTC GATACCGAAG GGGTGATCGT CTATGAGAAC CCCGCCGCGG TGGCGCTGCG CACCTCGCCC ACCGGCATCA CTGGCCCGAC CTGGGAAACC TCGGTCGAGT CGTTCCTCGC CGACGGCGTC ACGCCGCTGC AGGTCGATCA GCGTCCCGGA CGGCGGGCGA TGCGCGGCGA GCCGATCGAT CGCTTCGAGT TCGTCGTCCA TGTGCTCGGC AGCGACAAGA TTGTCTATGT CTCCGGCAAC GCGCGGCCGA TCCGCGAGGC CGACGGCACG ATCAGCGGCG CGGTCGTGGT GTTCAGCGAC GTCTCCGAGC TGAAGGAGAC CGAGCGGCGG CTGCACCAGG CGCAGAAGCT GGAGGCGATC GGCCAGCTCA CCGGCGGGGT CGCACACGAC TTCAACAATA TGCTGACGGT GATCAGCGGC ACCGCCGAGA TCCTGCTCGA CGAGCTGACC GATCGCCCGG ACCTCGTCAC CATCGCCAAG ATGATCGATC AGGCCGCCGA ACGCGGCGCC 
GATCTGACCC GGCAACTGCT CGCCTTCGCC CGCAAACAGC CGCTGCAGCC GCGCAATATC GACGTCAACA CCGTGGTGTC GAATATCAAG CAGCTGCTGC GGCCAACCAT CGGCGAGCAC ATCGAGATCG ACACCCGGCT CGATCCCACA GTCGATCCCG CGCTGATCGA TCCGTCGCAA CTGTCCTCGG CGCTGCTGAA TCTTGCCGTC AATGCCCGCG ACGCAATGCC GAACGGCGGC AAGCTGCTGT TCGAAACCGC CAATGTGATG CTCGACGACG ACTACGCCGA GCACCACCCC GAGGTGAAGC CGGGCCGCTA CGTGATGATC GCGGTCAGCG ACTCCGGCTT CGGCATGGCG CCGGACGTGC TGGAGAAAGC ATTCGAGCCG TTCTTCACCA CCAAGAGCGT CGGCAAGGGC ACCGGCCTCG GGCTGAGCAT GGTGTACGGC TTCGTCAAAC AGTCCAACGG CCACGTCCAG ATCTACAGCG AGGAGCAGCA CGGCACCACG ATCCGGCTGT ATCTGCCGCG CGCGGACTCC GACATCGATG CCCTGCCCTC GATCACGCCG GTCGAAGGCG GCACCGAGAC CATCCTGCTG GTCGAAGACG ACGAGCTGGT GCGCAACTTC GCGCTCGCCC AGCTCCGAGG TCTCGGCTAT CGCACCATCG CGGCCGCCGA CGGCGCCGCA GCGTTGGCGG AAGTGCGGCG CGGCACGCCG TTCGATCTTC TGCTCACCGA CATCATCATG CCCGGCGGCA TGAACGGCCG CGAGCTTGCC GACGCGGTGG CGCGGCTGCG GCCGGTGAAG GTACTGTACA CCTCGGGCTA CACCGAGAAT GCGATCATGC ATCACGGCCG GCTCGATCCC GGCGTGCTGC TGCTGTCCAA GCCGTTCCGC CGCGCCGATC TGGCGCGGCT GGTGCGCGCC GCACTGAACC GCGCCGATCA CCAAACTTCT GGTGATACGG CAGGTACAGA CCGCAAGAGC GCGGCGAACT AA
|
Protein sequence | MRKGLTLSTR LAILVMSTAM LTASGVGYLG YRNIAPVAIE RTLAGLDANA SWQARELSHL VNGATADLMG FRQIIGIDEL IELSLDPSKT VAGGGSLPQW RERIGHRLAL ELENKPDFAR YRLIGVANQG RDIVRVERQT TGKIHVVPDE QLAFAGGREI IELGMSAKDG EVLISNVEFE PIEQPSAKTP QHDAAALRPI IRVVTPVFSD EGARFGVLAA TIDLRRPFER LRDPVRESSK IYVVDDRGHY LFHPDASRGG LAIGFPSTLE QDFPALAEAL ANNRWTPAVI ESRNGDRFGV AYQPMNAGTE TPLALVEAIA EHDMIRGPML AWGKSTLVGG SFAVLVAIAL AVAFARSLAR PLSEMTRAVE SVRGGGPLTL PRNAGGEIGV LAEAFSSTMQ ESREKTAALR REKEIFESIM NAMAEAVLLV DTEGVIVYEN PAAVALRTSP TGITGPTWET SVESFLADGV TPLQVDQRPG RRAMRGEPID RFEFVVHVLG SDKIVYVSGN ARPIREADGT ISGAVVVFSD VSELKETERR LHQAQKLEAI GQLTGGVAHD FNNMLTVISG TAEILLDELT DRPDLVTIAK MIDQAAERGA DLTRQLLAFA RKQPLQPRNI DVNTVVSNIK QLLRPTIGEH IEIDTRLDPT VDPALIDPSQ LSSALLNLAV NARDAMPNGG KLLFETANVM LDDDYAEHHP EVKPGRYVMI AVSDSGFGMA PDVLEKAFEP FFTTKSVGKG TGLGLSMVYG FVKQSNGHVQ IYSEEQHGTT IRLYLPRADS DIDALPSITP VEGGTETILL VEDDELVRNF ALAQLRGLGY RTIAAADGAA ALAEVRRGTP FDLLLTDIIM PGGMNGRELA DAVARLRPVK VLYTSGYTEN AIMHHGRLDP GVLLLSKPFR RADLARLVRA ALNRADHQTS GDTAGTDRKS AAN
|
| |
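The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, in which TTG can act as an initiator codon and is then decoded as methionine. The following is a minimal sketch of that translation, not the pipeline that produced this record. The function name `translate_cds` and the reduced start-codon set are assumptions made for illustration; table 11 defines further alternative start codons that are omitted here.

```python
from itertools import product

# Codon table in the conventional NCBI ordering (bases T, C, A, G).
# Sense codons in table 11 match the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: only the most common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a CDS, reading an initial start codon as M and stopping at '*'."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is decoded as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGCGCAAGG GGCTGACGCT TTCGACCAGG"))
```

The printed residues can be compared against the first ten residues of the protein sequence field.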