Gene Rpal_1968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1968 
Symbol 
ID6409628 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp2124979 
End bp2127780 
Gene Length2802 bp 
Protein Length933 aa 
Translation table11 
GC content66% 
IMG OID642711854 
Productsignal transduction histidine kinase, nitrogen specific, NtrB 
Protein accessionYP_001990966 
Protein GI192290361 
COG category[T] Signal transduction mechanisms 
COG ID[COG3852] Signal transduction histidine kinase, nitrogen specific 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCGCAAGG GGCTGACGCT TTCGACCAGG CTGGCGATCT TGGTGATGTC GACGGCAATG 
CTGACCGCAA GCGGGGTCGG CTACCTCGGC TATCGCAACA TCGCGCCGGT GGCGATCGAG
CGGACTTTAG CCGGGCTCGA CGCCAATGCG AGCTGGCAGG CGCGCGAGCT GTCCCACCTT
GTCAACGGCG CCACCGCCGA TCTGATGGGC TTCCGCCAGA TCATCGGCAT CGACGAGCTG
ATCGAACTCA GCCTCGACCC GTCAAAGACC GTCGCCGGCG GGGGGAGCCT GCCGCAATGG
CGCGAACGGA TCGGCCACCG CCTGGCCCTG GAACTGGAAA ACAAGCCGGA CTTCGCACGC
TACCGGCTGA TCGGCGTCGC CAACCAGGGC CGCGACATCG TCCGGGTCGA ACGTCAGACA
ACCGGCAAAA TCCACGTGGT TCCAGATGAA CAGTTAGCGT TCGCGGGTGG ACGCGAGATC
ATCGAGCTCG GCATGTCGGC CAAGGACGGC GAGGTTCTGA TCTCGAACGT CGAATTCGAG
CCGATCGAGC AGCCCTCAGC CAAAACCCCA CAGCACGACG CAGCGGCGCT GCGTCCCATC
ATTCGGGTCG TCACCCCGGT GTTCAGCGAC GAAGGCGCAC GGTTCGGCGT TCTGGCAGCG
ACGATCGACC TGAGAAGGCC GTTCGAGCGG CTGAGAGATC CGGTGCGGGA GTCGTCGAAA
ATCTATGTGG TGGACGACAG GGGCCACTAC CTGTTCCACC CGGACGCCTC CCGAGGCGGC
TTGGCGATCG GCTTTCCCTC GACCCTGGAG CAGGATTTTC CCGCGCTTGC CGAGGCGCTG
GCGAACAACC GCTGGACTCC GGCGGTGATC GAAAGCCGCA ACGGCGACCG GTTCGGCGTC
GCCTATCAGC CGATGAACGC CGGCACCGAG ACGCCCCTGG CGCTGGTCGA AGCGATCGCC
GAACACGACA TGATCCGCGG CCCGATGCTG GCATGGGGAA AGTCGACGCT GGTCGGCGGC
AGCTTCGCGG TGCTGGTGGC AATAGCGCTG GCGGTGGCAT TCGCCCGCAG CCTGGCGCGG
CCACTCTCGG AGATGACCCG CGCAGTCGAA AGCGTCCGTG GCGGCGGGCC GCTGACGCTG
CCGCGTAACG CCGGCGGCGA GATCGGCGTC CTGGCCGAGG CCTTCTCGTC GACGATGCAG
GAGTCGCGCG AGAAGACCGC GGCGCTCCGC CGCGAGAAGG AGATCTTCGA GTCGATCATG
AACGCGATGG CCGAGGCCGT GCTGCTGGTC GATACCGAAG GGGTGATCGT CTATGAGAAC
CCCGCCGCGG TGGCGCTGCG CACCTCGCCC ACCGGCATCA CTGGCCCGAC CTGGGAAACC
TCGGTCGAGT CGTTCCTCGC CGACGGCGTC ACGCCGCTGC AGGTCGATCA GCGTCCCGGA
CGGCGGGCGA TGCGCGGCGA GCCGATCGAT CGCTTCGAGT TCGTCGTCCA TGTGCTCGGC
AGCGACAAGA TTGTCTATGT CTCCGGCAAC GCGCGGCCGA TCCGCGAGGC CGACGGCACG
ATCAGCGGCG CGGTCGTGGT GTTCAGCGAC GTCTCCGAGC TGAAGGAGAC CGAGCGGCGG
CTGCACCAGG CGCAGAAGCT GGAGGCGATC GGCCAGCTCA CCGGCGGGGT CGCACACGAC
TTCAACAATA TGCTGACGGT GATCAGCGGC ACCGCCGAGA TCCTGCTCGA CGAGCTGACC
GATCGCCCGG ACCTCGTCAC CATCGCCAAG ATGATCGATC AGGCCGCCGA ACGCGGCGCC
GATCTGACCC GGCAACTGCT CGCCTTCGCC CGCAAACAGC CGCTGCAGCC GCGCAATATC
GACGTCAACA CCGTGGTGTC GAATATCAAG CAGCTGCTGC GGCCAACCAT CGGCGAGCAC
ATCGAGATCG ACACCCGGCT CGATCCCACA GTCGATCCCG CGCTGATCGA TCCGTCGCAA
CTGTCCTCGG CGCTGCTGAA TCTTGCCGTC AATGCCCGCG ACGCAATGCC GAACGGCGGC
AAGCTGCTGT TCGAAACCGC CAATGTGATG CTCGACGACG ACTACGCCGA GCACCACCCC
GAGGTGAAGC CGGGCCGCTA CGTGATGATC GCGGTCAGCG ACTCCGGCTT CGGCATGGCG
CCGGACGTGC TGGAGAAAGC ATTCGAGCCG TTCTTCACCA CCAAGAGCGT CGGCAAGGGC
ACCGGCCTCG GGCTGAGCAT GGTGTACGGC TTCGTCAAAC AGTCCAACGG CCACGTCCAG
ATCTACAGCG AGGAGCAGCA CGGCACCACG ATCCGGCTGT ATCTGCCGCG CGCGGACTCC
GACATCGATG CCCTGCCCTC GATCACGCCG GTCGAAGGCG GCACCGAGAC CATCCTGCTG
GTCGAAGACG ACGAGCTGGT GCGCAACTTC GCGCTCGCCC AGCTCCGAGG TCTCGGCTAT
CGCACCATCG CGGCCGCCGA CGGCGCCGCA GCGTTGGCGG AAGTGCGGCG CGGCACGCCG
TTCGATCTTC TGCTCACCGA CATCATCATG CCCGGCGGCA TGAACGGCCG CGAGCTTGCC
GACGCGGTGG CGCGGCTGCG GCCGGTGAAG GTACTGTACA CCTCGGGCTA CACCGAGAAT
GCGATCATGC ATCACGGCCG GCTCGATCCC GGCGTGCTGC TGCTGTCCAA GCCGTTCCGC
CGCGCCGATC TGGCGCGGCT GGTGCGCGCC GCACTGAACC GCGCCGATCA CCAAACTTCT
GGTGATACGG CAGGTACAGA CCGCAAGAGC GCGGCGAACT AA
 
Protein sequence
MRKGLTLSTR LAILVMSTAM LTASGVGYLG YRNIAPVAIE RTLAGLDANA SWQARELSHL 
VNGATADLMG FRQIIGIDEL IELSLDPSKT VAGGGSLPQW RERIGHRLAL ELENKPDFAR
YRLIGVANQG RDIVRVERQT TGKIHVVPDE QLAFAGGREI IELGMSAKDG EVLISNVEFE
PIEQPSAKTP QHDAAALRPI IRVVTPVFSD EGARFGVLAA TIDLRRPFER LRDPVRESSK
IYVVDDRGHY LFHPDASRGG LAIGFPSTLE QDFPALAEAL ANNRWTPAVI ESRNGDRFGV
AYQPMNAGTE TPLALVEAIA EHDMIRGPML AWGKSTLVGG SFAVLVAIAL AVAFARSLAR
PLSEMTRAVE SVRGGGPLTL PRNAGGEIGV LAEAFSSTMQ ESREKTAALR REKEIFESIM
NAMAEAVLLV DTEGVIVYEN PAAVALRTSP TGITGPTWET SVESFLADGV TPLQVDQRPG
RRAMRGEPID RFEFVVHVLG SDKIVYVSGN ARPIREADGT ISGAVVVFSD VSELKETERR
LHQAQKLEAI GQLTGGVAHD FNNMLTVISG TAEILLDELT DRPDLVTIAK MIDQAAERGA
DLTRQLLAFA RKQPLQPRNI DVNTVVSNIK QLLRPTIGEH IEIDTRLDPT VDPALIDPSQ
LSSALLNLAV NARDAMPNGG KLLFETANVM LDDDYAEHHP EVKPGRYVMI AVSDSGFGMA
PDVLEKAFEP FFTTKSVGKG TGLGLSMVYG FVKQSNGHVQ IYSEEQHGTT IRLYLPRADS
DIDALPSITP VEGGTETILL VEDDELVRNF ALAQLRGLGY RTIAAADGAA ALAEVRRGTP
FDLLLTDIIM PGGMNGRELA DAVARLRPVK VLYTSGYTEN AIMHHGRLDP GVLLLSKPFR
RADLARLVRA ALNRADHQTS GDTAGTDRKS AAN