Gene RPB_1453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1453 
Symbol 
ID3908403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1640969 
End bp1642444 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content68% 
IMG OID637883347 
Productsignal transduction histidine kinase, nitrogen specific, NtrB 
Protein accessionYP_485074 
Protein GI86748578 
COG category[T] Signal transduction mechanisms 
COG ID[COG3852] Signal transduction histidine kinase, nitrogen specific 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.469828 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCTCA TCGGCTTCTG CTACGCGCAA TCGCGATGGT GGCGGATCGG CCTGGCGCTG 
TGCATCGTGC TGGCCGGGAT CGCCATCCGG GTCGCCCTGC ACGATTCGCT CGGCGATGGC
CTCACCTTCG TCACCCTGTA CCCCGCCGTC GCCGCCGCCG CCATGCTCGG CGGCGCTGCG
GCCGGAGTGA CCGCGACGAT CGTGGCGACG GCATCGACGC AACTCATGGT TGCTTCGCTC
GGCCGCGCTG ACAACCTGCT CGGGCTCGTC GTCTTTCTGT GCGGCTGCAT CGTCATCGTC
GCGATGGCCC AGGCGATGCG CGTGTTTCAC GCCCGCCTGA TCGAAGCCGC CGAGATCGGC
GAAAGCGAGC GGCAGCTCCG CCGCTTCGTC GAGGGGGTGC CCGCCGCGAT CGCCATGTTC
GACTGCAACA TGCGCTATCT CGCCGCCAGC ACGCGCTGGC TGTCCGCCTT CCACCCGACC
GAATCCGTCA TCGGCCGATT GCACTACGAC GTGTTTCCGA ACATTCGCGA GGAGTGGCGG
GAAGCGCATC GGCGTGGCCT CGCCGGCGAG GTGGTGCGCA ACAACGAAGA CTGCCACGAT
CGCGCCGACG GCGTGAAGCG ATGGGTCAGA TGGGAGGTGG CGCCGTGGCG CGACAGCCGC
GGCGAGATCG GGGGAATCAC CATCTATTTC GAAGACATCA CCGAGAACAG GGCGATGACG
GAGCGACTGG CGCAGGCGCA GCAGCTCGAG ACCGTCGGCC GGTTGTCCGG CGCCATCGCC
CACGACTTCA ACAACCTGCT CACCGTGATC GTCGGCAACG CCGAGCTGCT CCGCGAGCAG
CTCGAGCCGC ACGAGGATTT GAGGCAACTG GCCGAGCATA TCAGCAGCGC CGGTGACCGG
GCGGCCGAAC TGACCCAGCG TTTGCTGGCG TTCGGCCGAC GCCAGATCCT CGCCCCGGTG
GCGGTCGATC TGAACTGGCT GGTGCAGTCG GTCGCCCAGA TCGGAGCGGC TTCCGCGCGT
CGCGGCGTCA AGATCAGGAC CGTGCTGGAG CCGGCGCGGC CCCTCGTCCG CGCCGACCGG
CTGCAGCTCG AATCCGCCAT CCTCAACATC CTGATCAACG CCCAGGAAGC GGTCGCGGAC
GACAGCGGCC GGATCGCGAT CGTCACCGGC CTGGTCGTGT TCGGCGACCG GCAGGACCAG
GTCCGGCCCG GCCGCTACGG CGTGGTCGAG ATCACCGACA ACGGCGTCGG CATGAGCGAA
CGGGTCCGCG GCCAGGCGTT CGAGCCGTTC TTCACCACCA AGGAATTCGG CACCGCGAGC
GGACTGGGGT TGAGCATGGT GTATGGCTTC GTGAAACAGT CCGACGGGCA CGTCACGATC
GACAGCCAGC CCGGCGTCGG CACCACGATC CGGCTGTTTC TGCCCCTGGC GAACCCGACA
CCGTCCGACG GCGACGCGGC ACCATCGGGC GCGTGA
 
Protein sequence
MELIGFCYAQ SRWWRIGLAL CIVLAGIAIR VALHDSLGDG LTFVTLYPAV AAAAMLGGAA 
AGVTATIVAT ASTQLMVASL GRADNLLGLV VFLCGCIVIV AMAQAMRVFH ARLIEAAEIG
ESERQLRRFV EGVPAAIAMF DCNMRYLAAS TRWLSAFHPT ESVIGRLHYD VFPNIREEWR
EAHRRGLAGE VVRNNEDCHD RADGVKRWVR WEVAPWRDSR GEIGGITIYF EDITENRAMT
ERLAQAQQLE TVGRLSGAIA HDFNNLLTVI VGNAELLREQ LEPHEDLRQL AEHISSAGDR
AAELTQRLLA FGRRQILAPV AVDLNWLVQS VAQIGAASAR RGVKIRTVLE PARPLVRADR
LQLESAILNI LINAQEAVAD DSGRIAIVTG LVVFGDRQDQ VRPGRYGVVE ITDNGVGMSE
RVRGQAFEPF FTTKEFGTAS GLGLSMVYGF VKQSDGHVTI DSQPGVGTTI RLFLPLANPT
PSDGDAAPSG A