Gene Rsph17029_1457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1457 
Symbol 
ID4898069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1520258 
End bp1521364 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content67% 
IMG OID640112045 
Productsignal transduction histidine kinase, nitrogen specific, NtrB 
Protein accessionYP_001043339 
Protein GI126462225 
COG category[T] Signal transduction mechanisms 
COG ID[COG3852] Signal transduction histidine kinase, nitrogen specific 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.291348 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTCC GTTCGCCCTA TCCCGTTCCC GGGGTGATCT GGGCCAGCCT TCCGCTTCCG 
GCGGTGCTCA TCAACCCCGA CGGCATCATC ATCGAATCGA ACCCGGCGGC CGAGGCCTTC
CTCAATGCCT CGTCGAAGAG CCTGCAGGGA CAGCCCGCCT TCGACCGGAT CCTGATAGAC
GCGCCGGTGG ACGAGGCCCT GTCGCGGGCG CGGGCGAACC AGTCGCCCAT CTTCATCAAC
GATGTGGATG TGACCTCGGG CGAGCGTCCG CCGGTGCAGT GCAACATCCA GATCGCGCCG
TTGCACGACA ATGCCGAGAT CGTCATGCTG CTGATCTCGC CGCGCGAGAT CGCTGACCGG
CTGGGGCGGG CGACGGCGGC CAAGTCTGCC GCCAAATCCG CCATCGGCAT GGCCGAGATG
CTGGCGCACG AGATCAAGAA CCCGCTTGCG GGCATTTCGG GGGCGGCGCA GCTCATTGCC
ATGAACCTCT CGGCGGAGGA TCGCGAGCTG GCCGATCTGA TCGTCGAGGA GACGCGCCGC
ATCGTGAAGC TTCTCGAACA GGTGGAGCAG TTCGGCAATC TGCGCCCGCC CGAGCGGCGG
GCGGTGAACA TCCACGATGC GCTCGACCGG GCGCGGAAGT CGGCGGCGGT GGGATTTGCC
GCCAAGATGC GGATCACCGA GGAATACGAC CCCTCGCTGC CCGCTACCTA TGCCGATGCG
GATCAGCTGA TGCAGGTGTT CCTGAACCTC ATCAAGAACG CGGCCGAGGC CGCGGGCCCG
CAGGGCGGGC GCATCCGGCT GCGGACCTTC TACGACATCT CGCTGCGGCT GCGCCGGGCG
GACGGGTCGG GAGGCGCGCT GCCCCTGCAG GTCGAGATCA TCGACGACGG CCCGGGCATC
GCGCCCGACA TCGCAAAGGA AATCTTCGAG CCCTTCGTCT CGGGCCGCGA GAATGGCACC
GGCCTCGGGC TCGCGCTGGT CAACAAGATC ATATCCGACC ACGGCGGATG GATTTCCGTC
GATTCGGCCC CCGGACGCAC CGTGTTCCGG GTCTCGCTGC CGGTGGCGCC CCGGGAAGCG
GCGTCCAATG ACATGGAGGT GAAGTGA
 
Protein sequence
MSFRSPYPVP GVIWASLPLP AVLINPDGII IESNPAAEAF LNASSKSLQG QPAFDRILID 
APVDEALSRA RANQSPIFIN DVDVTSGERP PVQCNIQIAP LHDNAEIVML LISPREIADR
LGRATAAKSA AKSAIGMAEM LAHEIKNPLA GISGAAQLIA MNLSAEDREL ADLIVEETRR
IVKLLEQVEQ FGNLRPPERR AVNIHDALDR ARKSAAVGFA AKMRITEEYD PSLPATYADA
DQLMQVFLNL IKNAAEAAGP QGGRIRLRTF YDISLRLRRA DGSGGALPLQ VEIIDDGPGI
APDIAKEIFE PFVSGRENGT GLGLALVNKI ISDHGGWISV DSAPGRTVFR VSLPVAPREA
ASNDMEVK