Gene RoseRS_4160 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4160 
Symbol 
ID5211144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5208597 
End bp5210147 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content60% 
IMG OID640597749 
Productsignal transduction histidine kinase, nitrogen specific, NtrB 
Protein accessionYP_001278454 
Protein GI148658249 
COG category[T] Signal transduction mechanisms 
COG ID[COG3852] Signal transduction histidine kinase, nitrogen specific 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGCTT CTGTACAACT GGCGATCCAG AATAACTTGA ACGCAAGCAT TCGTGACATT 
GCGCTGAACC TGTGCGCAGC GACCGGTGCA AAAGCATGCT ACCTGACGTT GCGTGAACGA
CCCGGCGCCT ATCCCCGTTT TATCGGCGCT GCCGGTGTGC ACGGCGATCC TCCCGATTGG
AACGACATCG GCGGCGCCCA CATCCTGCGG CAGATCGGGT ATGGCGGGAC GATGGTCATG
CCGGATTGTA TGACATCGGC AATGGCGCTG TCGCTTTCTT CCGGCGGGCG CGATGTCGGT
GTCGCCATTC TGCTGTTCGA TGGCGAACCG CCGTCGGTAT CAGATACGAT GGTCGCCGCA
GCGCGCCTGG CAGCGCAGGC GATCGATGCG TCATGGCAAC TCGCCATGCT GCACGATCAG
GCGGAGCAGA TCGCCGAGCG CACCCGCCTG CGCGAGATTC AGTTGAGCCG CAACCTGATC
CGTGGGGTTA TCGACAGTGT GCCGATGGGA CTGGTGCTGA TCGATCCAAC CGGGTATGTG
CTGGCGGCAA ACCGTGCGCT CGCAGGGCGT TTTGGGCTTG AGCCGGCGAT GCTGGTCGGC
AGGTTCTACG ATGATGTGCT CGGAGCATGG AGTGAGTCGG CTGCATCACG CACCTTTGTT
GAACGCCAAC CGCAACGCCT GCGCCGCACG CTTCAGCGTC CCGGCAGCAG TGATGCGCTG
ATCGAGATTG CGAGTTTCCC GCTCTTCGAC GCAGCCGGAG CAGTGTATCA GGTGGTCGAA
GTATGGGAAG ACATCACCGA GCGCGTGGCG CTGCAAACCC AACTCGTACG TGCCGAAAAA
CTGGCGGCGA TCGGTCATCT CGCCGCCAGT ATTGCGCACG AAGTCGGCAA TCCGCTCCAG
GCTATTCAGG GATTTCTGGC GTTGTTCCTG GAACAGTGCG CTCCCGAAAC GCCCAATCAG
CACTTCCTGC GCCTCGCGGA AGAAGAGATC GAACGGATCG TGCGGGTGCT GGAACGGTTG
CGCGACCTGT ACCGTCCGCG CGCCGATGTT TTCACGAACG TCGATGTCAA TGAGTTGATC
GAGAGTGTTC TGTTGCTGAC CGGTAAGCAA CTCGAGCGTT CGCGCATCCG GGTGGTGCGC
GAACTGACCC CCAACCTCCC CACGATCCAG GGTGTCGCCG ATCAACTGAA GCAGGTGCTG
CTCAATCTGG TGCTGAACGC TGCAGAAGCC ATGCCCAACG GCGGCATATT GCACGTGCAA
ACGCACCGCG CTCATCTGGC GTCGGGTCAG GAGGCAATCG CCATTGCGAT CACCGATACC
GGGGTTGGCA TTCCGCCGGA GCAGTTGACC CGCATTTTCG ATGGGTTGCA TACCACGAAA
GAGCGTGGCA TGGGATTGGG GTTGTATACC AGCAAGGCGA TTGTGGAACG TCATCTGGGG
AGTATCAGCG TGCAGAGCAT TCCCGGCGAG GGAACAACCT TCGAGATTAT GGTACCGATC
AGGCATAAGG AGAGCCGCCA TGAAGCAACC GGCGAAAATC CTGGTCGTTG A
 
Protein sequence
MGASVQLAIQ NNLNASIRDI ALNLCAATGA KACYLTLRER PGAYPRFIGA AGVHGDPPDW 
NDIGGAHILR QIGYGGTMVM PDCMTSAMAL SLSSGGRDVG VAILLFDGEP PSVSDTMVAA
ARLAAQAIDA SWQLAMLHDQ AEQIAERTRL REIQLSRNLI RGVIDSVPMG LVLIDPTGYV
LAANRALAGR FGLEPAMLVG RFYDDVLGAW SESAASRTFV ERQPQRLRRT LQRPGSSDAL
IEIASFPLFD AAGAVYQVVE VWEDITERVA LQTQLVRAEK LAAIGHLAAS IAHEVGNPLQ
AIQGFLALFL EQCAPETPNQ HFLRLAEEEI ERIVRVLERL RDLYRPRADV FTNVDVNELI
ESVLLLTGKQ LERSRIRVVR ELTPNLPTIQ GVADQLKQVL LNLVLNAAEA MPNGGILHVQ
THRAHLASGQ EAIAIAITDT GVGIPPEQLT RIFDGLHTTK ERGMGLGLYT SKAIVERHLG
SISVQSIPGE GTTFEIMVPI RHKESRHEAT GENPGR