Gene Saro_1927 details

Gene Information

Locus tag: Saro_1927
Symbol:
ID: 3917150
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2038957
End bp: 2040078
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 64%
IMG OID: 640444673
Product: signal transduction histidine kinase, nitrogen specific, NtrB
Protein accession: YP_497201
Protein GI: 87199944
COG category: [T] Signal transduction mechanisms
COG ID: [COG3852] Signal transduction histidine kinase, nitrogen specific
TIGRFAM ID:
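
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon (the function name `cds_lengths` is hypothetical, not part of any database API):

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Coordinates taken from the record above.
print(cds_lengths(2038957, 2040078))  # -> (1122, 373)
```

Under these assumptions the result agrees with the listed Gene Length (1122 bp) and Protein Length (373 aa).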


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.109087
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCGCGC TCCCTGACTG GCGTCGTCAG GAACCGGGGC AGAAGCCCGA ACCGAAGCGC 
GTCGTCGCGA GCCTTCCGCT TGCGCTGCTG CAGCTCGATC CCGACCTCAT CGTGGCCGCG
GTAAACCCGG CGGCCGAACA ACTCATGGGG CAGGGCGCAC GCCGGATCGT TGGAAAGTCG
GTCGCAGAAC TGTTCGAATT CGAGGAGCCG CTTATCCTCG GCCGCCTGGC TGAAGGTGAA
GCCCAACTTT TCGCGCGCGG AGTTGGCGTG CGCATCATGG GGCAGCCCGC GCGACGTTTT
GACGTGATGA CCAGCCCCGT GACTCATTGC CCCGGCTGGC AGCTCCTGAT GCTTCACGAA
GGTGTGGGCG TCGAGGCCCT GTCTGGCGAC GGTCGCGGCG CAGGAGGCGG GGAGGGTGTT
GCTTTGCGCG CACCCGAAGT CCTTGCCCAC GAGATCAAGA ACCCGCTGGC CGGCATAAAG
GGCGCGGCGC AGCTTCTTGA TCGCAAGCTG TCCGAAAGCG ATCGCGCGAT GACCGGCCTG
ATCACCGCCG AGGTCGACCG TATCGCCAAA CTGATCGACC AGATGCAGTC GCTTTCCCGG
CGGAGCGCCG AACCCGCGCA GCCGTGCAAT CTGCACGAAG CTGTCCGCCG GGCCGAAGCG
GTGCTTGCAG CGGCCAGCCC GGAATCGGTC ACGATCGTCG AGGAGTTCGA CCCCTCGCTC
CCGCCGATCA TGGCCAATCC GGATTCGCTC GTCCAGGTTC TGCTTAACCT GCTGAGCAAT
GCGCGCGAAG CCTGCCTCGC CAATGAAGAG CCGCGCATCA TCGTGCGCAC GCGCTTTGCA
AGCGGCATTC AGCTACATGC CGGCCCCGGT GGAAGGCCCC TTCGCCTGCC TATCGAATTG
CGCGTATCCG ACAACGGACC GGGTATCGAT CCCACATTGC GCGACCACAT CTTCGAACCC
TTTGTCACCG CAAAGAAGAA CGGCCAGGGC CTTGGTCTTG CCCTTGTCCA GAAGCTGGTG
CGAGAGATGA ATGGCCGCAT TACCCATGAT CGCGACGAGG TGGGTGGCTG GACCCATTTT
CGCATCCATC TTCCTGTCGC CGGATCGGTT CCCACCGAAT GA
 
Protein sequence
MIALPDWRRQ EPGQKPEPKR VVASLPLALL QLDPDLIVAA VNPAAEQLMG QGARRIVGKS 
VAELFEFEEP LILGRLAEGE AQLFARGVGV RIMGQPARRF DVMTSPVTHC PGWQLLMLHE
GVGVEALSGD GRGAGGGEGV ALRAPEVLAH EIKNPLAGIK GAAQLLDRKL SESDRAMTGL
ITAEVDRIAK LIDQMQSLSR RSAEPAQPCN LHEAVRRAEA VLAAASPESV TIVEEFDPSL
PPIMANPDSL VQVLLNLLSN AREACLANEE PRIIVRTRFA SGIQLHAGPG GRPLRLPIEL
RVSDNGPGID PTLRDHIFEP FVTAKKNGQG LGLALVQKLV REMNGRITHD RDEVGGWTHF
RIHLPVAGSV PTE
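
The sequences above are printed in groups of ten characters, so they need whitespace removed before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, checked for GC content, and translated. The function names are hypothetical, and the codon table is the standard genetic code, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in the start codons it permits, which this sketch does not model).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard genetic code).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
fragment = clean_sequence("ATGATCGCGC TCCCTGACTG GCGTCGTCAG")
print(translate(fragment))  # -> MIALPDWRRQ
```

Applied to the full gene sequence, `gc_fraction` and `translate` could be compared against the GC content and protein sequence listed in this record.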