Gene Smal_0122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmal_0122 
Symbol 
ID6477671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStenotrophomonas maltophilia R551-3 
KingdomBacteria 
Replicon accessionNC_011071 
Strand
Start bp162245 
End bp163306 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content72% 
IMG OID642729255 
Productsignal transduction histidine kinase, nitrogen specific, NtrB 
Protein accessionYP_002026510 
Protein GI194363900 
COG category[T] Signal transduction mechanisms 
COG ID[COG3852] Signal transduction histidine kinase, nitrogen specific 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.545461 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.163107 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGACC CCGCACCGCC GCCATCCCTC GACGCCCTGG GCACGCCGCT GGCCTGGGCC 
GGGGCTGATG GCCGCATCAT CGGCTGCAAC CCGGCCTTCG CGCGTTGGCT GGGTGTCAGC
ATCCGCCGCC TGCTTGGCCA GCCGCTGGCC GCGCTGGAAG TGCAGGGCGA GGCGCTGGCC
CATTTCCTGG CCCGCGATGA GCGCGACAGT CTGCGCCTGA ACCGGCTGGC CCTGGCGGTG
CCGGGCGAGG CACCACGCTT CGCCGAGGGC TGGATGAGCC GCCGCGACGA TGGTGGCTGG
TTGCTGGAGG CGCACCCGGT CGATGAGTTC CCCGGGCTCG ACCCGACCCA GGCCCTGCCC
AGCGCGCTCA GTGCAGCGCT GAAGGGGCTG GCCCATGAGC TGCGCAATCC GCTGGCCGGG
CTGAAGGGCG CGGCCCAGCT GCTGGCCCGG CGCGCGGCCC AGCGCGACGC CAGCGAACGC
GAGCTGATCG AGCTGATCGG TTCGGAGATC GAGCGCCTCA ACGGCCTGCT CGACCAGCTG
CTGTCACCGG CCCCGGCCGC GCCGCACGCC GAACTGAACA TCCACGCCGC GCTGGAACGC
GTGCTGCGCC TGGCCGAGAA CGAGGCCGGC TGGGCGGTAC GCCTGCAGCG CGACTACGAC
CCCAGTATTC CCGAATTCCA TGGCGACGCC GACCGCCTCA CCCAGGCAGT GTGGAACCTG
GTGCGCAACG CGATCCAGGC CGGCGCCGGC AACATCACCC TGCGTACCCG CGTAGAACAC
GGCGTACGCA TCGCCGAGCA GTTGCATACG CTGGCGCTTC GCCTCGAAAT CGCCGACGAC
GGCCGTGGTG TGCCAGAGGA ACTGGCCGAG CATCTGTTTC TGCCGCTGGT CAGTGGTCGC
GCCGAAGGTA CTGGCCTGGG CCTGGCGCTG GCGCAGCAGG TCGCGCGCGA ACATCGCGGC
ACACTGACCT ACCGCTCGCG CCCGGGCCAT ACCGTGTTCA CCCTGCTGCT GCCGATCGGC
AACGGTGCCG CCCCGGCCGA GGAGGCCCCG CGCGATGTCT GA
 
Protein sequence
MSDPAPPPSL DALGTPLAWA GADGRIIGCN PAFARWLGVS IRRLLGQPLA ALEVQGEALA 
HFLARDERDS LRLNRLALAV PGEAPRFAEG WMSRRDDGGW LLEAHPVDEF PGLDPTQALP
SALSAALKGL AHELRNPLAG LKGAAQLLAR RAAQRDASER ELIELIGSEI ERLNGLLDQL
LSPAPAAPHA ELNIHAALER VLRLAENEAG WAVRLQRDYD PSIPEFHGDA DRLTQAVWNL
VRNAIQAGAG NITLRTRVEH GVRIAEQLHT LALRLEIADD GRGVPEELAE HLFLPLVSGR
AEGTGLGLAL AQQVAREHRG TLTYRSRPGH TVFTLLLPIG NGAAPAEEAP RDV