Gene Daci_5000 details


Gene Information

Locus tag: Daci_5000
Symbol:
ID: 5750611
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Delftia acidovorans SPH-1
Kingdom: Bacteria
Replicon accession: NC_010002
Strand:
Start bp: 5535846
End bp: 5536922
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 67%
IMG OID: 641300124
Product: signal transduction histidine kinase, nitrogen specific, NtrB
Protein accession: YP_001566014
Protein GI: 160900432
COG category: [T] Signal transduction mechanisms
COG ID: [COG3852] Signal transduction histidine kinase, nitrogen specific
TIGRFAM ID: [TIGR00229] PAS domain S-box
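
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(5535846, 5536922)
print(length)                     # 1077
print(protein_length_aa(length))  # 358
```

Both results agree with the Gene Length and Protein Length fields of this record.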


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCAGG ACGACCGGCC GGCGCCCCCG GCCTCCTATC AGGCGCTTGA CCTGCTGTGC 
ACGCTGATGA CCGTGCTCGA CGAGCAGGGT GCCGTGCTTT TCGCCAACGC AGCCCTTGAA
AACGCACTGG GCCTGTCGCG CCGCATGCTC GAAGGCAGCG AGCTGGCGGC CTGCTTCACC
GAGCCTGCCT TGCTGGACAA GGCCTTGCGT GGCGCCCAGG ACAACGACTT CGCGGCCCTG
CGCTTCGAGG CCAGCCTGCA CCGCGTGGGC GGCGATGCGC TGCCCGTGCA CGTCACGGTC
TCGCTGGCCG AGCGGCCCGG CCATGTGCTG GTCGAGTTCT GGCCGCTGGA GCAGCAGGCC
CGCCAGGACC GCGAGGAGCG GCTGCGCGAG CAGGCCCAGG CACACAAGGA GCTGATCCGC
AACCTGGCCC ACGAGATCAA GAACCCGCTG GGCGGCATCC GTGGCGCAGC CCAGCTGCTG
CAGATGGATC TGGACGCACC CGAGCTGCGC GAGTACACCG AAGTCATCAT CCACGAGGCC
GACCGGCTGC AGGCGCTGGT GGATCGGCTG CTGGCGCCGC ACCGCCATCC GCATGAGGTC
GGTGACGTCA ACATCCACGA AGTCTGCGAG CGCGTGCGCT CGCTGGTGCT GGTCGAGCAC
CCTGCCGGGC TGACCATCAC GCGCGACTAC GACATCTCCA TCCCGGAGTT CCGGGGCGAC
AGCGCCCAGC TGATCCAGGC CGTGCTCAAC ATCGTCCAGA ACGCCGCGCA GGCGCTGCAG
GAGCGCATTG CGGCTGGTGA TGCCGAGATC ATTTTGCGAA CGCGTGTGGC CCGGCAGGTA
ACTTTTGGTC GCCAGCGCTA TCGGTTGGCA TTGGAATTGC ATGTGATCGA CAACGGGCCA
GGCGTTCCCG ATGCCATCAA GGAGCGGATC TTCTATCCGC TGGTATCGGG AAGGGATGGC
GGTTCGGGAC TGGGGCTGAC GCTGGCGCAG ACCTTCGTGC AGCGCCACCA GGGGCTGATC
GAATGTGAAA GCCAGCCGGG ACGCACGGAT TTCCGCATCC TCATTCCCCT GCCTTGA
 
Protein sequence
MIQDDRPAPP ASYQALDLLC TLMTVLDEQG AVLFANAALE NALGLSRRML EGSELAACFT 
EPALLDKALR GAQDNDFAAL RFEASLHRVG GDALPVHVTV SLAERPGHVL VEFWPLEQQA
RQDREERLRE QAQAHKELIR NLAHEIKNPL GGIRGAAQLL QMDLDAPELR EYTEVIIHEA
DRLQALVDRL LAPHRHPHEV GDVNIHEVCE RVRSLVLVEH PAGLTITRDY DISIPEFRGD
SAQLIQAVLN IVQNAAQALQ ERIAAGDAEI ILRTRVARQV TFGRQRYRLA LELHVIDNGP
GVPDAIKERI FYPLVSGRDG GSGLGLTLAQ TFVQRHQGLI ECESQPGRTD FRILIPLP
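
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how the blocked gene sequence could be cleaned and then checked against the GC content and reading-frame information in the record. The helper names are hypothetical. The stop codon set (TAA, TAG, TGA) is the one used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the sequence is a whole number of codons ending in a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First line of the gene sequence above, used only to show the cleaning step.
first_line = "ATGATCCAGG ACGACCGGCC GGCGCCCCCG GCCTCCTATC AGGCGCTTGA CCTGCTGTGC"
seq = clean_sequence(first_line)
print(len(seq))   # 60
print(seq[:3])    # ATG
```

Passing the complete gene sequence through `clean_sequence` and then `gc_percent` would give a value to compare with the GC content field, and `ends_with_stop` would confirm that the listing ends on a stop codon in frame.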