Gene Daro_1586 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1586 
Symbol 
ID3568730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp1703311 
End bp1704570 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content63% 
IMG OID637680054 
Productsensor histidine kinase 
Protein accessionYP_284805 
Protein GI71907218 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value0.144463 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACAA CGCCCTGTAC CAACGAAAAC TCCGAGCATC TGGCCAACCT GCGTCGACTG 
GTCGCTGGCC GCTGGTTCGT ACTCGCAGCC TTGACGCCAC TGATCCTGCT CGCCCCGGGC
CTGCTCGATA TTCCGGTGCC ACAGGTTCCC CTGCTCGGTA TCGTTGCCAT TACCGGGCTG
TTCAACGCTT TTGCACAATG GCGGGTCAGC ACGTCAAAAA CCGCCAGCGC CTGCGAACTG
TTTGTTCAGT TACTTTTCGA CATCGGGGCG CTGACCGCCA TCGTTTTCTT CAGCGGTGGC
GCTGCCAACC CGCTGGTTTC AATGCTTCTG CCCCCCGTTG CCATCGCCGC CCTGACCCTG
CCAGCGCGCT TTGCTGCCGT CGTCAGTGCC GTGGCGATCG CCGCCTACTC GCTGCTGATG
GTTTTTTACT TGCCACTGCC AATGCCGGAT GCGACACGCG CCACCCGCCT GCACCTGATC
GGCATGTGGC TGATTTTTGT CCTTTCCGCC GCCATGATCG GCTGGATCAT CATCCGCATG
ACCCGCCAGA TTCGCCAACG CGATGCCCAA CTGGCGACTG CCCGCGAACA GGCTCTGCGC
GACGAACGGG TCATGGCCAT GGGCACCCTG GCGGCCGGCG CCGCCCATGA GCTGGGCACA
CCGCTTGGCA CGATGGCCCT GCTCGCCGGA GAACTGGCCA ACGATCCCAG CCTGAGCGGA
CCGTCACGCG AAGACATCGT GCTGCTCCGC CAACAGATCG GAGTCTGCAA GGAAATCATC
ACTGGTCTGT CACGCCGTGC CGGTGCCGAA CGCCTGGAAA ATGCTCCGCT CGAAGCGTCC
AATCGCTGGC TGGACAGTCT GCGCCAGCGT TGGCATGCGG CCCGCCCGCA GGCCAGCAGC
CGCCTGATCA TCGCCAGCGA CGGTACGCCC CCCGAAATCC TTGCCGATCC ACGGCTTGAG
CAGGCCATCC TCAACCTGCT CAACAATGCA GCCAACGCCA CGCCGAGTCC GCTGGAAGTC
CGGCTTTCCT GGTGCACGGA CAACCTGTGC ATCGACATCC GTGACCACGG CCCCGGTTTC
CCGCCAGAAG TTCTGGAACA GGGCGGGCAG ACCAGTTTTC CAGCCCATGA GCAAGGCAGC
GGCGTCGGCC TGATACTGAC CCGTAGCGCC ATCGAACAAC TCGGCGGCGC GCTGACCCTG
AGTAACCCTG AAGACGGCGG CGCCCTTGCC CGCATCGAAC TGCCACGGAT ACAAGCATGA
 
Protein sequence
MATTPCTNEN SEHLANLRRL VAGRWFVLAA LTPLILLAPG LLDIPVPQVP LLGIVAITGL 
FNAFAQWRVS TSKTASACEL FVQLLFDIGA LTAIVFFSGG AANPLVSMLL PPVAIAALTL
PARFAAVVSA VAIAAYSLLM VFYLPLPMPD ATRATRLHLI GMWLIFVLSA AMIGWIIIRM
TRQIRQRDAQ LATAREQALR DERVMAMGTL AAGAAHELGT PLGTMALLAG ELANDPSLSG
PSREDIVLLR QQIGVCKEII TGLSRRAGAE RLENAPLEAS NRWLDSLRQR WHAARPQASS
RLIIASDGTP PEILADPRLE QAILNLLNNA ANATPSPLEV RLSWCTDNLC IDIRDHGPGF
PPEVLEQGGQ TSFPAHEQGS GVGLILTRSA IEQLGGALTL SNPEDGGALA RIELPRIQA