Gene Daro_2995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_2995 
Symbol 
ID3567308 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3233258 
End bp3235246 
Gene Length1989 bp 
Protein Length662 aa 
Translation table11 
GC content58% 
IMG OID637681466 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_286195 
Protein GI71908608 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCAAG CCCCTCCACC CCTTCCCGGT AGTTCGCGCC GCCTGCGCTG GCTGCTCACC 
CTGCCCAAGC TGGGCATCGT GCTGCTGCTG GCCGCCCTGC TCGCTCTCCT CTGGCAACTC
CACCGTAGCG AGATAGAGGA AGATCGCGCG GCGCTGATCA AGGATGTTCT GTGGCTCGAA
CAGAACCTGC GCTTCCATCT GAACAGCAAC GAAGAGCAAG TCCAGCAATT GGCGCTGGAA
ATGGCCAACC CGGCCGGCCG TCAAAAAAAA TTCCGACTCC GGGCCGAGCA CATGCTCAAG
AACGCACCGG AAATCGCCCA GATTCTTTGG CTCGACAACC ACCGTCAGGT CATCGATGCG
CTACCGACGA GCACCCCGCC CGACAGCGAA ATCGAGTCAT TCGGGCCACC GGTCACGGCC
AAGGCCTTTG ACACAGCGAC CCGATTGGGC AAGCTGCATT ACAGCGAACC CTTTTTCCTG
GAAGGCAATC AGGCCTTCGT TGAAATGCTC GTCCCGGTAT TTGGCGAAGA ACACCGCATC
AGCGGCATGC TGGCCGTGCT TTACCCACTA GACGCACTAC TCGACAATCA GGTTCCCTGG
TGGTTTACCG AAAAATACAA GGTCGAGATT GTTGACGATA ACGACCTGCA ATACGCCACC
AAAACACGTA TCGAAGGAAA TAGCGACCAG CGCTACGAAA TCCCATTCGA CCCACCCGGC
GCCGGCTTGC TACTCCGCAT TACCAGCTAC CACACGCCCG ACAACTCGAT GCAGCGCCTG
CTGGTTGCCG CCATCATCCT GCTCGCCATC GGGGTTTTCT GGAGCCTGTG GCTGGTCCGC
GACCTGATGA AAAGGCACAG CCGGACCGAA GAGGCGCTAC GGGCAGAACA CGCCTTCCGC
GCAGCGATGG AAGACTCGCT GACCGTCGGC ATGCGCGCTC GCGACCTCGT CGGCCGGGTC
ATCTACGTCA ACCCGGCATT CTGCCGGATG ACCGGCTTCA GCCCCGACGA GCTGGTCGGC
GCAGCGCCAC CAATGCCTTA CTGGGCCCCC GAGCAACTGG AAGAAACCTA TGCCATGCAC
CAGACCGTGC TGGCCGGAGA AGCGCCTTTA GACGGCTTCG AAATCACCTT CATGCGCAAA
AACGGCGAAC GTTTCCAAGC CCTGGTCTAT GAAGCCAAAC TGATTGACGG AAACGGCAAA
CACACTGGCT GGATGGCCTC GGTACTCGAC ATTACCGAAA GGAAGCGCGC CGAAGAACTG
GCCCGCCAGC AGCAGGAACA GTTGCAATTT ACTTCGCGGC TGGTAACGAT GGGGGAAATG
GCCTCTACCT TGGCCCACGA GCTGAATCAG CCACTCGCAG CGATCGCCAG CTACAACACG
GGTTGCCTGA ATTTATTGAA CGACGGCAAA GCCAGTCCGA ACGATATTTT GCCCGCGCTA
GAAAAAATTG GCCTGCAGGC ACAACGGGCC GGCAAGATCA TCCGCCGCGT CCACGACTTT
GTCCGCAAGA GCGAGCCCAA GCGCGCCTCT TGCCTGCTTG GCGAAGTGAT CGAAGACTGC
CTGGGCTTCA TGGAGGCCGA AGCCCGCAAG CGCCATGTCC GGATCGAATG CAACACCCCG
CCCATGCCGC CGGTCCTGGC CGACCGGCTA ATGCTCGAAC AGGTACTCCT CAACCTGATT
CGCAACGGCA TGGAAGCAAT GGCCACGACC AGCGAAGTCA ACCGCCTGTT GCATATCTCC
ATCGAAGTCA GTGAAAACGA ATTGCGTATA AGCGTCAGCG ACCAAGGTTG TGGCATCGCC
CCGGAAGTCC GTGCCAAGCT GTTTACTGCC TTCTTCACCA CAAAACCAGA AGGCATGGGC
ATCGGCTTGT CGATCTGTCG TTCAATCATT GAATTCCATC GCGGCCGCCT GTGGGCCGAA
GAGAACCCCC ATTCAACGAC AGGAAACGGT ACGATATTCT TCTTTACTCT CCCCCTGGAA
AGCGCATGA
 
Protein sequence
MNQAPPPLPG SSRRLRWLLT LPKLGIVLLL AALLALLWQL HRSEIEEDRA ALIKDVLWLE 
QNLRFHLNSN EEQVQQLALE MANPAGRQKK FRLRAEHMLK NAPEIAQILW LDNHRQVIDA
LPTSTPPDSE IESFGPPVTA KAFDTATRLG KLHYSEPFFL EGNQAFVEML VPVFGEEHRI
SGMLAVLYPL DALLDNQVPW WFTEKYKVEI VDDNDLQYAT KTRIEGNSDQ RYEIPFDPPG
AGLLLRITSY HTPDNSMQRL LVAAIILLAI GVFWSLWLVR DLMKRHSRTE EALRAEHAFR
AAMEDSLTVG MRARDLVGRV IYVNPAFCRM TGFSPDELVG AAPPMPYWAP EQLEETYAMH
QTVLAGEAPL DGFEITFMRK NGERFQALVY EAKLIDGNGK HTGWMASVLD ITERKRAEEL
ARQQQEQLQF TSRLVTMGEM ASTLAHELNQ PLAAIASYNT GCLNLLNDGK ASPNDILPAL
EKIGLQAQRA GKIIRRVHDF VRKSEPKRAS CLLGEVIEDC LGFMEAEARK RHVRIECNTP
PMPPVLADRL MLEQVLLNLI RNGMEAMATT SEVNRLLHIS IEVSENELRI SVSDQGCGIA
PEVRAKLFTA FFTTKPEGMG IGLSICRSII EFHRGRLWAE ENPHSTTGNG TIFFFTLPLE
SA