Gene Daro_3714 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3714 
Symbol 
ID3568147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3990834 
End bp3993920 
Gene Length3087 bp 
Protein Length1028 aa 
Translation table11 
GC content60% 
IMG OID637682187 
ProductPAS 
Protein accessionYP_286913 
Protein GI71909326 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG0784] FOG: CheY-like receiver 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.101634 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTAATT GCTGGCTAAG CATTGTTGCG GCCGTCTTCT GGCTGCTCTC CGGATTATCT 
GTCGCGCATG GCATGGAGCC CGCCGGCACC GCGACCGAGG CCAAAAAACC GGCGAAATTC
GCCGTCCTGG CCTACCGCCC AAAAGCGGAA ACCGAAGCAC GCTGGCAACC GCTGATCGAC
TACCTCAACA GCATCCAGCC GAGCCAGCCG ATCACGCTGG TCGCGCTCAA TTACCCGGAG
CTCGAGGCCG CCGTCAGGAA CAAGCAGATC GATTTCGTCC TGACCCAACC GGCACATTAC
ATCCTGCTGG CCAAGCAGGA ACATTTGCTC TCGCCATTGG CCACGCTGGT TGAAAAAGAA
GGCGGCAAAG CACTGTCCAG TTTTGGCGGC GTAATCGTCA CCCGCAGCGA CAAGACAGAC
ATCCAGCAGC TCGCCGATCT GCGGGGAAAA CGTATCGCGG CCAGCAGCAT GGCATCGCTT
GGCGGCTATC AGATGCAGGC CTTCGAACTG TTGCAAGCAG GCATCCACCT GCCAAACGAC
GCCCGGATCA TCGAGACCGG GCAGCCCCAG GACAAGGCCA TTCAGGAATT GCTGGCCGGA
CGGGCCGACG CAGCCTTCGT TCGCACCGGC CTCCTCGAAG CCATGCAAAT GGAGAACAAG
CTGGATGCCA GGCAATTGCG GGTCATCAAC CCCCAGCCTC CCGGCGACTT TCCACTCGTC
CGGTCAACCC GGCTGTATCC GGAATGGCCA GTTGCCGCGA TGCCCTGGAC AGAAAACAAT
CTGGCGCGGC AGGTCGCTGC AGCCATTCTT TCCCTGCCGG CCGACGGGGA GACTGCGCGA
GCCATGAATA TTCATGGCTT CACCATTCCT GAAGACTACC GTCCAGTTGA TGAATTGATG
CGCCAGCTCC GCCTGTCGCC TTTTGATGCT CCGCCAAAGT TCACGATAAA TGATGTTTTC
GTGCAGTTCG GTGCCGTCAT TGCGGTGATC GCCATACTCG GCATCAGCGT CCTGATAGCC
ATCGTTCTGG CCCTGTTCCG TACGATACGC CGCCTCAAGG GCGAACGCTC GCGAACCAAG
GCGACGATGG CTCAACTCTC GGCCGCCGAA GCCCGCTTCC GGGCCATTTT CGACAACGTC
GAAACCCTGG CCATTCAGGG CTACCTGGCT GACGGTACCG TCGCCTACTG GAACAATGCA
TCGCAAACTA CCTATGGCTA CACCGCGCAG GAAGCGCTCG GCAAATCCCT GCTCGACCTG
ATCATCCCCC CCGCCATGCG CGAAGACGTC CGGGGCGCCG TGCAATGGAT GTTTTCCCAC
AAGACGGGCA TTCCACCGGG CCGGCTCAAA TTGCAGCGCA AGGATGGCAG CACGGTCGAG
GTCTATTCGA GCCATACCGT CATCGACACG GCCGATAAGG GGCCGATGAT GTTCTGCCTC
GACATCAACC TGACCGAACT GGTCCATACC GAGCAGGCGC TGATCGAGAG CGAATCCCGG
CAACGGATGA TTCTCCAGAC ACTGGGCGAA GGCGTCTACG GCACCGACCT CGACGGGGTT
TGCACCTTCA TCAATCCCGC CGCGCTGGAC TACCTTGGTC TCCAGGAAGA CGAGGCACTG
GGCAAGAACA CCCATACCCT CTTCCACCAC CATCGAGCCG ACGGCACGCC CTTGCCGCTG
GCCGAATGCC CACTGATCCT GACCGCCAGG GATGGCCAAA GCCGGCGCCT GGAGGATGTC
TTCTGGCGGA AAAATGGCGA ATGCTTTCCG GTGCGCCTGA CCATCACGGC CAAGCTACGC
GATGGCGTGA TTACCGGTGT CGTCGTCTCG TTCGCCGACA TCAGCGAAAA CCGACGCGTG
GCCAAGGAAC TGGAACAACA TCACACCCAC CTTGAAGCGC AAGTCAGGCA ACGTACCGAA
CAACTGGAAA TTGCCCGCCA GGCGGCTGAA GCCGCCAATC GATCAAAGAG TGCCTTCCTG
GCCAACATGA GTCACGAGAT ACGCACGCCG CTCAACGCCG TGCTCGGCAT GGTGCACCTG
TTACGCCGTG ATGCACCAAC ACTGGAGCAG ATCGACCGCC TGGACAAGAT CGACTCGGCC
TCGCAGCATC TGCTGGCGGT GATCAACGAC ATCCTGGACA TCTCGAAAAT CGAGGCTGGA
AAACTGGTTC TCGACGAAAC AACCGTGGAC ATCACCAGCA TCCTGAAGCG TGTCGTATCC
GTGCTCGGCG ACCGTGCTCG CCAAAAGGGG CTCGAATTAC GTGTGCTGGC GGATGATTTC
CCCCATACCT TTATCGGCGA CCCGACGCGC ATCACCCAGT GCCTGATCAA TTACGCCGGC
AACGCGGTCA AGTTCACCGA GCATGGCAGC ATCACCCTCA AAGCCAGGCG CATGGCTGAT
GACAGCAACG GCGTACTGAT CCGCTTCGAA GTCGAAGATA CGGGCATTGG CATCTCCGAT
GATGCCATCG ATCGCCTGTT CGGCATCTTT GAGCAGGCCG ACAGTTCGAC CAGCCGGAAA
TTCGGCGGCA CCGGTCTGGG CCTGGCCATC ACCCGACGCC TGGCCGAACT GATGGGCGGC
AAGGTCGGCG TCAGCAGCCG CGTCGGCGCC GGCAGCCGCT TCTGGTTTAC GGCGCAGTTG
AAACCCGGCG ACGACGCCGT CGATGCCATG ACCTCAACCT TTGCCGGCCT TACCGCCAAA
GGCGTTCAGA ACAGCCTGCG CGGACGGCAT CTGCTGGTCG TTGAAGACGA GGCGATCAAC
CGCGAAATTG CGCTGGAACT CCTCGGCGAG TTCGGGATCA CCGCAGATAC CGCCGAAAAC
GGGCGCCACG CGGTCGAATT GATCAAGATC CAGCACTACG ATCTGGTCCT GATGGACATG
CAGATGCCGG AAATGGATGG CCTGGAAGCT ACCCGCCGGA TTCGCGCCCT GCCCGAGCAA
AACGCCGTAC CGATCATCGC CATGACCGCC AATGCGTTCG CCGAAGACCG CGAGCGCTGT
ATCAAGGCTG GCATGAACGA CTTCCTGTCG AAGCCGGTTG AACCCGACGA CCTCAAGATG
CTGCTGCTAC GCTACCTCGT CCATTAA
 
Protein sequence
MVNCWLSIVA AVFWLLSGLS VAHGMEPAGT ATEAKKPAKF AVLAYRPKAE TEARWQPLID 
YLNSIQPSQP ITLVALNYPE LEAAVRNKQI DFVLTQPAHY ILLAKQEHLL SPLATLVEKE
GGKALSSFGG VIVTRSDKTD IQQLADLRGK RIAASSMASL GGYQMQAFEL LQAGIHLPND
ARIIETGQPQ DKAIQELLAG RADAAFVRTG LLEAMQMENK LDARQLRVIN PQPPGDFPLV
RSTRLYPEWP VAAMPWTENN LARQVAAAIL SLPADGETAR AMNIHGFTIP EDYRPVDELM
RQLRLSPFDA PPKFTINDVF VQFGAVIAVI AILGISVLIA IVLALFRTIR RLKGERSRTK
ATMAQLSAAE ARFRAIFDNV ETLAIQGYLA DGTVAYWNNA SQTTYGYTAQ EALGKSLLDL
IIPPAMREDV RGAVQWMFSH KTGIPPGRLK LQRKDGSTVE VYSSHTVIDT ADKGPMMFCL
DINLTELVHT EQALIESESR QRMILQTLGE GVYGTDLDGV CTFINPAALD YLGLQEDEAL
GKNTHTLFHH HRADGTPLPL AECPLILTAR DGQSRRLEDV FWRKNGECFP VRLTITAKLR
DGVITGVVVS FADISENRRV AKELEQHHTH LEAQVRQRTE QLEIARQAAE AANRSKSAFL
ANMSHEIRTP LNAVLGMVHL LRRDAPTLEQ IDRLDKIDSA SQHLLAVIND ILDISKIEAG
KLVLDETTVD ITSILKRVVS VLGDRARQKG LELRVLADDF PHTFIGDPTR ITQCLINYAG
NAVKFTEHGS ITLKARRMAD DSNGVLIRFE VEDTGIGISD DAIDRLFGIF EQADSSTSRK
FGGTGLGLAI TRRLAELMGG KVGVSSRVGA GSRFWFTAQL KPGDDAVDAM TSTFAGLTAK
GVQNSLRGRH LLVVEDEAIN REIALELLGE FGITADTAEN GRHAVELIKI QHYDLVLMDM
QMPEMDGLEA TRRIRALPEQ NAVPIIAMTA NAFAEDRERC IKAGMNDFLS KPVEPDDLKM
LLLRYLVH