Gene Daro_2105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_2105 
Symbol 
ID3567001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2266119 
End bp2268161 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content54% 
IMG OID637680578 
ProductPAS 
Protein accessionYP_285318 
Protein GI71907731 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value0.292551 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAATG CGGAAAACGA CCTTTCGAAC GAACGATTGC ATGATTTTTC TCGCAGTTCT 
GCTGACTGGT TTTGGGAAAC GGATACTGAT CACCGGTTTA CCTATCTGTC CGATGATATT
GAGTCTAAAG TCGGCGTCCC TATTGACTAC TTGATGGGGA AAAGCCGCAG GGAACTGGCC
ACGCAACTCA CTAACAATCC TCCCGGAGAC TGGGAGAAGT ACTTGGCCGC CTTGAATGCG
CGAGAAGCTT TTCGGGATTT TGAATACTGC GCCTTAGCGC CGAATGGCGT GGCCTATTGG
TTCAGCATCA GCGGCGTCCC GTTCTTTGAC GAACAGGGGG AATTTCGTGG CTATCGGGGG
GTCGGGCGTG ATGTCACCGA AACGCACTCG ATGCAGGTTG AGCTGCACCG TTACCGGAGC
CACCTTGAAG ACGAGGTCCG AAGGCGCACG GAAGAATCAT TGAAGGAGCG TGAACGGGCA
ATATCGGCGT CGCAGGCAAA GTCGATTTTT CTCGCCAACA TGAGCCATGA AATTCGTACG
CCGATGAATG CGATCGTCGG ACTGACGCAT TTGTTGCGCA AAGAGATCAC GACGCCCGCA
CAGCTTGAAA AGCTCGCCCA GGTTTCTGCG TCTGCTGACC ATTTGTTATC GGTCATCAAC
GATATTCTCG ATATCTCAAA AATCGAGTCC GGCAATGCCG CCCTGGACGA ACTGGATTTT
GAGCTGGAAG GCATGATCCG GCGGGTCAGC AGCGTCATCG CACTGCGCGC CCAGGCCAAG
GGACTGGAAC TCATCGTCGA TATCAGAACG CTGCCGACTG TCCTGCATGG CGACCCCACC
CGACTGAGCC AGGTCCTGAT CAATTTTCTT GGCAATGCAG TCAAATTTAC CGTGCAGGGG
TCGATTACCT TGCGCGGCCG GATAGAGGCC GAAACGGCAA CAGATATGGT TGTCCGATTC
GAAGTCGAAG ACACCGGCAT CGGCATTTCG CCTGACGCAC AAGAAAAAAT CTTTGAGGCG
TTTGAACAGG CGGATCAATC AACCACCCGC AACTATGGCG GAACGGGATT GGGACTGGCG
ATTGCCAAGC ATGCGGCGAA AATGATGAAA GGCGATGTCG GGGTACGCAG CACGCCTGGC
GTCGGCAGCG TATTCTGGAT TACCGCCAGA CTCGGGAAAG TCAATGCAGC CGGCCCGGAG
TTCATTGTCC CGGAAGCGGT AGGCGTATCC ACTTTGGTGG TAGACGACTT GCCGTTAACG
CAGGCGGTCC ACTCTCAATT GCTCAAACGC ATCGGCCTCA ACCCGATAGC GGTGATGTCC
GGTCAAGAAG CTTTAACCGC TGTGCAGATT GCCGACCGCG AACTGCATCC GTTTGGGATT
GCCTTTATTG ACCTTCACAT GCCCGACTTG AATGGCCTGG AAGTCATTGC GAAAATCCGA
GCCTTGCCCT TGCGATACCA GCCGCATTGC GTTCTGGTCA CTGCCGCTGG TATTCAAAGC
ATTGTCGACG AAGCCAAGTC AGTCGGATAT GCGGCAGTTT TGCAGAAACC AGCCAGCTTG
GCTGGCCTGC GTGAAATTGT CAGCCAGCTT CTGACAACAC AGGGAAGCCA ATTTCAATAT
CTGAAAGAAA AAAAACGTCC TTGCGAAGCC TTGCTTCGGG AACACAAAGC CGGTTGCCGC
ATTCTGCTGG TTGAGGATGA ACCAATCAAT CAAATGATCG CCCGCGAATT GCTCGAAGAT
GCTTCGCTGG TTGTCCACGT GGCGGACAAC GGCCAACAGG CGGTTGAAAT GGCGCTATCA
ACGCCTTCTC CTTATGACCT TATCCTGATG GACGTACAGA TGCCGGTTTT GGACGGCATT
GCCGCTGCCC ACTGTATCCG CGAAAAGCGC CCGGATCTCC ATACGCCCAT CATTGCGCTG
ACGGCAAATG CGTTTTCTGA TGACCGCCGC AAATGTCTGG CTGCAGGGAT GAATGATTTT
GTCTCCAAAC CCGTTGATCC GGATGTTCTA TTCGAGATCA TCTGGAAATG GCTGTCAAAA
TAG
 
Protein sequence
MSNAENDLSN ERLHDFSRSS ADWFWETDTD HRFTYLSDDI ESKVGVPIDY LMGKSRRELA 
TQLTNNPPGD WEKYLAALNA REAFRDFEYC ALAPNGVAYW FSISGVPFFD EQGEFRGYRG
VGRDVTETHS MQVELHRYRS HLEDEVRRRT EESLKERERA ISASQAKSIF LANMSHEIRT
PMNAIVGLTH LLRKEITTPA QLEKLAQVSA SADHLLSVIN DILDISKIES GNAALDELDF
ELEGMIRRVS SVIALRAQAK GLELIVDIRT LPTVLHGDPT RLSQVLINFL GNAVKFTVQG
SITLRGRIEA ETATDMVVRF EVEDTGIGIS PDAQEKIFEA FEQADQSTTR NYGGTGLGLA
IAKHAAKMMK GDVGVRSTPG VGSVFWITAR LGKVNAAGPE FIVPEAVGVS TLVVDDLPLT
QAVHSQLLKR IGLNPIAVMS GQEALTAVQI ADRELHPFGI AFIDLHMPDL NGLEVIAKIR
ALPLRYQPHC VLVTAAGIQS IVDEAKSVGY AAVLQKPASL AGLREIVSQL LTTQGSQFQY
LKEKKRPCEA LLREHKAGCR ILLVEDEPIN QMIARELLED ASLVVHVADN GQQAVEMALS
TPSPYDLILM DVQMPVLDGI AAAHCIREKR PDLHTPIIAL TANAFSDDRR KCLAAGMNDF
VSKPVDPDVL FEIIWKWLSK