Gene Daro_2890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_2890 
Symbol 
ID3566295 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3111709 
End bp3114870 
Gene Length3162 bp 
Protein Length1053 aa 
Translation table11 
GC content62% 
IMG OID637681359 
Productpeptidase M52, hydrogen uptake protein 
Protein accessionYP_286090 
Protein GI71908503 
COG category[C] Energy production and conversion
[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG0680] Ni,Fe-hydrogenase maturation factor
[COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain 
TIGRFAM ID[TIGR00072] hydrogenase maturation protease 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAC GAAAGCAGCT GCGCATCCTC TGTTTTGGCA ATCCATTGCA TGGCGACGAC 
GGCTTCGGCC CGACCGTCAG CCTGGCATTG CGGCGGAAGG TGCTGCCAGC TGGCGTCGAG
GTCGTCGACT GCGGCACCCG CGGGCTCGAT GTCTTGCATC TGTTCGAAGA CTGCACCAGC
GTCATCGTCA TCGATGCGAT GGCCGGAGAG CGGCCAGGCA CGCTGCATGA ACTCAGCCCG
CACCAGGTGC CGGTTGAAAA TACCGCTAGC GGCGGACATG GCGCCGGGGT GGGCTATCTG
CTGGCAGCCG TGCGCGAGAT GATCGCCAGG CCGCCGGTCA TCAAGATCAT CGCGGTCGAA
ATCGGCGCGG TGAAAAGCTT TTCCCCGGGC CTCTCGCTGG AAGTCGCTGC CGCTGTGAGC
GAGGCCATTG AGCTCATACA AGACTGCTGT GCCGCGATGG CCGGACATGA TCCAGACGAA
CTGGCCAACG AACTGGAAAC GTTGCGCCAG GCCAACCGGG CGCTGGAAGC CGAACTGACC
AAGTGCACCG AGGCACTCGA CCAGCTGATC AGCGAGCAGG AGAACCAGAA CGACGAGTTG
CGCCACCGCT CCCGGCAACT TGCCATGCTG CAAGGTACCG TCGATCGCGC CATCGACACG
ATGGCCGAAA TCATCGTCAT GCTAGGGCCG GACGGACGAG TTACCCGGAC TAACCGGCTG
CTGGAAATCG AACTCGGCTA CACCGCGGAA TCCCTGGTCG GGGGATATTT TGAAGATTGC
CTTCCGCTGG CCGGCCAGAA CGCCCTGCGT GCCTTGCTAC CGGCCGGCGC AAAACCTCCG
CTGCTGCTCA ATGCCATCCG TGCAGCGGGC GGACACTTCG TCGCCGAGCT CAATTTCCGA
CGTGCCGATG CAGCATCGGA GCATGATCTC GAAGGCACCG TACCCTATCT GGTACACGCC
AGCCTGATTC ACAGCCAGGC CGGCAAACTG GAAGGCGCCA TTGTCATTTC GACCAACATT
TCGGTGCTCA AGGCCCGCGA AAAAGCCCTG CGCAACAATG AGCGCCAGTT GCACGAAACC
GCCGAGGAAC TGCGCAACCA CCGCGACAAC CTCGCCGCCA TGGTCGAAGA ACAGACGCGC
GACCTGCGCC AGGCCAAGGA ACAGGCGGAA GAGGCCAGCC GCGCCAAGAG CGAGTTCCTG
TCCAACATGT CGCACGAAGT CCGCACTCCG CTCAATGCGA TTCTCGGGCT TTCCGACCTC
TGCCTGCAGA CCCCGCTCAA CCCCCAACAG AACCAGCATT TGAGCAAAAT TCGACTGGCT
GCCGACCACC TGCTTGGCAT CATCAACGAC ATTCTCGACT TCTCGCGCAT CGAGGCTGGC
AAGCTCAGTA TTGAAAAACT GATCTTTGAA CTGCCATCGC TGCTGGAAGA GATCAGCGAC
CTGCTGATCG GCCGTATCGA AGAAAAAAAC CTGGAACTCT GTGTCGACAT CGCCGCCGAG
GCCAGCCGCA GTTTCGTCGG CGATCCGCTA CGCCTCAAGC AGGTGATCAT CAACCTGCTC
GGCAACGCCA TCAAGTTTTC CGACCAGGGC AGCGTCCGTC TTGGCTGCCG GATCGACAGC
ACCGACCACA ACAGCACCTC GCTGCACTTC TCTGTCAGCG ACGAAGGCAT CGGCATTTCA
GAGGAACAGC AGAATGCCCT GTTCTCCGCC TTCAGCCAGG CCGATACCTC GACCACCCGA
CGCTTCGGCG GATCCGGCCT GGGGCTGGTG ATATCCAAGC GCCTCGTCGA ACTGATGGGC
GGTCGTATCT GGCTGGAAAG CACGCCGGGA AACGGCAGCA CCTTCCACTT CACCATCCGT
CTCGACAATG CCCCCGATGT GCCATTGCTC ATCGATGAAC TGCGCAAGCA TCTGGCCAAA
CATGCCGGGC GTACCGTCCT GATCACCGAC GACAACCCGT TCGCAGCCCA GGCCTTGGCC
GCCCAGTGCC AGCAACTCGG GCTGCTAACC GAAACCTGCC CATCCGGCGA ATCGGCACTG
GCCGCACTGA ACCACCCGGG TGCCAACTAT CTCGCGGCAC TGATCGACTG GCATTTGCCA
GCCGGCCTGG ATGGCGTCGA GACCATGAAA CGAATCAGGA GCAATCTTGG TGTTCAGGCG
CCACCGCTGA TTTTGCTGAG CTCCCAGAAA GCATCCGGGG CAAGCGCTCT CGACGACCTG
CCGGCCGATG CGTTGCTGAT CAAGCCATGC AGTGTGCGCC ACCTATATGC GGCCCTTGCC
CGCCCGCTGC ACCTGCCGGA AATCACCATG CGGGCACCGG CCGTCCCGGC TCTTGATCTC
TCCGGCATCG CCCTGCTCAG CAACACCGAC ATTCTGGTCG TCGATGATAT CGAGCTCAAT
CGTGACCTGA TGCGCGAGCT GTTTGCCACC GCCGGCCTCA ATATCCGCCT CGCCTGCAAC
GGCAGGGAAG CCATCACCGC GATCCACTCA AAGAAACCTG ATCTCGTGCT GATGGACTGC
CAGATGCCGG TCATGGACGG CTTCTCCGCA ACCCGCGAAC TCCGGGCTCG ACCGGAATAC
GCCGAGCTGC CGATCATTGC CCTGACCGCC GGGGCCCTGG ATCACGATCG CGAACAATGT
CTTGCGGCCG GCATGAACGC CTACGTCACC AAGCCGGTCA ATCTCGAAAA ACTGCTCCGC
GTCATCGCCG AGCAGCTTTC ACCAGGCACC GCAGGGTCCA TGCCTGAGCG GCGAACAGCG
CTGCCTCCGC CGCCCCCGAA TTCAGTAACC ACGACGCCGC ACGCCGAAAC CACCAAGCTG
CCAGAGCTAC CTGGCATCGA TGTCACCGCC GGACTGATGC GAGTTCGCAA CAAGGCCGAC
TTTTACTGCC GCATGCTGGT GAAGTTCCGC GATACCCATT GCGCCGCCCT GGCGGCTGAC
CTTCGTGCAG CACTGGACGA GGGGAACCGC CCGGAAGCCA TTCGCCTGGC CCACTCGGTA
AAGGGTATCG CCCTGAGCCT CGGCATTGAC ATGCTGGGTG AGCGAGCCCT GAATCTGGAA
ATGAAACTCA AAGATCCGAC GGCAAGCGAT GTTGCGCCAG AGGTTGAGCT GCTCCTGACC
GAACTGGCGC AGATCCGCCA AGTCTTGCAG GCCCTCGGCT AG
 
Protein sequence
MSERKQLRIL CFGNPLHGDD GFGPTVSLAL RRKVLPAGVE VVDCGTRGLD VLHLFEDCTS 
VIVIDAMAGE RPGTLHELSP HQVPVENTAS GGHGAGVGYL LAAVREMIAR PPVIKIIAVE
IGAVKSFSPG LSLEVAAAVS EAIELIQDCC AAMAGHDPDE LANELETLRQ ANRALEAELT
KCTEALDQLI SEQENQNDEL RHRSRQLAML QGTVDRAIDT MAEIIVMLGP DGRVTRTNRL
LEIELGYTAE SLVGGYFEDC LPLAGQNALR ALLPAGAKPP LLLNAIRAAG GHFVAELNFR
RADAASEHDL EGTVPYLVHA SLIHSQAGKL EGAIVISTNI SVLKAREKAL RNNERQLHET
AEELRNHRDN LAAMVEEQTR DLRQAKEQAE EASRAKSEFL SNMSHEVRTP LNAILGLSDL
CLQTPLNPQQ NQHLSKIRLA ADHLLGIIND ILDFSRIEAG KLSIEKLIFE LPSLLEEISD
LLIGRIEEKN LELCVDIAAE ASRSFVGDPL RLKQVIINLL GNAIKFSDQG SVRLGCRIDS
TDHNSTSLHF SVSDEGIGIS EEQQNALFSA FSQADTSTTR RFGGSGLGLV ISKRLVELMG
GRIWLESTPG NGSTFHFTIR LDNAPDVPLL IDELRKHLAK HAGRTVLITD DNPFAAQALA
AQCQQLGLLT ETCPSGESAL AALNHPGANY LAALIDWHLP AGLDGVETMK RIRSNLGVQA
PPLILLSSQK ASGASALDDL PADALLIKPC SVRHLYAALA RPLHLPEITM RAPAVPALDL
SGIALLSNTD ILVVDDIELN RDLMRELFAT AGLNIRLACN GREAITAIHS KKPDLVLMDC
QMPVMDGFSA TRELRARPEY AELPIIALTA GALDHDREQC LAAGMNAYVT KPVNLEKLLR
VIAEQLSPGT AGSMPERRTA LPPPPPNSVT TTPHAETTKL PELPGIDVTA GLMRVRNKAD
FYCRMLVKFR DTHCAALAAD LRAALDEGNR PEAIRLAHSV KGIALSLGID MLGERALNLE
MKLKDPTASD VAPEVELLLT ELAQIRQVLQ ALG