Gene Gdia_2655 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2655 
Symbol 
ID6976085 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2922638 
End bp2924953 
Gene Length2316 bp 
Protein Length771 aa 
Translation table11 
GC content59% 
IMG OID643392168 
Producthistidine kinase 
Protein accessionYP_002277009 
Protein GI209544780 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.129531 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.0835586 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACAC GAGAATCCTT CCGAGTCAGC TCGCATCTTA AGGACATTAT CGGGCGCGAT 
CTGGTCACGA ACGAGTTCGT GGCGATTTTC GAGTTGGTGA AGAACTCATT CGATGCCGGG
GCGACGAGGG TCGACATCGA GTTTGATCCG GGTGAGGCTT CCATCACCAT CGTCGACGAT
GGCAAGGGGA TGACGTCGAC GGATATTCGC GACAAGTGGC TATTCGTCGC CTACTCCGAG
AAGGCACTTT CAGCTTCAGA TAACTATCGT GACAAGATCG CGTTCGCGGG GAGCAAGGGG
ATTGGCCGCT TCGCATGCGA CACGCTCGGG GATGGCTTGG AACTCTATAG CCGCACCGAA
GGCGGCAACT GGATCTCAAG GCTTGAAATC GACTGGACGC GCTTCGAACA GGATAGCACT
GAGGAGTTTC AGGAAGTCGA AGTCGAACTT GGGACGGCGC CATCGTTTCC CAGGATCGCA
AATGCCGAGC CTCCACCCCA GCATGGAACG ATGCTCGTCA TTAAGGGCAC TCGGCAGGAG
TGGAACGAAA ACGCCATCCG CCGGTTGCGC AGGGATCTTG CTAAGCTCAT CGACCCATTC
GGTACGACCA GTCATGTCGC CGTCTCGACC TGGCTGGTAG ATGGGTCCGA GGAGAGGATG
GAGGGCGTCG ATGGTCCGGT AGGGAATAAC ATTGCTGACA TACTCAGGGA GAAGACAAGC
CGGATCGAGG TAACCGTCGC GGACGGCCGT GTTGACACGA CGCTTTTCGA TCGTGGTCGG
AAAATATATG CAATCCGTGA GCCTTCTCCC TATGACGAAC TTACTGACTG CCGCGTCGAA
GGGCAGGTAT ACTTCCTCAA CAGGTCAGCT AAGCATACTT TTACGTCTCG CATGAGCGTG
CGGCCCGTGG AGTTCGGGAG CGTTTTCCTT TTCCTAAATG GGTTCCGTGT CTCGCCAATC
GGCGATGAGT TTGACGACAC CCTCGGATTG AACCGCAGGA AGCAGCAAGG CCAATCCCGC
TACCTTGGTA CGCGCGACAT CATCGGGCGG GTTGATGTAA CAGCGCCGCC CAAGATGTTC
CGAGAAGTGT CGAGCCGGGA CGCGGGCCTG ATCGAAGATG CGCGGAGTCG CGCGCTGTAT
GAAGCGATCC GCCGTCATAT GATCTTCCGG CTTGAGCGTT ATGTGGTCGG CGTGAACTGG
CAGGACAGGC CTGATCAAGA CCGTGATACA TCCGAAGGCC TTGAAACTGA CCGGGCACGC
GAGCGCGTCC TTGGTATAGT TGGTTCGCTT GCGCGATCGA AGGACGTCGA AATCCTGTAC
TACGATACAG AACTCGTCCG CGTGGCCGAC GATCCCGACA AGATCACTGA CGATGCGCTC
AAGGCGATGT CGGCGGTTGC GGAGAGCCGC AGCGATGTCT CCTTGCTTGG CCAGGTAGAG
GAAGCGAGGA GGCGGATCGC TGAATTGCGG GCGTCGCGCG ACGAAGCTCG CCAAGCCGCC
CTGCGCGCGA TGGCGGAGCG GGCCACCGCC GATGCGCGGA TTGCGGCGCT AGAGAAGCAG
GCAGCGTTCC TCGGGAGCAG CCAGGATGTC GATGTCGAGC GCATCCAGTT GCTCATGCAC
CAAGCCACGA TACACCTCGG GCATGTGCGC GCGGCAATCG AGAACATGGC GCATGAGGTG
CGGAACATTC TGGCTGCGGC CGTGATGCCA AAAGAAATCG ATGACCTCGG GGACGTCGAG
GATCTGCTCG CCACGATCCG CCAGTCCGCC CGTCGTGCAT CGGTCTCGAT CGCCGGCGCA
AGCTTATCCG GGGATCGCTT GCGCACCGTT CTGTCATTTG CACCGAACAT CCGTGTCGAT
TTGGAAACCG ACAAGGTGCA CGGCGACTTG CTCCAATTTC TCACCGAGTA TTTCGAGGTT
CGCCTTGTTG GTATTCCTGG CATGCCAGCC GCTACCTTTG ATGCCGCCAA TCTGGCGCTG
GAACGGGAAT TCTCCCCCGT GGACGTAGCA GTTCTCGTCG ACAACCTGCT GGATAACGCA
CGCAAGGCGA AAGCGAGCAA GATCGAGTTC AAGGCCACGC GCAAGGGGCA GGACTCCGTG
CTTATCAGGG TCATCGACGA TGGCCTGGGT ATCGACAGGC GCAAGGTCGA CCCGTCCAAG
ATTTTCGAGC GGGGCTATAC CGGTTCTGCC AACGGCACTG GACTAGGCTT GTACAGCATT
CGCCAGATAG TCGAGGGAAT GGGCGGCACC ATCGAGCTTG CAGGAGATGG AACGCGAGCT
GACTTCGACA TCGTTGTCCC GGGGGAACGG CAATGA
 
Protein sequence
MTTRESFRVS SHLKDIIGRD LVTNEFVAIF ELVKNSFDAG ATRVDIEFDP GEASITIVDD 
GKGMTSTDIR DKWLFVAYSE KALSASDNYR DKIAFAGSKG IGRFACDTLG DGLELYSRTE
GGNWISRLEI DWTRFEQDST EEFQEVEVEL GTAPSFPRIA NAEPPPQHGT MLVIKGTRQE
WNENAIRRLR RDLAKLIDPF GTTSHVAVST WLVDGSEERM EGVDGPVGNN IADILREKTS
RIEVTVADGR VDTTLFDRGR KIYAIREPSP YDELTDCRVE GQVYFLNRSA KHTFTSRMSV
RPVEFGSVFL FLNGFRVSPI GDEFDDTLGL NRRKQQGQSR YLGTRDIIGR VDVTAPPKMF
REVSSRDAGL IEDARSRALY EAIRRHMIFR LERYVVGVNW QDRPDQDRDT SEGLETDRAR
ERVLGIVGSL ARSKDVEILY YDTELVRVAD DPDKITDDAL KAMSAVAESR SDVSLLGQVE
EARRRIAELR ASRDEARQAA LRAMAERATA DARIAALEKQ AAFLGSSQDV DVERIQLLMH
QATIHLGHVR AAIENMAHEV RNILAAAVMP KEIDDLGDVE DLLATIRQSA RRASVSIAGA
SLSGDRLRTV LSFAPNIRVD LETDKVHGDL LQFLTEYFEV RLVGIPGMPA ATFDAANLAL
EREFSPVDVA VLVDNLLDNA RKAKASKIEF KATRKGQDSV LIRVIDDGLG IDRRKVDPSK
IFERGYTGSA NGTGLGLYSI RQIVEGMGGT IELAGDGTRA DFDIVVPGER Q