Gene Gdia_3467 details


Gene Information

Locus tag: Gdia_3467
Symbol:
ID: 6976919
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 3799027
End bp: 3801282
Gene Length: 2256 bp
Protein Length: 751 aa
Translation table: 11
GC content: 65%
IMG OID: 643392988
Product: CheA signal transduction histidine kinase
Protein accession: YP_002277807
Protein GI: 209545578
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0643] Chemotaxis protein histidine kinase and related kinases
TIGRFAM ID:
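
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3799027, 3801282)
print(length)                  # 2256
print(protein_length(length))  # 751
```

Both values match the Gene Length and Protein Length fields of the record.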


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.124475
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGGTG ACATCGACAT GAACGGCGAC ATGGACGACA TCCTCCAGAT TTTCTTTCAG 
GAGTGTGACG AACAGCTTCA GGAGCTTGAG CGCGGGCTGT CGGCACTTTC GGACGGCGAA
TCCGGGCACG AGACCATCAA TGCCGTCTTT CGAGCCGTGC ACTCCATCAA GGGGGGCGCG
GCCTCATTCG GACTCGAAAA CCTCGTGCGC TTCGCGCATG TATTCGAAAG CTCCCTGGAC
GGTCTGCGTT CCGACCGTAT CTCGCCGACG CCCGAGATCA TCAAGACCTT CCTCAAGGCG
ATGGACGTTC TCAGCGACCT GGTGGCCGAG GCCCGCGATG GGGCGTCCGT GGATCACGGA
CGTGTCGCCG AAAGCCAGGC GGAACTCGAG ACCCTCATCA ACGGCGATGG TGCGCCGAAA
AAGACGGAAG ATGTCATTCC GGCGGATTTC GTCCCGGTCC CGATGGATTT CGTCCCGGTT
CCCGTGGATT TCGAGCCGGT GGCGGTGGAT TTCAGTTTCG ACGAGGAACC CGCGCCTGCC
GCTCCCGCCG CTTCGCACCT CGTCGTCACG TTCCGCCCGC ATGATGCCAT GTACGAGCGC
GGAGACGATG CCAGGAACAT CCTGATCGGG CTTGCGGACG CGGCATCCGC CTATGGCGGA
GTGGGAACGG TCACCTGCGA TGTTTCGGCG CTTCCGGCCT TCGCGTCGCT GGACCCCGAA
ATCTCCTACC TGTCGTGGCG CGTCGTTCTT CCCTCCGAGG TAACGGAGGA GAGCGTCCGG
GCCGTGTTCG ACTGGGTCGG CGATGTGTGC GACCTCACGG TGTCCCGCGG TGACGACCAG
GAACCGGATC AGGACGAGGC GGCGCTGCCC GCACCGGATG CCGTCGCATC GGATGCAGGG
GTACCGGAGG CCGAGGACGC GGCGGTATTG CTGCCGGCGT CCAGCCCGCC CCAGGACACG
CCTGTCCACG TGGCGCCGGC GCCCGAGGCA CCCCCGGCCA CCGGGGGCGG TGGCGGGGCG
CCGGTCCGTC GGCCCGAGCA GGCATCCATC CGTGTCGATC TTCACCGGAT CGACGACCTG
ATGGACCTTG TGGGTGAACT GGTCATCGCG CAGGCGGCGA TCGAATCCCT GTCCCGACGG
GACGATTCCT CGGCAATGCG GGAACTGGTG GAGAGTGTCA CCAGCATGCA GTCCCTGACA
CGCGACATCC AGGACGCCGT CATGGCGGTC CGGGCGCAGC CGGTGCGCAG CGTGTTCCAG
CGCATGCAGC GCGTGGTGCG CGAGGCGTGT TCGATGACCG ACAAGGATGT CGTCCTGACG
CTGGAGGGTG AAGACACCGA AGTCGACCGG ACATTGGTCG AGAAGCTGAC CGACCCGCTC
ACCCACATGC TGCGCAATGC GGTCGACCAC GGCATCGAGA AGACGGAGGA CCGCCTGGCA
GCCGGCAAGT CCGCGACCGG CCACGTTCTT CTGTCCGCGG CGCACCGTTC CGGGCGCATC
CTGATCACGA TCAGGGATGA CGGGGGCGGC ATCAATCGCG AGCGTGTCCT CGCGACGGCG
ATCTCGCGCG GCATCGTCGC GCCGGACGCG GTCCTGACCG ATGACGAGAT CAACAATCTT
CTCTTCGCGC CGGGCTTTTC GACAGCGGCA AAGGTGTCCG ACCTGTCCGG GCGCGGCGTG
GGCATGGATG TCGTCAAGCA GGCGATCCTC AGCCTTGGCG GCCGGATCAC GATCAGCAGC
GTGCGGGGCG AGGGAACCAC GTTCTGCCTC AGCCTTCCTC TGACCCTGGC CGTGCTGGAC
GGCATGCTGG TCAGTGCCGG CGATTCCACG ATGGTCATTC CGGTGTCATC CGTGGTGGAA
ACCATGATGA TCGACCATCA GGACGTCTAC ACCCTGCCGG GGGGAGGGAA CGTGATCTCG
ATCCGCGGCA GTTGCATGCC GCTGGTGCCG CTGGGCAAGG AACTGGGCCT GTCGTCGGCC
GGGTCCGATG AACGGTCGGA CGAGGCCGTC ATCCTGGTCG TCGAAAACGA GTCGGGTGCC
CGGGCGGCGC TGATCGTGGA CAAGATCCAC GACCAGACGC AGGTCGTCAT CAAGAGCATG
GAGAAGAATT ACCGGCAGAT CCCCGGCGTC TCGGCGGCGA CCATTCTTGG CGACGGCAGC
GTTTCCCTGA TCCTCGACGT GCCGGGCCTG ATCGCCTCCG TCATCGGGCG TATCGATACC
AAGCCCCCGG GGGCCCGCAC AGGGTTGGCG GCGTAA
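
The GC content reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted as a string with the space-separated 10-base groups shown above. The function name `gc_content` is hypothetical.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base groups of the gene sequence: 9 of 20 bases are G or C.
print(gc_content("ATGAGTGGTG ACATCGACAT"))  # 45.0
```

Passing the full gene sequence to the same function gives the whole-gene percentage, which can be compared with the GC content field of the record.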
 
Protein sequence
MSGDIDMNGD MDDILQIFFQ ECDEQLQELE RGLSALSDGE SGHETINAVF RAVHSIKGGA 
ASFGLENLVR FAHVFESSLD GLRSDRISPT PEIIKTFLKA MDVLSDLVAE ARDGASVDHG
RVAESQAELE TLINGDGAPK KTEDVIPADF VPVPMDFVPV PVDFEPVAVD FSFDEEPAPA
APAASHLVVT FRPHDAMYER GDDARNILIG LADAASAYGG VGTVTCDVSA LPAFASLDPE
ISYLSWRVVL PSEVTEESVR AVFDWVGDVC DLTVSRGDDQ EPDQDEAALP APDAVASDAG
VPEAEDAAVL LPASSPPQDT PVHVAPAPEA PPATGGGGGA PVRRPEQASI RVDLHRIDDL
MDLVGELVIA QAAIESLSRR DDSSAMRELV ESVTSMQSLT RDIQDAVMAV RAQPVRSVFQ
RMQRVVREAC SMTDKDVVLT LEGEDTEVDR TLVEKLTDPL THMLRNAVDH GIEKTEDRLA
AGKSATGHVL LSAAHRSGRI LITIRDDGGG INRERVLATA ISRGIVAPDA VLTDDEINNL
LFAPGFSTAA KVSDLSGRGV GMDVVKQAIL SLGGRITISS VRGEGTTFCL SLPLTLAVLD
GMLVSAGDST MVIPVSSVVE TMMIDHQDVY TLPGGGNVIS IRGSCMPLVP LGKELGLSSA
GSDERSDEAV ILVVENESGA RAALIVDKIH DQTQVVIKSM EKNYRQIPGV SAATILGDGS
VSLILDVPGL IASVIGRIDT KPPGARTGLA A
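
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, assuming the standard codon assignments, which translation table 11 shares with table 1 for internal codons (the tables differ in which start codons are permitted, and this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the gene sequence.
print(translate("ATGAGTGGTG ACATCGACAT GAACGGCGAC"))  # MSGDIDMNGD
```

The output matches the first ten residues of the protein sequence. Translating the last line of the gene sequence in the same way gives KPPGARTGLAA, the end of the protein, followed by the TAA stop codon.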