Gene Gdia_3081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_3081 
Symbol 
ID6976515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3372260 
End bp3374176 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content71% 
IMG OID643392589 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_002277426 
Protein GI209545197 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.0200059 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGATC GTCCCGTCGT CCTGCTGGTC GATGACGAAC CGGAAATCCT GGTGGCGCTG 
GGCGACCTGC TGGAGGATTC GTTCACCATC CTGTCCACCA CCTCGCCGGT CGAGGCGCTG
GAGATCCTGG CCGGGCGCAG CGACGTCGAC GTCATCGTGT CCGACCAGCG CATGCCGGAA
ATGGGCGGTG ACGTCATGCT GGCGCGGGCG CGCACCGTCT GCGACGCGCA GGCGATCCTG
CTGACCGGCT ATGCCGATAT CGGCGCGGTG GCGGCGGCGC TGAACCAGGG GCGCATCTCG
TTCTATTCCC ACAAGCCGTG GGACGGCGAC GCGCTGCGCG CCATGATCGT CCAGGCGGCG
CATCGCCATC GGCTGGAAGG CGAACTGCAG ACCGAGCGGA TGCTGCTGCG GGGGTTGCAG
CAGAATCTGC GCGCGGGCCT GGCGTTCAAG GACCCGCAGG GGCGCTTCAT CCGCATCAAC
CGCAAGGCGG CCGCCTTCTA CGGGCGCGAC GAGGCCGCCT GCCTGGGACA GCGGGAGGAG
GATGTGTGCG ACCCCGCCCA GCGTCCCGCC GTCCGTGAGG CCGAGGCCCG CCTGGCGGCC
GAGGGCAAGG ACGAGGAGGT GATCGCCATT CCCGCGCCGG GAGGCGGGCT GTCCTGGCGG
GAATTCACCC GGGTGCAACT GGACCGCAAT GCGCGCGGCG AGGCGTATTC CGTGCTGATC
AACCGCGACA TCACCCGCCA GAGGGAGATG GAGGCCCGGC TGCGCCAGGC GGAGAAGATG
CAGGCGCTGG GCACCATGGC CGGCGGGATC GCGCATGATT TCAACAATCT GCTGACGGCG
GTGATGGGCT CGCTGGAACT GGCCACGGAC ATGGCCGACG GGCTGGACGA ACGGACGGCG
CACCTGCTGG ACAACGCGAT GGCGGCGGCG CGGCGCGGGG CGGAACTGAC GCGGCGCCTG
CTGAATTTCA GCCGGCCGCG CGACCTGAGC CTGCAGCCGG TGGACGTGAA CGCGCTGCTG
CGCGGCATGC GCGATCTGCT GATGCAGGGC GTGACCTCGC GCCGGCGCGA CGGCAGCCAT
GCGTCGTTCG ACATCCGCAT GGACAAACTG GCGCCGGACG GCGACCTGCC GCCGGCGCGC
ACCGATGCCG GGCAACTGGA ACTGGCGCTT CTGAACCTGT GCATCAATGC CAGCGACGCC
ATGCCCGACG GCGGGACCAT CACCCTGTCC ACGCGGGTGG CGCACCTGGA CGAACCGGCG
GCCGAGGGCG AGCCCGCTTC GGGCGATTAC GTCGTGGTGT CGGTGGCCGA CCAGGGAACA
GGCATGCCGC CCGAAACGGT GGCGCGGGTG TTCGAACCGT TCTTCACCAC CAAGGACGTG
GGACGCGGGA CGGGGCTGGG GCTGTCGATG ATCTACGGCT TCGTGCGGCA TGTCGGCGGC
GATGTCCGCG TGACCAGCGC GCCGGGGCAG GGCACGCGGG TGGATCTGTA TTTCCCGGTC
CATCATCGCC AGGGCGGCGC ATCCGACCCC CGCGAGGCGG CGGAGCACGC GGCGGCCCCG
CACGGGCTGC GGGTCCTGGT GGTCGATGAC GAGGACGCGG TGCGGGCCGT GACCGCCGGC
TTCCTGCGCG GAATGGGCCA CCAGGCGATC GAGGCCCGGG GCGGCGAGGA CGCCCTGGCG
CGGATCGCCG GCATGGCGCC CGATGTCCCC GACCTGGTGG TCATGGACGT GATGATGCCC
CGGATGGACG GCGAGGAAGC GGCGCGCCGG ATTCGCGCGC ATTATCCGGG CAGCCGCATC
CTGTTCCTGA CCGGCTATGC CGACGACACC ATCCTGCCCG ACGACGCCCT GGTGCTGCGC
AAGCCCTTCA CCCAGGCGGA CCTGTCCCGC CATGTCAGCC GCGCGATGGC GGGCTGA
 
Protein sequence
MTDRPVVLLV DDEPEILVAL GDLLEDSFTI LSTTSPVEAL EILAGRSDVD VIVSDQRMPE 
MGGDVMLARA RTVCDAQAIL LTGYADIGAV AAALNQGRIS FYSHKPWDGD ALRAMIVQAA
HRHRLEGELQ TERMLLRGLQ QNLRAGLAFK DPQGRFIRIN RKAAAFYGRD EAACLGQREE
DVCDPAQRPA VREAEARLAA EGKDEEVIAI PAPGGGLSWR EFTRVQLDRN ARGEAYSVLI
NRDITRQREM EARLRQAEKM QALGTMAGGI AHDFNNLLTA VMGSLELATD MADGLDERTA
HLLDNAMAAA RRGAELTRRL LNFSRPRDLS LQPVDVNALL RGMRDLLMQG VTSRRRDGSH
ASFDIRMDKL APDGDLPPAR TDAGQLELAL LNLCINASDA MPDGGTITLS TRVAHLDEPA
AEGEPASGDY VVVSVADQGT GMPPETVARV FEPFFTTKDV GRGTGLGLSM IYGFVRHVGG
DVRVTSAPGQ GTRVDLYFPV HHRQGGASDP REAAEHAAAP HGLRVLVVDD EDAVRAVTAG
FLRGMGHQAI EARGGEDALA RIAGMAPDVP DLVVMDVMMP RMDGEEAARR IRAHYPGSRI
LFLTGYADDT ILPDDALVLR KPFTQADLSR HVSRAMAG