Gene Gdia_0837 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0837 
Symbol 
ID6974234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp950384 
End bp952633 
Gene Length2250 bp 
Protein Length749 aa 
Translation table11 
GC content60% 
IMG OID643390366 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_002275242 
Protein GI209543013 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.20117 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.390966 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGCAT CGTCAAGCGC CCGGTCCCGC CCGTATTTGT CGACCAAAAT CATCATGATA 
TGCGGGCTGT CGTTTCTCGT TATCCAGATC GTCGCCGTAA GCTTGTCCGC CAGACTGATG
AAGGCGCAGG TCGAGAAGAA TATCTATTCC GAGGCGATTT CAAAAAGCCA GATCCTGGCG
CAGCAGATCA TCCAGGCGCT CGACGCGCAG AATACCGTCG TGCGCAGCCT GCGAGGCACC
ATTGAATCCG CCCATCGGTC CGGTGTGCTG ACCAGGCAGT TCGTCATCGA CAATCTGTCC
GAAACCATGC GCCTCTTCCC CGATATTTAC GGCATGGACG TCAAGGAAGC CGCGCATGGC
GCCGAGGGCT CGCTCTTCCC CGGCGGGCCG GCCGGCAGTT CTCACGACGT ATTCTTTCCC
TACGCCGTGC GCGGCCCGGG CGGCAAGGTT GCCATCACCA CGCCCGCGCT GCATTACAGT
CCCGTCTTTT ATAGTATTAT AAATACCGGA AATCTCGAAG GTCTTGAACC GTATATCGAC
ACGGATTCAG GGGTTCCGAT GGTATCGCCC ACCTATCCCA TCACGTTCGA TGGAAAGAGG
ATCGCCGTCA TCGGCGCGGA TATCCCGCTC GGGTGGCTGG CGCCGCTTCT GAAAACCGTC
CATATCGCCC AGGGTTCGAC GGTTTCCCTT CTGTCGGATC AGGGCTTCTG GATCGTCACC
CCCAATTCCT CGCAGGTGAT GAAGCATTAT GATGCGTCGT CCGACGCGAC TGTGCAAAAG
GCCATAACAC AGCATACACT GACCATCGCG CGGGATTTCA ATGACGGCGC GGTGGACCGT
ATCATCGTTC CGATCAGAAT TGACGATTTC GGCATCTACT GGACGCTTAT CGTCGACGTG
CCGTCCGACG CGATCACCAA ACCCGTCTGG AATACCGTCT TTTCCCTGGT CGTTCCGGGG
GTTACGCTGA TTCTGCTGTC CATCGTGGTG ATGTGGCTCA CGTTCAACAG GATGTTGAGC
ACGCCCCTGC GCAACCTTTT GCGGTCGGTT TCGGAACTGG AACGCGGTGA ATATAACGGC
ACGGTCTACG GGACACGGCG TTCCGACGAA ATCGGCGCCA TGGCCCGTGG GCTGGAGGGC
TTCCGCGGGT CCCTGCTCAG GGCACAGCAG GCTGACGCCG AAGCCCGGCA GGCCCGCCAG
CAGAACGAGG CGATGCAGGC CGCGCGCCGC CGGGAAGACG AACAGAGGCT CAAGGACGGC
AACCGGGTCA TCAAGACGAT CGGCGCCGCG CTGGACGACC TGGCCAACGG GGACCTGTCG
GCCCGTGTCC ATGACGCGCT GCCCCGGGAG TTCGAGCCGC TGCGCGACAA TTTCAACCGG
TCGGTCCAGC AACTCGCCAT CGCCATCGGG GGCATCTCCG AGGCCGTCAG CGTCATCAGG
AAAGGCAGCC AGGTCGTCTC GGGCGGGGCG GAGGAACTCG CCGAACGGAC GGAACAGCAG
GCCGCCGCGC TGGAAGAAAC GACGTCCGCG ATAAATCAGA TCGCCCGGAA CGTTTCCCAA
TCGGAGGAGA TCATCACCCA GACCCAGACC GTCGGTCTGA AAGCCTGCCG GTCCGCCGAA
TCGTCTTCGG ACATCATGGG CAGGACCCGC AAGGCCATGC AGCGCATAGA AACGAGCTCG
GCCGAAGTCG TCGCCATTAT CGAGATCATC GAAGGCATCG CGTTCCAGAC CAATATCCTT
GCCCTCAATG CCAGCATCGA AGCCGCCAGC GCGGGCAACG CGGGAAAAGG CTTCGCCGTC
GTGGCCAACG AAGTCCGCAG CCTGGCCCAA CGCAGTGCCG AGGCCGCCAA GGGCATCAGT
TCGCTGGTGA ACAAGACGGT CGAGGAAATT CATGAAGGTG CGACCTGCGT GCGGGACACG
GAACGGGCCC TGCAGGATAT TTCCAGCCTT GTCTCGACAA TGGGCGAGAA GCTGAACACG
ATCGTCCATA GCGCGCGTGA ACAATCCTCC AGCCTGCGGG AGGTCACCAC CGCTGTAAAC
GTCATGGACC AGACCACGCA GAAAAACGCA TCCATCGCCG ACAATTCGCG CTCGTCGTCA
CGCAGCCTGA CAGAGGAAAC GCAGAAGCTG ACGGACCTGA TCACGCACTT CAAACTGTCC
GGGTCCGAAA ACAGCGATAT CCAGGCATGG CACGGTGAAG TCATTCACCT GGCCGAACGG
GTTCATGAAC GTTCGGGGTT CAATCGCTGA
 
Protein sequence
MMASSSARSR PYLSTKIIMI CGLSFLVIQI VAVSLSARLM KAQVEKNIYS EAISKSQILA 
QQIIQALDAQ NTVVRSLRGT IESAHRSGVL TRQFVIDNLS ETMRLFPDIY GMDVKEAAHG
AEGSLFPGGP AGSSHDVFFP YAVRGPGGKV AITTPALHYS PVFYSIINTG NLEGLEPYID
TDSGVPMVSP TYPITFDGKR IAVIGADIPL GWLAPLLKTV HIAQGSTVSL LSDQGFWIVT
PNSSQVMKHY DASSDATVQK AITQHTLTIA RDFNDGAVDR IIVPIRIDDF GIYWTLIVDV
PSDAITKPVW NTVFSLVVPG VTLILLSIVV MWLTFNRMLS TPLRNLLRSV SELERGEYNG
TVYGTRRSDE IGAMARGLEG FRGSLLRAQQ ADAEARQARQ QNEAMQAARR REDEQRLKDG
NRVIKTIGAA LDDLANGDLS ARVHDALPRE FEPLRDNFNR SVQQLAIAIG GISEAVSVIR
KGSQVVSGGA EELAERTEQQ AAALEETTSA INQIARNVSQ SEEIITQTQT VGLKACRSAE
SSSDIMGRTR KAMQRIETSS AEVVAIIEII EGIAFQTNIL ALNASIEAAS AGNAGKGFAV
VANEVRSLAQ RSAEAAKGIS SLVNKTVEEI HEGATCVRDT ERALQDISSL VSTMGEKLNT
IVHSAREQSS SLREVTTAVN VMDQTTQKNA SIADNSRSSS RSLTEETQKL TDLITHFKLS
GSENSDIQAW HGEVIHLAER VHERSGFNR