Gene GM21_2233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2233 
Symbol 
ID8137572 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2605521 
End bp2607179 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content62% 
IMG OID644869848 
ProductGAF sensor signal transduction histidine kinase 
Protein accessionYP_003022040 
Protein GI253700851 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value0.0296242 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGAACA GGGGCAAACT GCTGGAGCAG TTGACGGGGG TCGATTCCTC GAAGCTCAAC 
TACTACGTCG AGCTCAAAAA GAGAAATCAG GACGTGCTGA AGCAAAACAG CCGCCTGGAG
ATCCTGCAGC AGCTCACGCG CGACATGAAC ATCGACATGT CCATCGGCGA CATCATCGAA
AGGGCGTTCA AAAAGCTCCC CGAGGCGCTC CCCTGCGATT TCCTGGGGCT GGCGAAGTTG
AGCGACGGCA AGTTGAAGCT GATCGCGGTC GCCCCCAGGG AATTTTCCAC GCTCGGTTTC
ATCCCCAAGC AGTCCGTCCT CTGGGACGCG TTCCGGGACA AGGGGGCGAC CAGCTACGAC
CCGGCCGACG ACCCCCTTTG CTTCGCCCAG ATCGCGGCCG AACATCCCCT GACCCTGCGC
TCCCTGGCCG CGGCGCCCCT TTTCGAGCGC TCCTCGGTCA ACGGGCTGTT GCTTTTGGGC
AGCGCGCTCG AATCCGCCTA CGAGGGGGCG GAGCTGAACT TCGTGCAGCA CCTGGCCGAC
CAGCTTGCCA TCAGCATGCA GAACGCGCGG CTCTACAAGC AGGTTTCCCG GGCCAAGAAG
GAGTGGGAGG CGACCTTCAA GGCCGTGACC GATCCCATCT TCCTGGTCGA TACCGATTAC
AACGTCCTTT TGCACAACGA CCGTCTTCCC CCCGGCATGG CCGAGATGTG GAACAGCTCC
GGCCGCAAGA AGTGCTTCGC GAAGATGCGC GGCGAGAACG AGCCCTGCCG CAACTGCCCC
ATCCAGGAGA TCAAGGAGAC CGGCAACCCC GTGTTCCAGC AGTGGAACAC CCCGCAGGGC
GAGTGCCTGG ATATGTCCTA CTACCCGGTC TTCAACGAGG AGAAGCAGCT TTCCGCTGTG
ACCATCATCA TGAAGGACGT CACCAAGAAG CTGAAGATGG AGGCCCAACT GGTGCAGTCG
GCCAAGATGG TGGCCTTGGG CGTCATGGCG GCGGGGGTGG CGCACGAGCT GAACAGCCCG
ATGACCGTGA TCATCGGGAC CGCGCAGATG CTTGCCAGGG AGATGCAGCA GGAGGGCGAT
TCCGAAGGGG CCGAGGCCTT GTCCGACATC ATCAACTGCG GCCTGCGCTG CAAGAGGATC
ATCCAGAACC TCCTCACCTT CTCGCGCCAG GACCAGGTCC CGGTCATGGA GATCGATTTG
AACGCCGAGG TGGAGCGGGT GCTCGACATG ATCCAGTACC AGATCAACCG GAGCCAGATC
GCGATCTTCG AGCATTTCAA CCGCGACCTG CCGATGATGA ACGCGAACGG GCCGCAGATC
CAGCAGGTGC TGACGAACTT CCTGATCAAC GCGCGCGACG CCCTGGCGGA AGTGGAAGGG
AAGGAGAAGG TCATCGAGGT CTCAACCGCT CTTCGGGTCG AGGGGGAGCG GCGCTTCGCC
GTCTTAAGCG TCGCCGACAA CGGGGTCGGC ATCGCCAAGG AGAACATCTC CAAGATCTTC
ACCCCTTTCT ACACCAGCAA AGAGGCCTCG AAGGGGACCG GACTCGGCCT TTCGGTGAGC
CTCGGCATAG CGGAGTCTCA CAACGGCCGC ATCGAGGTCG ACACCGAACT CGGGCAGGGG
AGCTGTTTCA GCCTGATCCT TCCCGTCGAC CAGCCGTAA
 
Protein sequence
MKNRGKLLEQ LTGVDSSKLN YYVELKKRNQ DVLKQNSRLE ILQQLTRDMN IDMSIGDIIE 
RAFKKLPEAL PCDFLGLAKL SDGKLKLIAV APREFSTLGF IPKQSVLWDA FRDKGATSYD
PADDPLCFAQ IAAEHPLTLR SLAAAPLFER SSVNGLLLLG SALESAYEGA ELNFVQHLAD
QLAISMQNAR LYKQVSRAKK EWEATFKAVT DPIFLVDTDY NVLLHNDRLP PGMAEMWNSS
GRKKCFAKMR GENEPCRNCP IQEIKETGNP VFQQWNTPQG ECLDMSYYPV FNEEKQLSAV
TIIMKDVTKK LKMEAQLVQS AKMVALGVMA AGVAHELNSP MTVIIGTAQM LAREMQQEGD
SEGAEALSDI INCGLRCKRI IQNLLTFSRQ DQVPVMEIDL NAEVERVLDM IQYQINRSQI
AIFEHFNRDL PMMNANGPQI QQVLTNFLIN ARDALAEVEG KEKVIEVSTA LRVEGERRFA
VLSVADNGVG IAKENISKIF TPFYTSKEAS KGTGLGLSVS LGIAESHNGR IEVDTELGQG
SCFSLILPVD QP