Gene GM21_4084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_4084 
Symbol 
ID8139458 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4663966 
End bp4665291 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content60% 
IMG OID644871699 
ProductGAF sensor signal transduction histidine kinase 
Protein accessionYP_003023857 
Protein GI253702668 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones138 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGGTA GCGATATTTC AAGCCACGTT TCCGATCCTG CCCGGCTTGC CGCGTTGCGG 
GCAGTGGCGC TGCTGGATAC GCCGACCGAG GAGGCTTTCG ATCGCCTGAC CAAGCTCGCC
TCGCGTTTTG CCTCTGCACC CGTCGCCCTC GTCACGCTGG TGGACAGCGA CCGGCAGTTC
TTCAAAAGCT GCGTGGGGCT GCCCGAGCCG TGGCTCTCCA GCCGCCAAAC CCCTTTGTCG
CATTCGTTCT GCCAATACAA CCGCGTCGCG AAGCAGCCCC TGATCATCGA GGACGCACGC
GTTCATCCGC TGTTCAAGGA AAACCTCGCC ATCAGGGATC TGAAGGTGAT CGCCTACCTG
GGGATCCCGC TGGTCACTTC CGACGGCTAC GTTCTCGGCT CCTTCTGCGT CATCGATAAC
AAGCCGAGGC ACTGGAGCGG CGAGGATGTC GAGGTGGTCG AGAACCTCGC CGCCGCGGTG
ATGACCGAAA TCCAGTTGCG CACGGAGATA GCGGTACGGG CCCGCGGCGA GAAAGATCTG
CGCCGGCAGC ACGAGGAACT GGGGCGGGCA TATCGGGATC TTGAACGGGA GACGGCAGAG
CGGGTGAAGA CTGCGGAGCA GTTGCGGCAA AGGGACCAGA TGCTGATCCA GCAAAGCCGC
CTGGCCGCGA TGGGGGAGAT GATCAATAAC ATCGCCCACC AGTGGCGGCA ACCGCTCAAC
CTGTTGGGGT TACTGGCCCA GGAACTGCCG ATAACCTACG TGACAGAAGA GTTTTCGCAA
CAATACCTGG AGTCGCGGGT GCGAAAGATG ATGGAGGCGA TCGGACACAT GTCAGCCACC
ATCGACAATT TCCGCAACTT CTTCTGCCCC GAGAAAGACA AGGTGGAATT CAGCATCTTC
GACGTCGTGG ACCAGACCCT ATCATTGATG GGCTTGACCC TGAACCAGGT GCAGGTAAGG
ATCGAGGTGG TAGAAAGGAT CAACCCCGTC ATCACAGGGT ACCCCAACCA GTATGCGCAG
GTTTTGCTCA ACATCCTCAA CAACGCGAGG GACGCCTTCG CCGAGCGCAA CGTCCCAAGC
CCCAAAGTAG AGATACGGAT AGCCGCCGAG GAAGGGCGAT CGGTGGTGAC GGTCAGCGAC
AACGCCGGCG GCATCCCCCC CGAGGTCATC GACAAGGTTT TCGACCCCTA TTTCACCACC
AAAGATCCCG ACAAGGGGAC CGGCATCGGT CTCTACATGT CCAAGATGAT CATCGAGAAG
AACATGGGCG GGTCGCTGAC CGCCTGCAAC ACCGAAGAGG GCGCCCGCTT CCGGATAGAG
GTATGA
 
Protein sequence
MNGSDISSHV SDPARLAALR AVALLDTPTE EAFDRLTKLA SRFASAPVAL VTLVDSDRQF 
FKSCVGLPEP WLSSRQTPLS HSFCQYNRVA KQPLIIEDAR VHPLFKENLA IRDLKVIAYL
GIPLVTSDGY VLGSFCVIDN KPRHWSGEDV EVVENLAAAV MTEIQLRTEI AVRARGEKDL
RRQHEELGRA YRDLERETAE RVKTAEQLRQ RDQMLIQQSR LAAMGEMINN IAHQWRQPLN
LLGLLAQELP ITYVTEEFSQ QYLESRVRKM MEAIGHMSAT IDNFRNFFCP EKDKVEFSIF
DVVDQTLSLM GLTLNQVQVR IEVVERINPV ITGYPNQYAQ VLLNILNNAR DAFAERNVPS
PKVEIRIAAE EGRSVVTVSD NAGGIPPEVI DKVFDPYFTT KDPDKGTGIG LYMSKMIIEK
NMGGSLTACN TEEGARFRIE V