Gene GM21_4097 details

Gene Information

Locus tag: GM21_4097
Symbol:
ID: 8139471
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4677434
End bp: 4679734
Gene Length: 2301 bp
Protein Length: 766 aa
Translation table: 11
GC content: 64%
IMG OID: 644871712
Product: GAF sensor signal transduction histidine kinase
Protein accession: YP_003023870
Protein GI: 253702681
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID:
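
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 4677434
end_bp = 4679734
gene_length_bp = 2301
protein_length_aa = 766


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 2301
print(expected_protein_length(gene_length_bp))  # 766
```

Under these assumptions both values agree with the listed Gene Length and Protein Length.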


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 135
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGGCC GGAAGGGACA AAACCTCCTG GAGCAAGCAT TAACTATTGC CCAGGCCTCA 
GGTCGGTCTC ATCAGGTTCG GCTCAACAGC CTGCTTCGCC TGGCGGTCCG TGGGCACTCC
CTCGCCTCGG CCACCATCTA TCTCCCCGAT CCCAAAGGGG CGGCGTTGCA GCGTCGCTTC
AGCACGCTTG CCACCCCCTC CGGCCATAGC TGCCACATCC CTTACGGCGC AGGTCTCGCC
GGGCGCGTCG CCGCCACCCT TTCACCCCAA TCCAGCAGTA CCGTTTGGCT GCATGGCGAC
GAGCCCTTCT CCGGTGACGG TTCCACGATA GCGCTTCCCC TTTTGGACGG GGACCGGCTC
TTCGCCGTGC TCGCGCTGGA GAGCGGCGCC GCCGACGTCG CCCAGGAAGC CGTCGACGCG
GCCGGCATCC TGGCGCCGGT CTTCTCCCTG ACCGTAACCG GCCTGGCCGC AGCCGAAGAG
GCCGAGGAGG CACGTCGCAA CCTGTCGCTA CTTTCGGCGC TGGCCAAGCT CTTGAGCTCG
CCGCAGCCGC GCGGGGTATT GCTGCACCGG TTGATGCAGC TTTGTACCGG TTCCGGCCTC
TCCAGTTGCG CCATCGTCCG CCTGAAGCAA AGAAACTCCG GCAAGGAGAG GGTGATCCGG
AGCTGCCGCA GGGGAATGGG CGACAAGCTC CCCGACCTGC TGGAAAAGGA AGCCGCGCTT
GCGGTTCATG TCTGCGCCAC CGAGGCCACC TGCGCCGAGG AACTCGGAGT CGACTCCAGC
TACCGGTACG CCCTCTGCAC CCCGCTGGGA AGCAACGGTG CCGCGCTCGG GACCATGACC
CTCTTCGGGG GTCCCGAACT GACCGCGCCC AAGCAGATCG AGCTTGCCGA AACGGTGGCG
CGCCTTTTGT CCGGCGCCAT GGCCGAGGCG ATCTGCAAGG AGCAGATCAA GACCTACGAC
AGCGAGAACG AGAAGAAGCT GAAAGAGCTC TCCCTTCTCT ACAGGATGAG CAACACCATG
CTCTCCACCA TTCAGCTGAA CAAGCTGATC CACCTCACCC TGACCGCGCT CACCTCAGGT
CCCACCCCCT TCTTCGACCG GGCCATGCTC TTTCTCACCA ACGAGCGCTC CGGCATGCTC
CTCGGCATGC TGGGGGTGAC CACGGAAACC TCCCCTTCCC TTTCAACCCA AAATGGAGGG
AGCGACGACG TCCTCTCCAG CCGGTGGGAC ATCTCAGACG ACGAGATGGC CGCCCAGCGC
AACTCCGAGT TCTGCCGCCA GGTACAGGGA AGGCGACTCG AACTGGACGG CACGCTCAAC
ATCGCTTCCC AGGCCGTGCT GGAGAAGAGG CTGATCTACA TCCCGGAAGA AGAAGGGTTC
GACGGCGGCG CGCTCCATTC CGGCCGCAGC GCCCTGGCGG CCTCTCCGCT CATCGCGCAT
GGGCAGGCGG TAGGGGCGGT ACTGGTGGAC AACGCCCTCA CACATAAGCC GATCAACCAG
GAGCACCTGC GTTTCCTGCA GCTCTTCACC AACCAGGCGG GGATGGCCAT CGAGAACTCG
ATGCTCTACA ACAAGATCGA GGACGCGAAC CGGCAGTTGA GCGAGGCGCA GGAGCACCTG
CTCCAGAAGG AGCGGCTCGC CGCCATAGGC GAGATGGCCG CCGGCATCGC GCACGAGTTG
AAGGGGCCGC TGGTCTCCAT CGGCGGCTTC GCCGGCAGGC TCGCGAAAAA GCTCCCCCAG
GAGACCAGCG AGTGGGCCCA TGCCGACCTC ATCGTGCGCG AAGTGCTCCG GTTGGAGGGG
ATCCTCTCCG AGATCCTGCT CTTTTCGAAG AAGACAACCA TCTGTTACAC CCGGTGCGAC
TTATCCGAGA TCGTGAAGGA GTCGCTCGCC GTGGTCACCC CTCCCCTGGA GGAGAAGCGG
ATCAGCGTGA ACGCCAAATT CCCGCGGCAA AAGCTCGTGC TTTTGGGCGA CGGGCAGCAG
TTGAAGCAGG TTTTCATCAA CATCATCCTG AACGCCCTCG ACGCCATGGG GACCGGCGGA
ACGCTGAACA TCCAGGTCTT GGCGGCGGAA ATGGACGGCA AGGAAGCCGT CCAGGTGAAG
ATATCCGACA CCGGCGGCGG CATACCGCTT GAGTCCCTGC ACAGCATCTT CACCCCGTTC
TTCACCACCA AAGGAAGCGG CACCGGCCTC GGGCTCCCCA TCGCCAACCG CATCATAACC
AACCACGGCG GGAAGATCCA AGTCACCAAC CACCCCGGCC TCGGGGTCGA GTTCAGGGTC
ATCCTGCCGA AACACTGGTG A
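
The GC content field reported for this gene can be recomputed from the gene sequence. A minimal sketch follows, assuming the value is the percentage of G and C bases over the full gene length; `gc_percent` is a hypothetical helper, and the rounding used by the database is not known.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above, as an example input.
first_row = "ATGGCTGGCC GGAAGGGACA AAACCTCCTG GAGCAAGCAT TAACTATTGC CCAGGCCTCA"
print(round(gc_percent(first_row), 1))
```

Passing the complete gene sequence instead of the first row gives the figure to compare with the listed GC content.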
 
Protein sequence
MAGRKGQNLL EQALTIAQAS GRSHQVRLNS LLRLAVRGHS LASATIYLPD PKGAALQRRF 
STLATPSGHS CHIPYGAGLA GRVAATLSPQ SSSTVWLHGD EPFSGDGSTI ALPLLDGDRL
FAVLALESGA ADVAQEAVDA AGILAPVFSL TVTGLAAAEE AEEARRNLSL LSALAKLLSS
PQPRGVLLHR LMQLCTGSGL SSCAIVRLKQ RNSGKERVIR SCRRGMGDKL PDLLEKEAAL
AVHVCATEAT CAEELGVDSS YRYALCTPLG SNGAALGTMT LFGGPELTAP KQIELAETVA
RLLSGAMAEA ICKEQIKTYD SENEKKLKEL SLLYRMSNTM LSTIQLNKLI HLTLTALTSG
PTPFFDRAML FLTNERSGML LGMLGVTTET SPSLSTQNGG SDDVLSSRWD ISDDEMAAQR
NSEFCRQVQG RRLELDGTLN IASQAVLEKR LIYIPEEEGF DGGALHSGRS ALAASPLIAH
GQAVGAVLVD NALTHKPINQ EHLRFLQLFT NQAGMAIENS MLYNKIEDAN RQLSEAQEHL
LQKERLAAIG EMAAGIAHEL KGPLVSIGGF AGRLAKKLPQ ETSEWAHADL IVREVLRLEG
ILSEILLFSK KTTICYTRCD LSEIVKESLA VVTPPLEEKR ISVNAKFPRQ KLVLLGDGQQ
LKQVFINIIL NALDAMGTGG TLNIQVLAAE MDGKEAVQVK ISDTGGGIPL ESLHSIFTPF
FTTKGSGTGL GLPIANRIIT NHGGKIQVTN HPGLGVEFRV ILPKHW
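
The protein sequence can be derived from the gene sequence by codon translation. The sketch below assumes the codon assignments of translation table 11, which match the standard code; table 11 differs only in which codons may act as start codons, and that does not matter here because the gene begins with ATG. `translate` is a hypothetical helper and raises `KeyError` on bases other than A, C, G, T.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence; it encodes the first 20 residues.
first_row = "ATGGCTGGCC GGAAGGGACA AAACCTCCTG GAGCAAGCAT TAACTATTGC CCAGGCCTCA"
print(translate(first_row))  # MAGRKGQNLLEQALTIAQAS
```

The last row of the gene sequence (ATCCTGCCGA AACACTGGTG A) is in frame and translates to ILPKHW followed by the TGA stop codon, matching the end of the protein sequence.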