Gene GM21_0741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0741 
Symbol 
ID8136056 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp883763 
End bp887344 
Gene Length3582 bp 
Protein Length1193 aa 
Translation table11 
GC content59% 
IMG OID644868358 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_003020573 
Protein GI253699384 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.0000131287 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTCACCG ACGATGGCCA CCTCGCAAAC GGTCGCACCC GCAGCGCCCT CATCTTTTTG 
GTCCTCACCG CCTTTGGCTT GGCGGGCAAT TATTTCAAGT TTCCCCTCTT CCTCAACATC
GACTTTCTCT TCGGCAGCAT CTTCGCGCTG TTCGCACTGC AGCTCCTGGG TCTCCGGCTC
GGCACTGCGG CAGCGGCGCT GATCGGATGC TATACCTACG TTCTCTGGAA CCACCCTTAC
GCAGCCATCA TTCTCGCTGC CGAGGCCGCA GCAGTAGGCG CTCTGATGGA GCGCCGCAAG
TACGGCATGA TCTCGGCAGA CATTGTGTAC TGGGTCTTCA TGGGCGTGCC GCTCGTTTAC
CTGTTTTACC ACGGCGCCAT GCACGTCCCG CTGAGCAACG TCCACATCAT CGCGTCGAAG
CAGGCGATGA ACGGTGTAGC GAACGCCATC GTGGCGAGGC TTCTCTTCAC CGTTTTTGCC
CTCAGGTCGC GTTCGGCGAT GATCCCGTTC CGGGAGATGA TTACGAGCCT GCTCGCCTTT
TTCGTTGTGG CGCCAGCTCT TTTGCTGTTG ACCCTTGAGG GGAGGGGCGA CTTTGCGGAA
ACCGATGAGA TCATCCGCGC CGGACTGAAG CGGGATATAG ACCGCGAGGC CTCACAGTTG
GCAACGTGGG TGACGAACAG GAAGGCGGTG ATCGAGAGTC TGGCGGACCT GGCGGAGTCC
AATTCCCCCG AAAAGATGCA ACCGTACCTG GAGCAGGCAA CAAAGTCGGA CACCAATTTC
TTGAGTGCCG GACTTCTTGA CCGAAGCGCC ACACTCAAAG CGTATTATTC CTTGCACGAC
GACCTGGGGG AGATGAACAC CGGCAAGACC TTCATCAACG ACCACTACAT CCCCAAACTG
AAGCAGACGC TAAAGCCGAT GCTGTCTGAA ACAGTCCAGG CCAAGGACAA TGGCCCCAAA
CTGCTGGTAT CGATGCTCGC TCCGGTGGTA GTCCGGGGAG AGTACCGCGG CTATGTCATA
GGGGTGCTCA GCCTGCAACA GATAAAGGAG CGACTGGAAA GGAGTAGCAG CTACAAGGGG
ACCCATTACA CCCTTTTGGA CCGGAAAGGC ACCGTCATCA TGAGCAATCG TTTCGACCAG
ACGACGATGA CACCTTTTCG GCGCGGCAAG GGAACCATCA AGAAACTGGA TAGCGGCATC
AGCCAGTTTG TGCCGGACGC TCCCCCCAAT ACGCCTATGT CGGAGCGTTG GAAAAAATCC
CGCTACAGCG CGGAATCGAC CATAGGAGAA CTGGCGGAGT GGCGACTGGT ACTGGAACAG
CCTGCGGCGC CTTTCCAAAA GCAGCTTTAC GACAGCTGTA CCAAGAAGCT GATCCTGATC
TCGCTGCTGA TGCTCATGGC GCTCGCCCTG GCATCCTTTT TGGGACGCCG GATCATGGTT
ACCATAGAGA ACCTGCGCAG GGTGACCAAC GACCTCCCCT CAAGGCTTGC CTCCGGCGGA
TTTGTCGATT GGCCGCAAAG CGGGATAACG GAGACCAACA ATCTGATCGA GAATTTCCAG
GTGATGAGCG AAACCATCGA AGAACACCTC ACCGAACTGC AACTGCTCAA CGACTCCCTG
GAGCAACGGG TACAGGAGCG GACGCAGCAG GTGGAACGAC TGGCAAGCGA GCAGCGAACC
ATATTGACCA CCATGCCGAT AGGTGCATGC CTTTTGGTGG ACCGGAAGAT ACGCATGGCC
AACCAGGCCT TCGACAAGAT TACCGGGTAC GAACCGGGTA AGACTCTGGG GATGGATACG
GCTCAATTGC ACCCGGACCT GCAGTCGTAT CAGCAGTTTT GGGACGCCGC AAGCAGTGCG
ACAGCCAAGG GCGGGATTCA TAGCGCCGAC ATGGAGCTGA GAAGAAAGGA CGGCTCCTTG
ATCTGGTGCA ACCTCGTGGG GCAGATGGTG AACCCGACTG CACCTGAGGA GGGCTTCATC
TGGATGGTCC AGGACATCTC CGAGCGCAAG ACCATGGAAT CTCAACTGCG CGCGAGCGAA
ACCCACTACC GCCTGCTTAC CGAGGACGTC GCCGATGTGG TGTGGAAGTT GGACGCCGGT
TACCGCTTCA CCTACATCAG TCCCGCCGAC GAGCGCCTGC GAGGCTATCG GGCGGACGAG
GTCTTAGGGC ACACCATACT GGAGCAGACG ACGAGCGAGT GGCATGCCGC CATTACGGAA
AAAATGCGCC CCGGCGAGGA TGCCCGGGAG ACCTCCCCCT TGGAAATACA GCAGCGCTGC
AAAGACGGGC AACTTATCTG GACCGAGATC TTCTTCACCG CCGACCATGA CGCCGACGGA
GCCATCACCG GCTACCACGG CATAACGAGG GATATCACCC AGAGAAAGCA GGCCGTGGAG
CTGGAGCAGC AGTTGCTGCA TGCGCAAAAG CTCGAAAGCC TCGGCGTTCT CGCCGGGGGG
CTCGCCCATG ACTTCAACAA CATCCTGATG GCGATCATCG GCAACGCCGA TCTCGCCCAG
ATCCACCTCG GCACGGGTTC GCCGGCGGCG GAGAACCTGC AGAGGATCCA GAACGCCGCG
TCGCGCGCCG CGGACCTGAC CAGCCAGATG CTCGCCTATT CAGGCAAGGG AAGGTTCGTG
GTGGAGCGGA TAGACCTGAA CAGCCTGCTG GACGACATGC TGCACCTGCT TGAGGTCTCC
ATCTCGAAGA AAGCGGCCTT GAAGTTCAAC CTGCACCGGC CGCTCCCGCA CATCGTTGCC
GACGCGACCC AGATTCGGCA AGTGGTGATG AACCTGGTGA TCAACGCCTC GGAAGCAATC
GGGGACAACA GCGGGGAAAT AACGATCTCC ACAGGATGTT CGCAGTGCGT TGAGGAGTGC
CGGAGAGGGA GAAAGGTCAG CCGCGACCAG GGCGAGAGAC CATGCGTCTA TCTTGTCATC
GCAGACACCG GCTGCGGCAT GGGCAGTGAA ACGGTGGCAA AGATTTTCGA CCCGTTCTTC
ACCACTAAAT TCACCGGCCG CGGCTTAGGC ATGGCCGCGG TCCAGGGGAT CGTGAAAGGG
CATAAAGGCA CCGTAAAGAT CTCCAGCGAG CCAGGCAGAG GGACGACGTT CACGATCTGT
TTCCCCGTCG CCGAAGCGAC GGGCGAGGCA CCGGCGGAAA ACCGCAATGA AACCGGCGTC
GAAGTTTGGC AGGGGAGCGG CACGGTTCTT CTGGTGGAGG ACGAGGATAC GGTGCGGGAG
ATAGGGGTGC AAATGCTGGA AAGCCTTGGT CTCAAGGCGC TCACCGCTAA AGACGGCGAA
GAGGCCGTCG AGGTCTTCAG GAGGGGGGAG GAGGTTTCCT TCGTGATCCT CGACCTGACC
ATGCCGCGGA TGGACGGCAA GCAATGCCTG AGAGAGCTGC GCCGGTTGGA TCCGGCGGTC
AAGGTGATCA TGTCCAGCGG CTTCAACGAA CAGGAGATCG CCCGCGACCT CGATGGGGGG
CCGTGCGGTT TCATCCAAAA GCCGTACGAC CTGCCCGAAC TGCAACAGGC GATCGGGAAA
TATATAGGAA AACCGGAACC GTTTGCCACG GAGAACTTCT GA
 
Protein sequence
MFTDDGHLAN GRTRSALIFL VLTAFGLAGN YFKFPLFLNI DFLFGSIFAL FALQLLGLRL 
GTAAAALIGC YTYVLWNHPY AAIILAAEAA AVGALMERRK YGMISADIVY WVFMGVPLVY
LFYHGAMHVP LSNVHIIASK QAMNGVANAI VARLLFTVFA LRSRSAMIPF REMITSLLAF
FVVAPALLLL TLEGRGDFAE TDEIIRAGLK RDIDREASQL ATWVTNRKAV IESLADLAES
NSPEKMQPYL EQATKSDTNF LSAGLLDRSA TLKAYYSLHD DLGEMNTGKT FINDHYIPKL
KQTLKPMLSE TVQAKDNGPK LLVSMLAPVV VRGEYRGYVI GVLSLQQIKE RLERSSSYKG
THYTLLDRKG TVIMSNRFDQ TTMTPFRRGK GTIKKLDSGI SQFVPDAPPN TPMSERWKKS
RYSAESTIGE LAEWRLVLEQ PAAPFQKQLY DSCTKKLILI SLLMLMALAL ASFLGRRIMV
TIENLRRVTN DLPSRLASGG FVDWPQSGIT ETNNLIENFQ VMSETIEEHL TELQLLNDSL
EQRVQERTQQ VERLASEQRT ILTTMPIGAC LLVDRKIRMA NQAFDKITGY EPGKTLGMDT
AQLHPDLQSY QQFWDAASSA TAKGGIHSAD MELRRKDGSL IWCNLVGQMV NPTAPEEGFI
WMVQDISERK TMESQLRASE THYRLLTEDV ADVVWKLDAG YRFTYISPAD ERLRGYRADE
VLGHTILEQT TSEWHAAITE KMRPGEDARE TSPLEIQQRC KDGQLIWTEI FFTADHDADG
AITGYHGITR DITQRKQAVE LEQQLLHAQK LESLGVLAGG LAHDFNNILM AIIGNADLAQ
IHLGTGSPAA ENLQRIQNAA SRAADLTSQM LAYSGKGRFV VERIDLNSLL DDMLHLLEVS
ISKKAALKFN LHRPLPHIVA DATQIRQVVM NLVINASEAI GDNSGEITIS TGCSQCVEEC
RRGRKVSRDQ GERPCVYLVI ADTGCGMGSE TVAKIFDPFF TTKFTGRGLG MAAVQGIVKG
HKGTVKISSE PGRGTTFTIC FPVAEATGEA PAENRNETGV EVWQGSGTVL LVEDEDTVRE
IGVQMLESLG LKALTAKDGE EAVEVFRRGE EVSFVILDLT MPRMDGKQCL RELRRLDPAV
KVIMSSGFNE QEIARDLDGG PCGFIQKPYD LPELQQAIGK YIGKPEPFAT ENF