Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0741 |
Symbol | |
ID | 8136056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 883763 |
End bp | 887344 |
Gene Length | 3582 bp |
Protein Length | 1193 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644868358 |
Product | PAS/PAC sensor hybrid histidine kinase |
Protein accession | YP_003020573 |
Protein GI | 253699384 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.0000131287 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTCACCG ACGATGGCCA CCTCGCAAAC GGTCGCACCC GCAGCGCCCT CATCTTTTTG GTCCTCACCG CCTTTGGCTT GGCGGGCAAT TATTTCAAGT TTCCCCTCTT CCTCAACATC GACTTTCTCT TCGGCAGCAT CTTCGCGCTG TTCGCACTGC AGCTCCTGGG TCTCCGGCTC GGCACTGCGG CAGCGGCGCT GATCGGATGC TATACCTACG TTCTCTGGAA CCACCCTTAC GCAGCCATCA TTCTCGCTGC CGAGGCCGCA GCAGTAGGCG CTCTGATGGA GCGCCGCAAG TACGGCATGA TCTCGGCAGA CATTGTGTAC TGGGTCTTCA TGGGCGTGCC GCTCGTTTAC CTGTTTTACC ACGGCGCCAT GCACGTCCCG CTGAGCAACG TCCACATCAT CGCGTCGAAG CAGGCGATGA ACGGTGTAGC GAACGCCATC GTGGCGAGGC TTCTCTTCAC CGTTTTTGCC CTCAGGTCGC GTTCGGCGAT GATCCCGTTC CGGGAGATGA TTACGAGCCT GCTCGCCTTT TTCGTTGTGG CGCCAGCTCT TTTGCTGTTG ACCCTTGAGG GGAGGGGCGA CTTTGCGGAA ACCGATGAGA TCATCCGCGC CGGACTGAAG CGGGATATAG ACCGCGAGGC CTCACAGTTG GCAACGTGGG TGACGAACAG GAAGGCGGTG ATCGAGAGTC TGGCGGACCT GGCGGAGTCC AATTCCCCCG AAAAGATGCA ACCGTACCTG GAGCAGGCAA CAAAGTCGGA CACCAATTTC TTGAGTGCCG GACTTCTTGA CCGAAGCGCC ACACTCAAAG CGTATTATTC CTTGCACGAC GACCTGGGGG AGATGAACAC CGGCAAGACC TTCATCAACG ACCACTACAT CCCCAAACTG AAGCAGACGC TAAAGCCGAT GCTGTCTGAA ACAGTCCAGG CCAAGGACAA TGGCCCCAAA CTGCTGGTAT CGATGCTCGC TCCGGTGGTA GTCCGGGGAG AGTACCGCGG CTATGTCATA GGGGTGCTCA GCCTGCAACA GATAAAGGAG CGACTGGAAA GGAGTAGCAG CTACAAGGGG ACCCATTACA CCCTTTTGGA CCGGAAAGGC ACCGTCATCA TGAGCAATCG TTTCGACCAG ACGACGATGA CACCTTTTCG GCGCGGCAAG GGAACCATCA AGAAACTGGA TAGCGGCATC AGCCAGTTTG TGCCGGACGC TCCCCCCAAT ACGCCTATGT CGGAGCGTTG GAAAAAATCC CGCTACAGCG CGGAATCGAC CATAGGAGAA CTGGCGGAGT GGCGACTGGT ACTGGAACAG CCTGCGGCGC CTTTCCAAAA GCAGCTTTAC GACAGCTGTA CCAAGAAGCT GATCCTGATC TCGCTGCTGA TGCTCATGGC GCTCGCCCTG GCATCCTTTT TGGGACGCCG GATCATGGTT ACCATAGAGA ACCTGCGCAG GGTGACCAAC GACCTCCCCT CAAGGCTTGC CTCCGGCGGA TTTGTCGATT GGCCGCAAAG CGGGATAACG GAGACCAACA ATCTGATCGA GAATTTCCAG GTGATGAGCG AAACCATCGA AGAACACCTC ACCGAACTGC AACTGCTCAA CGACTCCCTG GAGCAACGGG TACAGGAGCG GACGCAGCAG GTGGAACGAC TGGCAAGCGA GCAGCGAACC ATATTGACCA CCATGCCGAT AGGTGCATGC CTTTTGGTGG ACCGGAAGAT ACGCATGGCC AACCAGGCCT TCGACAAGAT TACCGGGTAC GAACCGGGTA AGACTCTGGG GATGGATACG GCTCAATTGC ACCCGGACCT GCAGTCGTAT CAGCAGTTTT GGGACGCCGC AAGCAGTGCG ACAGCCAAGG GCGGGATTCA TAGCGCCGAC ATGGAGCTGA GAAGAAAGGA CGGCTCCTTG ATCTGGTGCA ACCTCGTGGG GCAGATGGTG AACCCGACTG CACCTGAGGA GGGCTTCATC TGGATGGTCC AGGACATCTC CGAGCGCAAG ACCATGGAAT CTCAACTGCG CGCGAGCGAA ACCCACTACC GCCTGCTTAC CGAGGACGTC GCCGATGTGG TGTGGAAGTT GGACGCCGGT TACCGCTTCA CCTACATCAG TCCCGCCGAC GAGCGCCTGC GAGGCTATCG GGCGGACGAG GTCTTAGGGC ACACCATACT GGAGCAGACG ACGAGCGAGT GGCATGCCGC CATTACGGAA AAAATGCGCC CCGGCGAGGA TGCCCGGGAG ACCTCCCCCT TGGAAATACA GCAGCGCTGC AAAGACGGGC AACTTATCTG GACCGAGATC TTCTTCACCG CCGACCATGA CGCCGACGGA GCCATCACCG GCTACCACGG CATAACGAGG GATATCACCC AGAGAAAGCA GGCCGTGGAG CTGGAGCAGC AGTTGCTGCA TGCGCAAAAG CTCGAAAGCC TCGGCGTTCT CGCCGGGGGG CTCGCCCATG ACTTCAACAA CATCCTGATG GCGATCATCG GCAACGCCGA TCTCGCCCAG ATCCACCTCG GCACGGGTTC GCCGGCGGCG GAGAACCTGC AGAGGATCCA GAACGCCGCG TCGCGCGCCG CGGACCTGAC CAGCCAGATG CTCGCCTATT CAGGCAAGGG AAGGTTCGTG GTGGAGCGGA TAGACCTGAA CAGCCTGCTG GACGACATGC TGCACCTGCT TGAGGTCTCC ATCTCGAAGA AAGCGGCCTT GAAGTTCAAC CTGCACCGGC CGCTCCCGCA CATCGTTGCC GACGCGACCC AGATTCGGCA AGTGGTGATG AACCTGGTGA TCAACGCCTC GGAAGCAATC GGGGACAACA GCGGGGAAAT AACGATCTCC ACAGGATGTT CGCAGTGCGT TGAGGAGTGC CGGAGAGGGA GAAAGGTCAG CCGCGACCAG GGCGAGAGAC CATGCGTCTA TCTTGTCATC GCAGACACCG GCTGCGGCAT GGGCAGTGAA ACGGTGGCAA AGATTTTCGA CCCGTTCTTC ACCACTAAAT TCACCGGCCG CGGCTTAGGC ATGGCCGCGG TCCAGGGGAT CGTGAAAGGG CATAAAGGCA CCGTAAAGAT CTCCAGCGAG CCAGGCAGAG GGACGACGTT CACGATCTGT TTCCCCGTCG CCGAAGCGAC GGGCGAGGCA CCGGCGGAAA ACCGCAATGA AACCGGCGTC GAAGTTTGGC AGGGGAGCGG CACGGTTCTT CTGGTGGAGG ACGAGGATAC GGTGCGGGAG ATAGGGGTGC AAATGCTGGA AAGCCTTGGT CTCAAGGCGC TCACCGCTAA AGACGGCGAA GAGGCCGTCG AGGTCTTCAG GAGGGGGGAG GAGGTTTCCT TCGTGATCCT CGACCTGACC ATGCCGCGGA TGGACGGCAA GCAATGCCTG AGAGAGCTGC GCCGGTTGGA TCCGGCGGTC AAGGTGATCA TGTCCAGCGG CTTCAACGAA CAGGAGATCG CCCGCGACCT CGATGGGGGG CCGTGCGGTT TCATCCAAAA GCCGTACGAC CTGCCCGAAC TGCAACAGGC GATCGGGAAA TATATAGGAA AACCGGAACC GTTTGCCACG GAGAACTTCT GA
|
Protein sequence | MFTDDGHLAN GRTRSALIFL VLTAFGLAGN YFKFPLFLNI DFLFGSIFAL FALQLLGLRL GTAAAALIGC YTYVLWNHPY AAIILAAEAA AVGALMERRK YGMISADIVY WVFMGVPLVY LFYHGAMHVP LSNVHIIASK QAMNGVANAI VARLLFTVFA LRSRSAMIPF REMITSLLAF FVVAPALLLL TLEGRGDFAE TDEIIRAGLK RDIDREASQL ATWVTNRKAV IESLADLAES NSPEKMQPYL EQATKSDTNF LSAGLLDRSA TLKAYYSLHD DLGEMNTGKT FINDHYIPKL KQTLKPMLSE TVQAKDNGPK LLVSMLAPVV VRGEYRGYVI GVLSLQQIKE RLERSSSYKG THYTLLDRKG TVIMSNRFDQ TTMTPFRRGK GTIKKLDSGI SQFVPDAPPN TPMSERWKKS RYSAESTIGE LAEWRLVLEQ PAAPFQKQLY DSCTKKLILI SLLMLMALAL ASFLGRRIMV TIENLRRVTN DLPSRLASGG FVDWPQSGIT ETNNLIENFQ VMSETIEEHL TELQLLNDSL EQRVQERTQQ VERLASEQRT ILTTMPIGAC LLVDRKIRMA NQAFDKITGY EPGKTLGMDT AQLHPDLQSY QQFWDAASSA TAKGGIHSAD MELRRKDGSL IWCNLVGQMV NPTAPEEGFI WMVQDISERK TMESQLRASE THYRLLTEDV ADVVWKLDAG YRFTYISPAD ERLRGYRADE VLGHTILEQT TSEWHAAITE KMRPGEDARE TSPLEIQQRC KDGQLIWTEI FFTADHDADG AITGYHGITR DITQRKQAVE LEQQLLHAQK LESLGVLAGG LAHDFNNILM AIIGNADLAQ IHLGTGSPAA ENLQRIQNAA SRAADLTSQM LAYSGKGRFV VERIDLNSLL DDMLHLLEVS ISKKAALKFN LHRPLPHIVA DATQIRQVVM NLVINASEAI GDNSGEITIS TGCSQCVEEC RRGRKVSRDQ GERPCVYLVI ADTGCGMGSE TVAKIFDPFF TTKFTGRGLG MAAVQGIVKG HKGTVKISSE PGRGTTFTIC FPVAEATGEA PAENRNETGV EVWQGSGTVL LVEDEDTVRE IGVQMLESLG LKALTAKDGE EAVEVFRRGE EVSFVILDLT MPRMDGKQCL RELRRLDPAV KVIMSSGFNE QEIARDLDGG PCGFIQKPYD LPELQQAIGK YIGKPEPFAT ENF
|
| |