Gene GM21_2369 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2369 
Symbol 
ID8137710 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2758775 
End bp2762215 
Gene Length3441 bp 
Protein Length1146 aa 
Translation table11 
GC content57% 
IMG OID644869984 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_003022175 
Protein GI253700986 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones154 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTATTA AGATGAAAAT GATGTGGGCA GTGTTCCTGC TTATTGCCTC ATTGACATCC 
GTCCTTACGC TTGCCTCATA CCAGTACCTA GCGTACGTCA TCAGAGAAGG CCTTGTCAGG
CAACATGACG CTGTACTCGA TGCCGCCGCA CACCACCTAG ACGAGGACAT ATCTCAGCTG
CAGACCCTGC TCTCCCGGAC GGCCAACAGG ATAGAGCTGC AATTGCTCAA CCCGGAAGGG
TTACAGAGGG TCCTTGGGGA GGATGAATCC ACGGCTGCTC TTTTCGATGG CGGCCTGGAG
ATCATTGATT CCGGTGGCCG GATCCTTGCC GCAGTCCCCT TCGGCACCGG GAAGGTCGGA
ACCAGGGTGC CTCCAGGCAA TTTCAATATG ACGCTTGCAC AGGGCAAACC GCAGATCTCC
GCCCCCTACC GCTCAGGAAC TACGCACACG CGCCCAATGA TCACCTTCAG TGCCCCAATC
CTGCGGCCAA ACGGCAGCGT CGCCGGTATC CTCGCAGGCC ATAAAGACTT GCTCAAGGAA
GGCCCTCTGG CTGGGTTTAG CCATCTACGG TTCGGCAAAA ACGACATCTT CTTCGTAGTA
GGGAGAAATA GGACCATCAT TATGCACCCG GACCAGCGCC GGGTCATGGA GCAGCTGTCG
CCGGGAAAAT CCCCTGTACT TGATAGCATC ATCGAGCGCA ATCATTACCA GCCCCAGGAA
GAAATCAGCT CCGACAGTGA GCACGAGATC ATAACGGCAA GGCCGCTGAA AAACGCAGAG
TGGCTCCTGG TGAGCCGTTA CTCCGTTTCG GAACTCTTCG CGCCGCTCGA CCAGCCGCGC
TGGTTCTTCG CTGCAGCCTT TATCATCACG ATGGTGATGG CGCGGGTCAT TCTGACTGTC
TTGGTCAGGC GCATTATCTC GCCGCTGTTG CGGCTGATCG ATCACGTACA GAACCTCCCC
TCGAAGACAG GCGAAGAGCG TGTGCTTGCG AACGGGAGCG GCGACGAGGT GGAATCGCTT
ACCCGCGCGG TGAACGATAT GGTGCAGGAC ATGGACAGGA AGAAGGAAGC GCTCCTGAAA
AGCCAGGAAG TCTACCACAT AATCGCCGAA TTCACCTCAG AACTCGCCAT AGTGAGAAAC
CCAGACGCGT CCATCCGCTA CATTTCCGCC AATTGCCTGG CACTCACAGG CTACACCGAT
CGCGAGTTCA TGGAGAAGCC TGAACTTCTC GAAGCGGTAA TCCACCCCTC TGACGCTGAT
ATCTGGCGCG CCCACTGCAC TCCCCCCTGT GGGAATGAAG TTGACTTCAA TCTGCGTCTG
TTGACAAAGC AGGGTGAGTC CCGTTGGTTC AGCTACACCT GCCACGCGGT TACCTCCCCC
GATGGGGCCT ACTTGGGGGT CCGCGGCAGT TTTCGCGATA TTTCGCATCG GGTGATGCTG
GAACAGCAGC TTTGCGACCA GAGGGAATTT GCGCGCAACC TGCTGGAAAG TACCTCAACT
CCCTTGTTCG TGATAGATCA GGACCATAAG GTGATTGTCT GGAACAAGGC CTTGTCCGAG
CTCACCGGAA TCCCTTCTTC CGACGTTATC GGTACCAACC GGCATTGGCG GGCGTTCTAT
CTTGAGCCGC AGCCCACGCT GGGTGATTTT TTGGTCGAGC TGCGTCCCGA GGAGGTCGGT
GACTTGGAGG GGAGGTTCGA GCGCGTAGCA GTGCAGCGAG GTAACCTGCA GGCGGAAAGG
TGGTTCAACA CCATAAACGG GGAGCGACGC CGCCTGCTGG CGAATGCCTC CCAGGTATAC
CGGGACGGCG AGGTGGTCGC GGTGGTCGAG ACCCTCCACG ACATCACTGC GAGGACGCAG
GCGGAGCAAT CTCTGCGCCT GTTGGCGCAG GCTGTGGAGC AGACAGCGAG CTCGATCCTC
ATCCACGATC TGCAGGGGGA GATGACCTAC GTCAACAACA AGTTCTGCCA GGTGACCGGC
TACAGCAGGG AGGAGACGCT GGGCAAAAAA ATAGTCATGC TGAAGACGGA GCCGAAGGTG
TTCGATGAGA TCTGCCGCAC CACGAAGTGC GGTCAACCAT GGCACGGTGA GTTGCAGAGC
AGGCGTAAGG ACGGCAGCTT GTTCTGGGAA AGGGCCACCA TATCCCCCAT TGCCGATGAC
AAAGGTGCCA TCAGCCACTA CCTCGCGGTG AAAGAGGACA TAACGGATGC CAAAGAGGTC
GAGTATCTGC TTAAACAGCA GCAGGCCGAA CTGTTGCTGA AGCACGAACA GCTGTCGGAG
CTGTTTGTTC AGGTTGAAAA GGCAAAGCGA GAGTGGGAGC AGACCATGGA CTGCGTAGAT
GACATGGTTG CCCTGGTGGA AAAAGACGGA AGCATAAGAC GCTGTAACCG CGCCTTCATG
CAGTTCGTGA ATTGCGGATA CAACGACCTT CTCTCTCGTA ATTGGCGCAT GCTATTGACA
AAGACCGGGC TGGATCTGGA TGGGCATGGG GAAGGCCAGT TCTTACACCA ACCGACCCAG
CGCTGGCTGG CGCTCAAAAC CTATACCTGG GACGCTGAGC AGGCAAAGGT GATTACCTTG
CACGACCTGA CTGAAGTAAA GAGGGTCTCA GAACAGCTTG TGACTGCATA TCAAGAGCTC
AAAGCGACTC ACTCGCAGCT TTTGCAGCAG GAGAAAATGG CTTCCATAGG GCAGCTCGCA
GCAGGGGTCG CCCATGAGAT CAACAACCCG ATGGGGTTCA TATCGAGCAA CCTGAGCACC
CTTGAGAAAT ACCTGGAAAG GATCAGCAGC TTCATCACGC TGCAGTCGGC AAAGGTGATG
CCAAACGCCA CCGCCGAGGT GTTGGCGGAA TTGACGCAAG CAAGGCAGAC GCTGAAGGTC
GATTACATCC TGCAGGACGC GCCCGATTTG GTCGCGGAAT CGATGGACGG AGCGAATCGG
GTGAGGAAGA TAGTGCAGAA CCTGAAGACA TTCTCCAGGG TCGACGACGC TGAAACCATG
TACGTGGATC TCAACGACTG CCTGGAGAGC ACTGTAACAA TCGCCTGGAA CGAGCTCAAG
TACAAGACGA CCCTGAATCG TGACTACGGC GAGCTTCCCC CGGTCAAATG CTTCCCGCAT
CAGTTGAACC AGGTGTTCCT GAACATCCTG GTGAATGCGG CTCATGCCAT CGAGGTACAG
GGCGAGGTGA CCATAACCAC CCGCTGCCTC GGGGAGACGG TCAAAGTAAC GATTAGCGAC
ACCGGCTGCG GCATTCCAGA CGAGATCAAG GAACGGATCT TCGAGCCCTT TTTCACCACC
AAGGAGGTGG GCAAGGGAAC CGGCCTGGGG CTTTCCATCA GCTACGACAT CGTAAAGAAG
CACGGCGGCA GCATCGAGGT GAAAAGCACT CCTGGAAAGG GAACCACATT CAGCGTTGTG
CTGCCGGTGG AGGGAACATA G
 
Protein sequence
MSIKMKMMWA VFLLIASLTS VLTLASYQYL AYVIREGLVR QHDAVLDAAA HHLDEDISQL 
QTLLSRTANR IELQLLNPEG LQRVLGEDES TAALFDGGLE IIDSGGRILA AVPFGTGKVG
TRVPPGNFNM TLAQGKPQIS APYRSGTTHT RPMITFSAPI LRPNGSVAGI LAGHKDLLKE
GPLAGFSHLR FGKNDIFFVV GRNRTIIMHP DQRRVMEQLS PGKSPVLDSI IERNHYQPQE
EISSDSEHEI ITARPLKNAE WLLVSRYSVS ELFAPLDQPR WFFAAAFIIT MVMARVILTV
LVRRIISPLL RLIDHVQNLP SKTGEERVLA NGSGDEVESL TRAVNDMVQD MDRKKEALLK
SQEVYHIIAE FTSELAIVRN PDASIRYISA NCLALTGYTD REFMEKPELL EAVIHPSDAD
IWRAHCTPPC GNEVDFNLRL LTKQGESRWF SYTCHAVTSP DGAYLGVRGS FRDISHRVML
EQQLCDQREF ARNLLESTST PLFVIDQDHK VIVWNKALSE LTGIPSSDVI GTNRHWRAFY
LEPQPTLGDF LVELRPEEVG DLEGRFERVA VQRGNLQAER WFNTINGERR RLLANASQVY
RDGEVVAVVE TLHDITARTQ AEQSLRLLAQ AVEQTASSIL IHDLQGEMTY VNNKFCQVTG
YSREETLGKK IVMLKTEPKV FDEICRTTKC GQPWHGELQS RRKDGSLFWE RATISPIADD
KGAISHYLAV KEDITDAKEV EYLLKQQQAE LLLKHEQLSE LFVQVEKAKR EWEQTMDCVD
DMVALVEKDG SIRRCNRAFM QFVNCGYNDL LSRNWRMLLT KTGLDLDGHG EGQFLHQPTQ
RWLALKTYTW DAEQAKVITL HDLTEVKRVS EQLVTAYQEL KATHSQLLQQ EKMASIGQLA
AGVAHEINNP MGFISSNLST LEKYLERISS FITLQSAKVM PNATAEVLAE LTQARQTLKV
DYILQDAPDL VAESMDGANR VRKIVQNLKT FSRVDDAETM YVDLNDCLES TVTIAWNELK
YKTTLNRDYG ELPPVKCFPH QLNQVFLNIL VNAAHAIEVQ GEVTITTRCL GETVKVTISD
TGCGIPDEIK ERIFEPFFTT KEVGKGTGLG LSISYDIVKK HGGSIEVKST PGKGTTFSVV
LPVEGT