Gene GM21_2097 details


Gene Information

Locus tag: GM21_2097
Symbol:
ID: 8137433
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2436518
End bp: 2440033
Gene Length: 3516 bp
Protein Length: 1171 aa
Translation table: 11
GC content: 63%
IMG OID: 644869712
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_003021907
Protein GI: 253700718
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
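
The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2436518
END_BP = 2440033
GENE_LENGTH_BP = 3516
PROTEIN_LENGTH_AA = 1171


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of three.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: the interval spans 3516 bp, and 3516 bp corresponds to 1172 codons, that is, 1171 residues plus the stop codon.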


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.000000135263
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCCTACG ACCAATCCGC ATCCAGGCTG TCGGCTGAAG GTATGGAGGA GGCGCGGGCG 
CACCTGGCGG CCATCGTCGA TTCCTCCGAC GACGCCATCG TCAGCAAGTC CCTGGAAGGA
ATCGTCGCCA GCTGGAACCA TGGGGCCGAG AAGCTCTACG GCTACAGCGC CGCCGAGGCC
ATCGGTCGCC ATATTTCCTT CCTGGTGCCT CCGGAGCGCC TCGACGAGCT GAGGCAGATA
ACCGAGAAGA TCCTGCAAGG GGATCGGATC AAGAACCTGG AAACCGTCCG GCTTCACAAG
GACGGCCGTG AAATCCCAGT TTCCATAACC CTCTCCCCCA TCTTGTGCGA GACAGACACC
ATCATAGGGA TTTCCATGAT CGCCCGCGAC AATACCGAGC GCCTGAGGAT GGTGCGGCTT
TTGCGGGAGA GCGAGGTGCG TTACCGGGTT CTGGTGGAGA TGGCCCCGGA CGCGGTCTTC
GTGCACCAGG ACGGGCGTTT TGTCTACGCC AACTGCGCGG CGCTGGGCAT CTGCGGGGTG
CAGAGCAGGG AAGAGCTGCA ACGGCACACC CTGTTCGATC TGGTGCATCC CGAGGACAGG
GAGACGCTGC GGGATCGGAT CCACGAGCTG ATGGCGAACC AAGGCATAGC GCACCAGGAA
TACCGGTTGG CCTGCCTGCA CGGCGAGGAG CTGGTCCTGG AGACCTCCTC CAGCCTCATC
GAGTATCAGG GGGTCCCCTC CATACAGGTC ATCGCCCGCG ACGTGACGCT CAGGAAGCAG
CAGGAACGCG AGCGAGAGCA GATGCTAAAG GAACTCGCCT TCCAGCAGAA CCGCTTCGAG
ACCACGGTGA GGCAGTTGCC GGTCGGGGTG GTCATCGTCG AGGCCCCCTC GGGAAAGACT
CTTTATCAAA ACGAGCGCGC ACGGCAGATA TTCGGTCATG ACATCGGCGC CGTTTCCGGC
ATAGAACAGT ACCGACAGTG GGAGATGTTC CGCCTGGACG GCACCCCTCT TCCCGCGGAT
GAGTACCCCG TCGCCCGTTC CCTGCTGCAA GGGGAAGCCT GTTGCGGCGA CGAGTACAAG
ATCCGGCGGG CCGACGGCTC CTACGGCTAC ATCTTCGTCA ATGCGACCCC GTTACGTGAT
GGGTCAGGGG CGATCGTTTC CGCCGTGGCG GCCTTTAGCG ACATCACGGA GAGAAAGCTT
GCCGCCCAGG CACTCTTCGA GAGCGAGGAA CGCCTGAAGC TCGCGCTTGA GGCAGCCGGG
ATGGGATCCT GCGACATGGA GGTGAAGACA GGCGCCGGAA TCTGTTCCCG GCGCTACTTC
GCGCTTTTGG GCTACCCCGA GCCCGAGGGC GAATCCGCAC CGGCCACAAC CTCCATGTGG
CTCGACCTGG TACACCGGGA CGACCTGGAA GAGGTGCAGC GTCGGCTGGA GCAGGCCCGC
AGGGACAACA CCCTGTTTGG CTCGGAGCAC CGCATTGTCA AGGCCGGCAG CGAAGAGACG
GTCTGGGTCA ACGTGATGGG GCGCTTCATC TGCGAGCAGA CGGAAGAGAG GTGCCGCTTC
ATCGGCGTCA TCTTCGACAT CAGCGAACGC AAGGCTGCGG AGGAGGCGCT TTTGCGCAAC
ATCCGGCGCT TTCGCCGCAT GGCGGACTCG ATGCCCCAGA TATTCTGGAC CGCGAACGCC
GATGGGAGCG TGGACTACAT CAACGCCTAT TTTCAGGAGT ACGGCGGCAT CGACAGAACC
GGTGTGGAGG AGGGGCGTGT CACCGCCGAG AGTCTACTTT CCGAGGTGGT CCATCCCGAC
GACCATGACG GGCTGTGGAT CGCCTGGTGC CGGTCCATGG AGAGCGGCGA GCCGTTCCAG
TACGAAAGCC GGGTGCGCCG CAAGGACGGG GTGTACCGCT GGTATCTAAG CCGTGCCCGT
GCCGAGCTCG ACGAGCAGGG GAGGGTGGTC AAGTGGTACG GGACCGCCAC CGACATCGAC
GAGCTGAGGG AGACGCAGGA AAAACTCGCG GCCAGCGAGA CCAGATTCCG CTGGCTTTAC
GAATCGAACC TGATCGCCAT CTTCTACTGG AACCGCGACG GCGCCATCAC CGACGCGAAT
CAGGCCTACT GCGACCTCGC CGGATATACC GCCGAGGAGT GCCGGTCCGG TGGGCTCAAT
TGGCTGGACG TGACGGCCCC GGACTACCTG GACAGAGACG TAGCCGCCGT GGCCGAGAGC
CAGATCCAGG GCATCTGTAA GTCCTACGAG AAGCTCTTCA TAAACCACAC TACCGGCGAG
AAGGTGCCGG TACTGACCGC CATCGCCATT TCGGCCGGCT CAGACGAGGG GATCGGCTTC
GCCGTGGACC TGACCGAGCT GAAGCGGGCC GAACAGGCGC AAAAGCACAG CGAATCGACG
CTGAAACTCG CCGTCGAGAC CACCGGGCTC GGCATCTTCG ACCTCGATCT GAAAACCGGC
AAGGGGGAAT GGTCCCCCAT CGCCAAGAAG CACTACGGCC TTCCTGCCGA CACGGAAGTA
GGCCTCGCCA CGGTGATCGA GGGAGTCCAT CCGGAAGACC GCGAGAAGAT AGAGCGGATC
GCGAGGGACG CCGCGAGCCC GGGAGGCGCG GGCATTTACA GCGCCGAATA CAGGACCGTG
GGTGCTGTGG ACGGCAAGGT GCGCTGGATA AGCATGCGCG GCCGGGTCTT CTACGATGAG
GAGGGGACCC CGCTGCGCCT GGTCGGCGCC TGCCTCAACG TAACCGACGT GGTGCAGGCC
CAGGAGACGC TCAAAGAAGA GATGAGCGAA AGGCTGCGTG CGGTCGAGGA ACTGCGCCGC
CAGGAGCAGC TGCTGATCAG GCAGGGAAGG CTTGCCGCCA TGGGAGAGAT GATCGCCAAC
ATCGCCCACC AGTGGCGCCA ACCCCTGAAC ACGCTTGGGC TAATCATCCA GGAGCTGCCC
ACCTACCAGG AGCGGAACCT TCTCACACGC GATTACCTGG AAGGAAGCGT CTCCCGCGCC
ATGCAGGTGA TCAACTACAT GTCGCAGACC ATCGACGGGT TCCGCAACTT CTTCGGCCCG
GACAAGGAGC ACCAGACGTT CCTGGCGAGC GAGGTGCTGG AAAAAACGGT CTCCATCCTC
GACGCTGCCT TCGCCGAGCT GAACCTGGAG CTCGTGGTCC GGGTCGACCG CGAGGCGGTG
GTGCAGGGGA TCCCCAACGA GTACTCGCAG GTGCTCCTCA ACATCCTGAT GAACGCGAAA
GACGCGCTCC TGGAGCGCAA AGTCGAGCAT CCCAAGGTCG AGGTCAGGCT ATTCAAAGAG
GGGGAGAAAG CGGTGGTCAC CATCACGGAC AACGCAGGGG GGATACCTCC GGAGATCATG
GACAGGATTT TCGACCCGTA CTTCACCACC AAGGGACCGG ACAAAGGGAC CGGCATCGGC
CTGTTCATGT CCAAGACCAT CATCGAGAAG AACATGAAGG GCTCGCTCAC GGTGATTAAC
CAGCCGGAAG GGGCGCAATT CCGCATCGAG GTCTGA
 
Protein sequence
MAYDQSASRL SAEGMEEARA HLAAIVDSSD DAIVSKSLEG IVASWNHGAE KLYGYSAAEA 
IGRHISFLVP PERLDELRQI TEKILQGDRI KNLETVRLHK DGREIPVSIT LSPILCETDT
IIGISMIARD NTERLRMVRL LRESEVRYRV LVEMAPDAVF VHQDGRFVYA NCAALGICGV
QSREELQRHT LFDLVHPEDR ETLRDRIHEL MANQGIAHQE YRLACLHGEE LVLETSSSLI
EYQGVPSIQV IARDVTLRKQ QEREREQMLK ELAFQQNRFE TTVRQLPVGV VIVEAPSGKT
LYQNERARQI FGHDIGAVSG IEQYRQWEMF RLDGTPLPAD EYPVARSLLQ GEACCGDEYK
IRRADGSYGY IFVNATPLRD GSGAIVSAVA AFSDITERKL AAQALFESEE RLKLALEAAG
MGSCDMEVKT GAGICSRRYF ALLGYPEPEG ESAPATTSMW LDLVHRDDLE EVQRRLEQAR
RDNTLFGSEH RIVKAGSEET VWVNVMGRFI CEQTEERCRF IGVIFDISER KAAEEALLRN
IRRFRRMADS MPQIFWTANA DGSVDYINAY FQEYGGIDRT GVEEGRVTAE SLLSEVVHPD
DHDGLWIAWC RSMESGEPFQ YESRVRRKDG VYRWYLSRAR AELDEQGRVV KWYGTATDID
ELRETQEKLA ASETRFRWLY ESNLIAIFYW NRDGAITDAN QAYCDLAGYT AEECRSGGLN
WLDVTAPDYL DRDVAAVAES QIQGICKSYE KLFINHTTGE KVPVLTAIAI SAGSDEGIGF
AVDLTELKRA EQAQKHSEST LKLAVETTGL GIFDLDLKTG KGEWSPIAKK HYGLPADTEV
GLATVIEGVH PEDREKIERI ARDAASPGGA GIYSAEYRTV GAVDGKVRWI SMRGRVFYDE
EGTPLRLVGA CLNVTDVVQA QETLKEEMSE RLRAVEELRR QEQLLIRQGR LAAMGEMIAN
IAHQWRQPLN TLGLIIQELP TYQERNLLTR DYLEGSVSRA MQVINYMSQT IDGFRNFFGP
DKEHQTFLAS EVLEKTVSIL DAAFAELNLE LVVRVDREAV VQGIPNEYSQ VLLNILMNAK
DALLERKVEH PKVEVRLFKE GEKAVVTITD NAGGIPPEIM DRIFDPYFTT KGPDKGTGIG
LFMSKTIIEK NMKGSLTVIN QPEGAQFRIE V
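
The relationship between the gene sequence and the protein sequence can be checked by translating the DNA with translation table 11. The following is a minimal sketch, not the database's own method. It strips the spaces between the 10-base blocks and translates codon by codon until the first stop. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(seq_text.split()).upper()


def translate(dna_text: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna_text)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna_text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna_text)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First and last lines of the gene sequence shown above.
    first_line = "ATGGCCTACG ACCAATCCGC ATCCAGGCTG TCGGCTGAAG GTATGGAGGA GGCGCGGGCG"
    last_line = "CAGCCGGAAG GGGCGCAATT CCGCATCGAG GTCTGA"
    print(translate(first_line))  # MAYDQSASRLSAEGMEEARA
    print(translate(last_line))   # QPEGAQFRIEV
```

The two outputs match the first 20 and last 11 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the whole protein and against the GC content reported in the table.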