Gene GM21_1723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1723 
Symbol 
ID8137054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2007445 
End bp2009439 
Gene Length1995 bp 
Protein Length664 aa 
Translation table11 
GC content63% 
IMG OID644869335 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_003021535 
Protein GI253700346 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTC TCGCCATAGA CGACAACCAC GACAATCTTC TCACGCTGCA GGCGTTGCTG 
AAGATTTTCA TGCCCGAGGC GAACCTCATC ACCTCCGCCT CGGCCCAGGA CGGCATCCGC
AGGGCGCTCT TGGAAGCGCC CGACACCATC CTTTTGGACA TCCAGATGCC GGGCATGGAC
GGGTACGAGG CGACCCGCAG GCTGAAGGGT AACCCCTCCA CCCAGCATAT CCCCATCATC
CTGGTCACCG CGCATCGCAC CGACCCCGCC GGCAAGGTGC GGGGGCTCGA GTGCGGCGCC
GACGCGTTCC TCTCCAAGCC CATCGACGAG GCTGAACTCG TGGCCCAGAT CAGGGCGATG
GTGCGGATCA AGCGTTCCGA GGACGCGCTG CGCCTAGAGC GGGACGGCCT GGAAGCCCTG
GTGCAGAAGC GGACCTCGGA GCTTTTGCTC ATGAACCAGG AGCTGACGGA GAACCTGGCC
CGTCTGGCCG AAAACGAGGC CCGCTACTCC CGGGCGGTGC GGGGGACGAC GGACGGGCTT
TGGGATTGGA ACATGGTGAC CGGGGAGTGC TATTTTTCCC CGCGCTGGAA GGAACTCTTG
GGCTTTGCCG ACGAAGAACT CCAAAACGAG GTCGATACCT TTTATTCGCG GCTGCATCCC
GACGATATAT CACTGGCGCA GGGGGCGCTC GACTCGCACC TGGCGGGGAA AGCGCCCTTG
GACGTGGAGC TCAGGCTGAG AAACCGCGAA GGAGGGTACC TGCGCTTTCG CACCCGCGGC
CAGGCCGAGT GGGACAGCCT GGGAGCGCCC GTTCGCCTCT CCGGAGCCCT TTCCGACATT
ACGGAGCGGC AGCTCATGGA GGAGCAGCTC CGGCAATCGC AGAAGATGGA GGCTGTAGGG
CAGCTGGCAG GAGGGGTCGC GCACGACTTC AACAACATCC TCACCGTGAT CGCGGGCTAC
GCGAACATGC TCAAGATGGA CCTCCCCGCG CTCGGCAACG AGGTGAGGCT GGCGGAACAG
ATCGCCACCG CCACCGAAAG GGCTGCCGAA CTGACCAAGG GACTGCTGGC TTTCAGCAGG
AAACAGCCAA TGGCGCCTCA GAAGCTGGAC CTGGGCGAGA TCGTGCAGCG CGTGGAGACC
TTCCTCACGC GCGTCATAGG GGAGGACATC CAGCTCAAGA CGCAGCTGCT ACCGGACTCT
CTGCCGGTCT GTGCCGACAT GGGGCAGATC GAACAGGTGC TCATCAACCT GGCTGCCAAT
GCGCGCGACG CCATGGAAAA CGGAGGGACC TTTACCATCG CGACCGGCCT CTTCGAGGCG
GACGGCTCGC CTTCCGCCTT CTGTGGCCCG GCTGGTAGCT ATGCCGTCAT CATCGTCTCC
GATAACGGCA AGGGGATGTC CTCCGAAACC TGCAAGCGGA TCTTTGAGCC TTTCTTCACC
ACCAAGGAGG TGGGGAAGGG GACCGGGCTC GGCATGGCCA TCGTCTACGG CATCGTGCAG
CAGCACAAGG GTTACATCAC CGTCGACAGC AAACCAGGGG GCGGCACCGT CTTCAGCATC
TATCTGCCGC TTGCCGCGGA TAGCGCCGAA ATGCACCAGG AGCCGGCGGC CGACCAGGAG
CCGGAACAGG GGACAGAGAC CATCCTGGTC GCCGAAGACG ACCCGAGCGT GCGCAACCTG
GTGGACATGG TGCTGACCAA ACACGGCTAC CAGGTGATCC TGGCCGAGAA CGGGCAGGAG
GTGGTGGAGC GCTTCACCGC CCACTCAAGC GACATAGGGT TGATCCTGAT GGACATCATC
ATGCCGCGCA AAAACGGCAT AGAGGCGTTC GCCGAGATCA AGAAGCTGCA GCCGGAGGCG
AAGGTCCTCT TCACCAGCGG CTACACCTCC GATTTCATCC AGAGCCGCGG CATGGAAGAA
GGGGTCGAAC TGATCATGAA ACCGGTGCAG CCGGTCCAAC TGCTGCGCAA GGTGAGGGAG
GTGCTCGAAA GGTGA
 
Protein sequence
MKILAIDDNH DNLLTLQALL KIFMPEANLI TSASAQDGIR RALLEAPDTI LLDIQMPGMD 
GYEATRRLKG NPSTQHIPII LVTAHRTDPA GKVRGLECGA DAFLSKPIDE AELVAQIRAM
VRIKRSEDAL RLERDGLEAL VQKRTSELLL MNQELTENLA RLAENEARYS RAVRGTTDGL
WDWNMVTGEC YFSPRWKELL GFADEELQNE VDTFYSRLHP DDISLAQGAL DSHLAGKAPL
DVELRLRNRE GGYLRFRTRG QAEWDSLGAP VRLSGALSDI TERQLMEEQL RQSQKMEAVG
QLAGGVAHDF NNILTVIAGY ANMLKMDLPA LGNEVRLAEQ IATATERAAE LTKGLLAFSR
KQPMAPQKLD LGEIVQRVET FLTRVIGEDI QLKTQLLPDS LPVCADMGQI EQVLINLAAN
ARDAMENGGT FTIATGLFEA DGSPSAFCGP AGSYAVIIVS DNGKGMSSET CKRIFEPFFT
TKEVGKGTGL GMAIVYGIVQ QHKGYITVDS KPGGGTVFSI YLPLAADSAE MHQEPAADQE
PEQGTETILV AEDDPSVRNL VDMVLTKHGY QVILAENGQE VVERFTAHSS DIGLILMDII
MPRKNGIEAF AEIKKLQPEA KVLFTSGYTS DFIQSRGMEE GVELIMKPVQ PVQLLRKVRE
VLER