Gene GM21_1953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1953 
Symbol 
ID8137287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2262307 
End bp2264292 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content66% 
IMG OID644869567 
Producthistidine kinase 
Protein accessionYP_003021764 
Protein GI253700575 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones128 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTTC CCAGGCTGCG CTTTCCCATA AAGACCAAGC TCACCGTGGC CACGCTCATC 
CCGCTGGGGA TCGCCATCCT CATCTGCTGG ATGGCCGGGG TCTTCATCCT GAGCGCCAAG
GTGGCCGCCC AGGCCCAGGA GAAGGTCCGC TACGACCTGA GCGTCGCCCG CGAGGCGTAC
CAGGGCGAGC TTTCGCGCAT CTACGACGTG GTGAAACTCA CCGCCTCCTT CGGCAGCACC
GCCGAGACCA TCGCCTCCGG GGACCAGCGG GCGGTGAGCG AGGCGCTCTC CTCCGTACGG
GCCGGCGAGC GCCTGGACAT ACTCGCCGCC GTGGACGCCT CGGGGAAGGT CATCTTCAGG
GCCAACAACC CCGAGCTGCG CGGCGACGAC AAGAGCCGCA ACCAGTTCGT GGCCCGCGCC
TTGAAGGGGG AGATCGCCAG CGGCACCACC ATCCTCCCCT CTTCCGAGCT GGAGCTCGAG
GGGGATAGCC TGGTGCGCCA GGCGCGCGTC TATCCCGCCG CCGGGGCCCC GGTGCAGTTG
AACGGCGCCA TGTTGCTCCT GGTCGCGGCG CCGGTGCGAG ACAAGGCGGG GAACCTACTG
GGGGCGCTTT ACGGCGGGGT GCTCCTCAAC AACAACCAGA GGCTCGTGGA CCGGATCCGC
TCCGTGGTCT ACGAGGGTGC GCGTAAAAAC GGCAGGGACG TGGGGAACTC CACCATCTTC
CAGGGGGACA TAAGGGTCGC CACCAACGTC CCCAACACCG ACGGCAGCCG GGCCGTCGGC
ACCAGGCTCT CCGCGCCGGT CCAGGAGCGG GTGCTCCTCA AGGGGGCCAA GTGGGTAGAC
CGCGCCTTCG TGGTGAACGA CTGGTACCTG ACCGCCTACG AGCCGATCCT CTCGCTGCAG
GGGGTCCCGA TCGGCGCCCT TTACGTCGGG ATGCTGGAGA GCCAGTACTC GGCGGTCAAG
ATCGACATGG CGGTCCTTTT GAGCTTCGTC CTTTTGGTGA GCGGCCTGGT GGGAGTCTCG
ATGGCCGGGT TCCTGGGGAA AAAACTCTCC CAGCCCATAC GCGAGCTCGA TTCCCTGGCG
CGCCGTGTGG CGGCGGGAGA GCGCAACGTG AAAAGCAGCA TCGATTCGCG CGACGAGATC
GGGGACCTGG CGGGGAGGTT CAACGACATG AGCCGTTCGC TCGTGGAGCG CGAGGACAGC
ATCATCGAGC TGAACCGGAA CCTGGAAGAG AAGGTGCAGT CGAGAACGGC AGAGCTTGAG
GAGAAGAACC GGCTCCTGGT GCAGACCCGT GAGGAGCTGG TGCGGGTGGA GAAGCTGGCG
GCGATAGGCG AGCTGGCGGC GGGGGTCGCG CACGAGATCA ACAACCCGAT GGCGATCATC
CGCGGCAACA CGGAGCTGTT GCAGCTCTCG GTGCCGGAAG ACGCACCGAT CCGGGAGGAA
GTGGACACCA TTTTCCAGCA GGTGAAAAGG GTGGAGCGGA TCGTGTCGAA CCTGTTGAAG
TTCGCCAGGC GCGAGCAGAT GGAAGCGGGC ACCGTGCGGC TGAACGAGCT CTTGCACGAG
ATCGTCGGCC AGATCGGGCA CCAGGTGTCG CTTGAGGGGA TAGAGATAGT CGAGCAGTAT
GCGGAGAGCG TGGCGCAGGT GGAGGGGGAT GCGGACCAGT TGCGGCAGGT GTTCACCAAC
CTGGTCTTGA ACGCGGTGCA GGCGATGCCG GCAGGGGGAG TTCTTTCGGT GCGGACCCGG
CCATTAGAGC CGGAGCGCAG CTACGAGGTG AAGGTCGCCG ACACCGGCGT CGGCATCGCG
CTGGAAAACC TGAGGCAGGT ATTCAACCCC TTTTACACCA CCAAGGCCAA CGGCACCGGC
CTGGGACTTT CCGTCTCCTA CGGCATCGTC CGCGAGCACG GCGGGCTCAT AGACGTAGAG
AGCGTCCCGG ACGGGGGGAG CACCTTCACC GTGGTGCTCC CGCGCTCCCA GGCCCCGGGC
GCCTAA
 
Protein sequence
MNLPRLRFPI KTKLTVATLI PLGIAILICW MAGVFILSAK VAAQAQEKVR YDLSVAREAY 
QGELSRIYDV VKLTASFGST AETIASGDQR AVSEALSSVR AGERLDILAA VDASGKVIFR
ANNPELRGDD KSRNQFVARA LKGEIASGTT ILPSSELELE GDSLVRQARV YPAAGAPVQL
NGAMLLLVAA PVRDKAGNLL GALYGGVLLN NNQRLVDRIR SVVYEGARKN GRDVGNSTIF
QGDIRVATNV PNTDGSRAVG TRLSAPVQER VLLKGAKWVD RAFVVNDWYL TAYEPILSLQ
GVPIGALYVG MLESQYSAVK IDMAVLLSFV LLVSGLVGVS MAGFLGKKLS QPIRELDSLA
RRVAAGERNV KSSIDSRDEI GDLAGRFNDM SRSLVEREDS IIELNRNLEE KVQSRTAELE
EKNRLLVQTR EELVRVEKLA AIGELAAGVA HEINNPMAII RGNTELLQLS VPEDAPIREE
VDTIFQQVKR VERIVSNLLK FARREQMEAG TVRLNELLHE IVGQIGHQVS LEGIEIVEQY
AESVAQVEGD ADQLRQVFTN LVLNAVQAMP AGGVLSVRTR PLEPERSYEV KVADTGVGIA
LENLRQVFNP FYTTKANGTG LGLSVSYGIV REHGGLIDVE SVPDGGSTFT VVLPRSQAPG
A