Gene GM21_0952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0952 
Symbol 
ID8136273 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1129548 
End bp1130996 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content64% 
IMG OID644868567 
Producthistidine kinase 
Protein accessionYP_003020776 
Protein GI253699587 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones73 
Fosmid unclonability p-value0.483334 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATAA TGATCAAGTA CAGGCTGTTG ATCGCCATGC TCGCCGCGAC CGGCGCCGTG 
GTGATCTCGA TGTTTCTCAT CATGCAGTGG AGCATCGGGC GCGGCTTCCT GGCCTACGTG
AACACCATGG AAAAAGATCG GCTCGAACGG CTCGCCCAGG TGATGGAGCG GGCGTACCGC
ACCAACGGCA GCTGGGACTT CATAAAGGGA GACGAAGCTA CCTTGTTCAA ACTGGTTGCA
GCCAGCAGGG AGACGAACGA CCTGGACCGG AGCGACCGCG GCGAGCGGAG AATGCGCCCC
CCCCCGCCGT TTCCCAACCG CTCCGGGCAC TCCTTCCCGG CGCGCTTCCA GTACCGTTTC
GAGGGAAGGG TGCTGCTGCT CGATGCACAA AAGGTGAAAC TGGTCGGCAT GCCGAGCGCC
CAGGAGGCGG CGCAACTGCA CGAAATCACG AGCTCCGACC GCATCGTCGG CTACGTGGGG
CTGCGCCCCA GGCAGCGCCT CACCGACACC TATCAACTCC TTTTCGTCAA GCAGCAGAAA
CTCACCATGG GGCTCGTGGC GGTGATCATG TTCCTCGTTT CCGCTGCGAT CTCGCTCCCT
TTGGCCCACC GCCTGGTGCT CCCCATCAAG AGGCTCGCAG CCTCCATGCA CCGCTTAGCC
TCCGGCGAGT ACGGCACCAG GGTGGCGGTC GGCCCCGAGG ACGAACTTGG GCAACTGGCC
CGCGACTTCA ACACGCTCGC GCTGACGCTG GAGAACAACG AACGGGCGCG AAGGCGCTGG
GTGGCGGACG TCTCGCACGA GCTGCGCACG CCGCTTGCCA TCCTGCGCGG GGAGATCGAG
GCGATCCAGG ACGGCGTGCG CCAGGCAGGC CCGGAGTCGA TGCGCTCCCT GCACGGCGAG
GTCATGCACC TAAGCCGCCT GGTCGACGAT CTCTACCAGC TCTCCCTTTA CGACATCGGG
GCGCTGACCT ACCGCAAAGA GAGTGTCGAT CTTAAGGAGG TGCTGGAGGA TGCGCTGACT
TCGGTCGGGC AGGAATTAAT CCAGAAGGGA ATCAACCTCT CCATCGAGCT GCCGCGAGAC
GACGGCTGCT CCGTCTTCGC CGACCCTGAC CGGTTGAGCC AGCTTTTCTC GAACCTTTTG
GACAACTCGC TCAAGTACAC AGATGCCGGG GGGAAACTGG CGGTCAGGCT CCAGCGCGGG
CCGAACACGG CGCAGGTCGA GTTCGCCGAC AGCGCGCCGG GCGTGGCACC GGACCAGTTG
CAGCGCCTCT TCGACAGGCT GTACCGGGTG GAAAGCTCGC GTAACCGCGC CAAAGGGGGC
GCGGGGCTGG GCCTTGCCAT CTGCAAGAAC ATCGTGGAGG CCCACGAAGG AACCATAGCC
GCCCTCCCCT CCCCGCACGG CGGGGTGCTG ATCAGGGTGG AACTGCCGCT TATCGGGAGC
AGGACATGA
 
Protein sequence
MKIMIKYRLL IAMLAATGAV VISMFLIMQW SIGRGFLAYV NTMEKDRLER LAQVMERAYR 
TNGSWDFIKG DEATLFKLVA ASRETNDLDR SDRGERRMRP PPPFPNRSGH SFPARFQYRF
EGRVLLLDAQ KVKLVGMPSA QEAAQLHEIT SSDRIVGYVG LRPRQRLTDT YQLLFVKQQK
LTMGLVAVIM FLVSAAISLP LAHRLVLPIK RLAASMHRLA SGEYGTRVAV GPEDELGQLA
RDFNTLALTL ENNERARRRW VADVSHELRT PLAILRGEIE AIQDGVRQAG PESMRSLHGE
VMHLSRLVDD LYQLSLYDIG ALTYRKESVD LKEVLEDALT SVGQELIQKG INLSIELPRD
DGCSVFADPD RLSQLFSNLL DNSLKYTDAG GKLAVRLQRG PNTAQVEFAD SAPGVAPDQL
QRLFDRLYRV ESSRNRAKGG AGLGLAICKN IVEAHEGTIA ALPSPHGGVL IRVELPLIGS
RT