Gene GM21_2103 details


Gene Information

Locus tag: GM21_2103
Symbol:
ID: 8137439
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2447136
End bp: 2448284
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 62%
IMG OID: 644869718
Product: histidine kinase
Protein accession: YP_003021913
Protein GI: 253700724
COG category: [T] Signal transduction mechanisms
COG ID: [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase)
TIGRFAM ID:
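
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names `span_length` and `coding_length_aa` are illustrative and not part of any database API.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 2447136
end_bp = 2448284
gene_length_bp = 1149
protein_length_aa = 382


def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def coding_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


computed_bp = span_length(start_bp, end_bp)
computed_aa = coding_length_aa(computed_bp)
print(computed_bp, computed_aa)  # prints: 1149 382
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.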


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 84
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGATGG AGCTCGCCGA GCGGCTTTAC AGACTGATGT TCACCAACAT GCGGGAAGGG 
CTGGCCATCC TGCGCGTCGA CCCCGATGAT GGCGAAGCAT CACCGGCGGT GATCGAGATG
AACCCCGCCG CCGTGAAACT CTGCAGTTGC AGCTCGTTCA ACTTCGGCAA CTGCAAGGTC
GTCGACTGTT TCCCCGGCTG GTTCGACGAG GACCGGCTTC GTGAAACCTG CAGACGCCTG
GCGGCCTTCG GCGGCATGGT GGAACTGGGC GAGGTCTCCT GGGGAGGGTC CAGTTATCGG
GCCCGTATCT TCGGGTTGTC GGAGGAGCAT TTAGGGCTCG TCGTAGACGA CGTTACGCGG
CAGAAAAAGG GGGAGGGGGA GATAGCGAGG CTCTCAACCC AGGTCGAGCA GCAGACAGCG
GGGCTGGAAA AGCGCGTGGC GGAGAGGACC GCGCAGTTGC AGGAGATGAA CGAGGAACTC
GACAGCTTCG CCTACTCGGT TTCCCACGAC CTGCGCGCGC CACTGCGCGC CATGCGGGCC
TTCGCCGGGA TACTGCTGGA GGAGGAGCAA AGCGAGGCCG AGCGGGTAGC GTACCTGAAA
CGGATCCAGG GGGCGGCGGA GGGAATGGAG CGCCTGATCC AGGATCTCCT CGCCTACAGC
CGCGTCGGCC GCCAGGAACT GGTGCTGCAG CGCGTCAGCC TCGATGAGGT GCTCGCCGAC
GCGGCTAAGC AACTGGATCT AACCAGCGGG GGCAAGAGCT ACCGTTTGGA GGTGCAGGAC
CATCTTCCCG ATGTGGTCGG GCACCATACG GTGCTGGTCC AGGTGGTTTT GAACATCATG
GGGAACGCCA TAAAGTTCGT TCCCAAAGGG GTGGTCCCGG CGCTGGAAGT GTGGGCCGAT
GAGATGGACG GAGAGTGCCG CCTTAACATC GCCGACAACG GCATCGGTAT TGCGCCTGAG
CACCAGGAGC GGATCTTCAA GATCTTCGAA AGGCTGCACG GCATCGAAAG CTACCCCGGC
ACAGGAATCG GGCTAGCCAT CGCACGTAAG GCGGTCACCA GGCTTGGGGG AAGGATAGGG
GTAGAGTCGT TGGAAGGGGA AGGGAGCAGG TTCTGGATCG AGCTTAAAAA AGCCGTTCGC
TCGTCCTGA
 
Protein sequence
MEMELAERLY RLMFTNMREG LAILRVDPDD GEASPAVIEM NPAAVKLCSC SSFNFGNCKV 
VDCFPGWFDE DRLRETCRRL AAFGGMVELG EVSWGGSSYR ARIFGLSEEH LGLVVDDVTR
QKKGEGEIAR LSTQVEQQTA GLEKRVAERT AQLQEMNEEL DSFAYSVSHD LRAPLRAMRA
FAGILLEEEQ SEAERVAYLK RIQGAAEGME RLIQDLLAYS RVGRQELVLQ RVSLDEVLAD
AAKQLDLTSG GKSYRLEVQD HLPDVVGHHT VLVQVVLNIM GNAIKFVPKG VVPALEVWAD
EMDGECRLNI ADNGIGIAPE HQERIFKIFE RLHGIESYPG TGIGLAIARK AVTRLGGRIG
VESLEGEGSR FWIELKKAVR SS
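
The relationship between the two sequences above can be illustrated with a short translation check. This is a sketch only, not the pipeline that produced the record: it builds the codon table by hand in the NCBI base order (T, C, A, G), relying on the fact that translation table 11 assigns the same amino acids to internal codons as the standard code. The helper names `clean` and `translate` are hypothetical.

```python
# Amino acid assignments in NCBI order (bases T, C, A, G at each position).
# Translation table 11 shares these assignments with the standard code;
# it differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Drop the whitespace used to group bases in blocks of ten and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGAGATGG AGCTCGCCGA GCGGCTTTAC AGACTGATGT TCACCAACAT GCGGGAAGGG"
last_line = "TCGTCCTGA"

print(translate(first_line))  # prints: MEMELAERLYRLMFTNMREG
print(translate(last_line))   # prints: SS
```

The first 60 bases translate to the first 20 residues of the protein sequence, and the final nine bases give the terminal `SS` followed by the `TGA` stop codon.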