Gene GM21_2571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2571 
Symbol 
ID8137913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3002030 
End bp3003250 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content62% 
IMG OID644870179 
Productresponse regulator receiver sensor signal transduction histidine kinase 
Protein accessionYP_003022369 
Protein GI253701180 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value0.156881 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTGC TAGTTGTCGA CGATTCCCGC AACGATCGCA AGATCATCCG CTACAACTTC 
GAATGGCACG GCTGCGAGGT CCTGGAGGCC TCCAACGGGA AGCAGGGGCT GGAGCTGGCA
GCGGCGGAGA AACCCGACCT GATCGTTTCC GACTGCCTGA TGCCGGTCAT GGACGGCTTC
CAATTCCTTC ACGAAATCAA GAAGTTTCAA GACACAAAAA CCATCCCCTT CATCTTCTAT
TCAGCCGTCT ATACCGGGAG CCGCGAGGCG GAATTGGCGG CTTCCCTGGG AGCCCGCGCA
TTCCTGGCGA AACCCATGCG CCCGGAGGAG CTTTGGGATG AGGTGGGGCG GCTGATGGCG
GCGGAGCCGG CCGGCGAGGC GGTGGAGCGA AAGCCGTGGC CCGAAGAGGA ATTCCTCAAG
AACTATAGCC AGCTGGTAGC GGGCAAGCTG GAGGAAAAGG TCCGCGAGCT TACCGAAACG
AACGAAAGCC TCCTCAGGCT GAACAGCGAA TTGGAGCGCA GGGTGGTGGA GCGGACCTCG
CAACTGGAGG CAGCAAACCG CGAGCTCGAC ATGTTCAGCT ATTCCATCTC CCACGACCTG
CGCGCCCCTT TGCGGCACCT GGAGGGGTTC AGCCAGGCGC TGATCGACGA ATACGCGACC
AAGCTGAACC ACACGGGGAG GGAGTACCTG GAGCGGCTCA GGAAGTCCGC CCGACGGCTG
ACGGACATGA TAGACGCGCT CTTGGAACTG TCGCGGCACA CGAGGGGGAA GCTGGTCAAG
GAGAGCGTGG ATTTAACCTC CATCGCCAAG GAGGTCGCGG CTCAACTGGC GCGGTCCCAG
CCGGAGCGTA AGGTATCGAT GGAGGTGGCG GAGGGGATGA TGGTGCGCGG GGACTCGCGG
TTGTTGAAGG TGGTGCTGGA GCAATTGATC GGCAACGCCT GGAAGTTCTC GCAACCGCGA
GGGGAGGAGG CGCTGGTAGA GGTCTTTCCC ACCGAGCTTG AAGGGCGACC CGCCTGCGCG
GTCAGGGACA ACGGGGTCGG CTTCGAGATG GAGTACGCGG ACAAGCTCTT CTCCCCGTTC
CAGAGGCTGC ACGCGCAGGA CGAGTTCCCC GGCCGCGGGA TCGGGCTCGC CATCGCCAAG
AGGATAATCA CCCGCCATGG AGGGAAGATG GAGGCGCAGG CCGAACTGGG GAAGGGGGCG
ACCTTCACCT TCAGCGTCTA G
 
Protein sequence
MKVLVVDDSR NDRKIIRYNF EWHGCEVLEA SNGKQGLELA AAEKPDLIVS DCLMPVMDGF 
QFLHEIKKFQ DTKTIPFIFY SAVYTGSREA ELAASLGARA FLAKPMRPEE LWDEVGRLMA
AEPAGEAVER KPWPEEEFLK NYSQLVAGKL EEKVRELTET NESLLRLNSE LERRVVERTS
QLEAANRELD MFSYSISHDL RAPLRHLEGF SQALIDEYAT KLNHTGREYL ERLRKSARRL
TDMIDALLEL SRHTRGKLVK ESVDLTSIAK EVAAQLARSQ PERKVSMEVA EGMMVRGDSR
LLKVVLEQLI GNAWKFSQPR GEEALVEVFP TELEGRPACA VRDNGVGFEM EYADKLFSPF
QRLHAQDEFP GRGIGLAIAK RIITRHGGKM EAQAELGKGA TFTFSV