Gene GM21_2195 details


Gene Information

Locus tag: GM21_2195
Symbol:
ID: 8137531
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2563905
End bp: 2565413
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 63%
IMG OID: 644869810
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_003022005
Protein GI: 253700816
COG category: [T] Signal transduction mechanisms
COG ID: [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase)
TIGRFAM ID: [TIGR00229] PAS domain S-box
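
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so; the numbers are consistent with that reading) and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2563905
end_bp = 2565413
gene_length_bp = 1509
protein_length_aa = 502

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last codon is assumed to be
# the stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span_bp, remainder, expected_protein_length)  # 1509 0 502
```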


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 62
Fosmid unclonability p-value: 0.067402
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGGAT TAGCCATCGA AGCCCCCGGC TTCACCCTTT TGTACGTGGA GGATGAGCAG 
GTCACCAGGG AGACGGTGCT CTCCCTTTTG CAGCGGCGCT TTCCCAAGCT CGCCCTTTTA
AGCGCGACAA ACGGCGCCGA AGGCATGGCG CTTTTCCGGG AGCACGCGCC CGACCTCGTG
GTGACCGACA TCAAGATGCC GGCCTTGAGC GGCATCGACA TGGCCCGGGA GATGATGGAG
CTGAAACCGT CCCTTCCGGT GATCGTCACC AGCGCTCACT CCGACATGGA GTACCTGATC
GAGTCGATCG AGCTGGGGAT CAGCCGCTAC GTCCTGAAGC CGATAGAAAG CTCGAAGCTC
TTCGCCGCGG TGGAAAGCGC TCTGACGGCG CTCGGCAGGG AAAGGGAGCT GCAGGCCCAC
CAGGCGTTCG TGCGCAGGCT CTCGCGGGCT GTGGAGCAGA GCGCCAGCAG CATCGTCATC
GCCAACCCGG AAGGGGTGAT CGACTACGTG AACCCGAGGT TCACCACGCT GACCGGCTAC
GAGGCCTGCG AGGTGCTGGG AAAGAACCTG AAGGCGCTGA AGGGAAAAGC CGAACTCTGG
GACCGGATCG CAGCCGGTGA GGAGTGGCAC GGCGAGTTCG AGGGGGTGAA GAAAAACGGC
GAGATTTTCT ACGAGTCCAC CTCCCTCTCC CCCGTCTACG ACGAGACCGG CGCTCTCAGC
GATCTGGTCG CGGTGCAGGA GGAGATCACC GAGAGGGTGC TGTCCGCGCG CAGGATCGAG
GCGTTGAACC GAAGCCTCGC GGCGCGCGCC GAGGAACTGG AACTCGCCAA CCGGGACCTG
GAAGGGTTCA GCTACACGGT CTCGCACGAC CTGCGCACGC CGCTGACCAA CATCAACGGC
TACTGCCAGG TGATACTGGA GCTTTACGGC GCGACGCTGG ACGAGCAGTG CAAGGATTTC
ATCAACATCA TCTTCGACGA AACCGTCAAC ATGAACCGGT TGATCAAGAC GCTGCTCGAA
TTCTCCCGGG TGAGCCGCAG CGAGATGACC CGGTCGCAGA TTGACTTAAG CCAGCTTACC
TCCCTGGTCT GCGCCTCCCA GCAACTGAGC GAACCGCAGC GGCGGGTGAC CTTCAGCATA
GCTCCCGGGA TGACGGCCCT GGGGGACCCC GACCTTTTGA AGGTCGTGCT GCAAAACCTG
ATCTCAAACG CCTGCAAATA CTCGGCGAAC CGCGAAGACG CCGTGGTCGA AATAGCCTCC
CTGGACGAGA GCGGGGAACT CGTTTACTTC GTGCGCGACA ACGGCGCCGG CTTCGACATG
ACGCTGGCCG ACAAACTGTT CAGCCCCTTC CAGCGGCTTC ACTCCGAGCG GGATTTCAAG
GGGTTCGGCA TAGGGCTTGC CACCGTGCAG CGCATCATCC AGCGCCACGG CGGCAGGATC
TGGGCCGAGG GAGAGGTGGG GCTCGGGGCC TGCTTCTACT TCACCCTCCC CTCCCCGACC
GAGGCATAG
 
Protein sequence
MGGLAIEAPG FTLLYVEDEQ VTRETVLSLL QRRFPKLALL SATNGAEGMA LFREHAPDLV 
VTDIKMPALS GIDMAREMME LKPSLPVIVT SAHSDMEYLI ESIELGISRY VLKPIESSKL
FAAVESALTA LGRERELQAH QAFVRRLSRA VEQSASSIVI ANPEGVIDYV NPRFTTLTGY
EACEVLGKNL KALKGKAELW DRIAAGEEWH GEFEGVKKNG EIFYESTSLS PVYDETGALS
DLVAVQEEIT ERVLSARRIE ALNRSLAARA EELELANRDL EGFSYTVSHD LRTPLTNING
YCQVILELYG ATLDEQCKDF INIIFDETVN MNRLIKTLLE FSRVSRSEMT RSQIDLSQLT
SLVCASQQLS EPQRRVTFSI APGMTALGDP DLLKVVLQNL ISNACKYSAN REDAVVEIAS
LDESGELVYF VRDNGAGFDM TLADKLFSPF QRLHSERDFK GFGIGLATVQ RIIQRHGGRI
WAEGEVGLGA CFYFTLPSPT EA
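
The relationship between the two sequences above can be illustrated with a minimal translation sketch. It is not the database's own method. The helper name `translate` is hypothetical, and the codon table is built from the conventional TCAG ordering of the standard genetic code, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in its permitted start codons, which this sketch ignores).

```python
from itertools import product

# Standard codon table in TCAG order; translation table 11 uses the
# same codon-to-amino-acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks used in the
    record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGGGCGGAT TAGCCATCGA AGCCCCCGGC"))  # MGGLAIEAPG
```

Passing the full gene sequence to the same function should yield the protein sequence listed above, without the spaces.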