Gene GM21_2098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2098 
Symbol 
ID8137434 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2440037 
End bp2441680 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content63% 
IMG OID644869713 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_003021908 
Protein GI253700719 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.000413163 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCGGATG AGCCCACCCG GGGTAGCGAG GAGATCCCCC TGCCGCGCGA GGTCGGCGGC 
ATAGCCCTTT TGTACGTGGA GGACGAGGCG GACGCGAGGG TCATGGTGGC GAAGATGCTC
AGCATGAACT ACCCCCACCT CACCATCTTT CACGCCGACA ACGGCGCCAC AGGGCTCGAG
CTTTACCGGC AGCGGCGCCC CGACATCGTG ATGACCGACA TCAACATGCC TGTCATGGAC
GGGATCAGGA TGTCCCGGGA AATCAAGTCC CTCAACCCTG AAGTCCACAT CATCGCAGTA
ACCGCCCACA GCGACACTTC CTACCTTTTA AACGCCATCG AAATCGGCGT GCACAACTAC
GTGCTGAAGC CGCTCAACTA CCAGGAACTA TTCGCCGTGC TCGACAAGGT GCTGGAGCAG
GTGTCGCTGA AGCGCCTGGT CGGGGAGCAG AACCAGCGCA TCGTCGAGAG CGAGCAGCAG
TTGGCGGGCT CCCAGCGCAT CGCCCACCTG GGAAGTTGGG AGTGGCACGT GGAGTGCGGC
AGCATGAAAT GGTCGGACGA ACTCTACCGC ATCTGCGGCC TCGATCCCGG CTCCCTCACC
CCTGACTACC AGCGTTTCCT GGAGCTTTTC GCGGTCGAAG ACCGCCCGGT CATCGAGGGG
CTCATGCATG ACGCCCTGAA GGGAGAGGCG GAGGAGGATC ACCGTTTCTG CCGCGTGCTC
CGCCCAGACA ACTCCAGAAG GATCGTCCGG GTGGAAGCGG ACCTCTTGAC GGAGGAGCGC
GGCCTAAAGG CGACGGTGAT CGGCACCTGC CACGACGTGA CCGAGTTGAA GGAGGCGGAG
GAACAGGTTC GGCTTTTGAC CGAGGACCTG GAGCGGCGCG TGATCCAGCG CACCTCGCTT
TTGCAGGCGA CGGTGAGCGA GTTGGAGAGT TTCAGCTACT TCGTCTCGCA CGACCTGCGC
GCGCCGGTGG CGCGGCTGGA AGGGTACTGC AAGGCGCTTT TGGAGGACTG CGGGGACGAG
GGGGAAAACA ACTGCCGGCA GTACGCCGAG CGCGCCGAGC ACGTGACGCA GCAGATTAAG
CAGATCATCG GCGCCTTCAA CAGCCTGACC CACTACGCGC GCTGCGCCCT GGCCATAGGC
GAGACCGACC TGAGCGCGAT GGTACGGGAG GTGGCCCAAG AGCTGCAACA GGGCGAGCCG
CAGCGCCGGG TCGAGGTTCT GGTGGCGCCC GGTATCAGGG TGCGGGGGGA CGCGGAACTG
CTGCGCACCG CCGTGAGCGA GCTTTTGAAA AACGCCTGGA AGTTCACCTC CAAAACGGAG
TACGCCCGCA TCGAGTTCGG CAGGCGCGAG CAGGAAGGGG TCTCGGTCTG CTACGTGAAG
GACAACGGGG CCGGTTTCAA CATGAAGTAC GCCGACAAGC TCTTCAAACC TTTTCAGACC
ATCCACACCC CAGGCGAGTT CGATTGGAAC GGCACCGGGA TCGGTCTCGC CACGGTTCAC
AGCATCATCC TGCGTCACGG CGGCCGGGTC TGGGCTGAGG GAGAGGTCGG CTACGGAGCG
ACCTTCAGCT TCACCCTGGA ACCCAATCCG GAAACGGGTA GCTACCTGAC GGAGGGGCTT
TCCTCCCGGG AACAAAGGGG CTAG
 
Protein sequence
MPDEPTRGSE EIPLPREVGG IALLYVEDEA DARVMVAKML SMNYPHLTIF HADNGATGLE 
LYRQRRPDIV MTDINMPVMD GIRMSREIKS LNPEVHIIAV TAHSDTSYLL NAIEIGVHNY
VLKPLNYQEL FAVLDKVLEQ VSLKRLVGEQ NQRIVESEQQ LAGSQRIAHL GSWEWHVECG
SMKWSDELYR ICGLDPGSLT PDYQRFLELF AVEDRPVIEG LMHDALKGEA EEDHRFCRVL
RPDNSRRIVR VEADLLTEER GLKATVIGTC HDVTELKEAE EQVRLLTEDL ERRVIQRTSL
LQATVSELES FSYFVSHDLR APVARLEGYC KALLEDCGDE GENNCRQYAE RAEHVTQQIK
QIIGAFNSLT HYARCALAIG ETDLSAMVRE VAQELQQGEP QRRVEVLVAP GIRVRGDAEL
LRTAVSELLK NAWKFTSKTE YARIEFGRRE QEGVSVCYVK DNGAGFNMKY ADKLFKPFQT
IHTPGEFDWN GTGIGLATVH SIILRHGGRV WAEGEVGYGA TFSFTLEPNP ETGSYLTEGL
SSREQRG