Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2098 |
Symbol | |
ID | 8137434 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2440037 |
End bp | 2441680 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644869713 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_003021908 |
Protein GI | 253700719 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.000413163 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCGGATG AGCCCACCCG GGGTAGCGAG GAGATCCCCC TGCCGCGCGA GGTCGGCGGC ATAGCCCTTT TGTACGTGGA GGACGAGGCG GACGCGAGGG TCATGGTGGC GAAGATGCTC AGCATGAACT ACCCCCACCT CACCATCTTT CACGCCGACA ACGGCGCCAC AGGGCTCGAG CTTTACCGGC AGCGGCGCCC CGACATCGTG ATGACCGACA TCAACATGCC TGTCATGGAC GGGATCAGGA TGTCCCGGGA AATCAAGTCC CTCAACCCTG AAGTCCACAT CATCGCAGTA ACCGCCCACA GCGACACTTC CTACCTTTTA AACGCCATCG AAATCGGCGT GCACAACTAC GTGCTGAAGC CGCTCAACTA CCAGGAACTA TTCGCCGTGC TCGACAAGGT GCTGGAGCAG GTGTCGCTGA AGCGCCTGGT CGGGGAGCAG AACCAGCGCA TCGTCGAGAG CGAGCAGCAG TTGGCGGGCT CCCAGCGCAT CGCCCACCTG GGAAGTTGGG AGTGGCACGT GGAGTGCGGC AGCATGAAAT GGTCGGACGA ACTCTACCGC ATCTGCGGCC TCGATCCCGG CTCCCTCACC CCTGACTACC AGCGTTTCCT GGAGCTTTTC GCGGTCGAAG ACCGCCCGGT CATCGAGGGG CTCATGCATG ACGCCCTGAA GGGAGAGGCG GAGGAGGATC ACCGTTTCTG CCGCGTGCTC CGCCCAGACA ACTCCAGAAG GATCGTCCGG GTGGAAGCGG ACCTCTTGAC GGAGGAGCGC GGCCTAAAGG CGACGGTGAT CGGCACCTGC CACGACGTGA CCGAGTTGAA GGAGGCGGAG GAACAGGTTC GGCTTTTGAC CGAGGACCTG GAGCGGCGCG TGATCCAGCG CACCTCGCTT TTGCAGGCGA CGGTGAGCGA GTTGGAGAGT TTCAGCTACT TCGTCTCGCA CGACCTGCGC GCGCCGGTGG CGCGGCTGGA AGGGTACTGC AAGGCGCTTT TGGAGGACTG CGGGGACGAG GGGGAAAACA ACTGCCGGCA GTACGCCGAG CGCGCCGAGC ACGTGACGCA GCAGATTAAG CAGATCATCG GCGCCTTCAA CAGCCTGACC CACTACGCGC GCTGCGCCCT GGCCATAGGC GAGACCGACC TGAGCGCGAT GGTACGGGAG GTGGCCCAAG AGCTGCAACA GGGCGAGCCG CAGCGCCGGG TCGAGGTTCT GGTGGCGCCC GGTATCAGGG TGCGGGGGGA CGCGGAACTG CTGCGCACCG CCGTGAGCGA GCTTTTGAAA AACGCCTGGA AGTTCACCTC CAAAACGGAG TACGCCCGCA TCGAGTTCGG CAGGCGCGAG CAGGAAGGGG TCTCGGTCTG CTACGTGAAG GACAACGGGG CCGGTTTCAA CATGAAGTAC GCCGACAAGC TCTTCAAACC TTTTCAGACC ATCCACACCC CAGGCGAGTT CGATTGGAAC GGCACCGGGA TCGGTCTCGC CACGGTTCAC AGCATCATCC TGCGTCACGG CGGCCGGGTC TGGGCTGAGG GAGAGGTCGG CTACGGAGCG ACCTTCAGCT TCACCCTGGA ACCCAATCCG GAAACGGGTA GCTACCTGAC GGAGGGGCTT TCCTCCCGGG AACAAAGGGG CTAG
|
Protein sequence | MPDEPTRGSE EIPLPREVGG IALLYVEDEA DARVMVAKML SMNYPHLTIF HADNGATGLE LYRQRRPDIV MTDINMPVMD GIRMSREIKS LNPEVHIIAV TAHSDTSYLL NAIEIGVHNY VLKPLNYQEL FAVLDKVLEQ VSLKRLVGEQ NQRIVESEQQ LAGSQRIAHL GSWEWHVECG SMKWSDELYR ICGLDPGSLT PDYQRFLELF AVEDRPVIEG LMHDALKGEA EEDHRFCRVL RPDNSRRIVR VEADLLTEER GLKATVIGTC HDVTELKEAE EQVRLLTEDL ERRVIQRTSL LQATVSELES FSYFVSHDLR APVARLEGYC KALLEDCGDE GENNCRQYAE RAEHVTQQIK QIIGAFNSLT HYARCALAIG ETDLSAMVRE VAQELQQGEP QRRVEVLVAP GIRVRGDAEL LRTAVSELLK NAWKFTSKTE YARIEFGRRE QEGVSVCYVK DNGAGFNMKY ADKLFKPFQT IHTPGEFDWN GTGIGLATVH SIILRHGGRV WAEGEVGYGA TFSFTLEPNP ETGSYLTEGL SSREQRG
|
| |