Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_4132 |
Symbol | |
ID | 8139506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 4722972 |
End bp | 4724831 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644871747 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_003023905 |
Protein GI | 253702716 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 79 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAACC GGATTCCCAC AGACAAAGAG ACCCTGGAGC AGCAGTTGGC TGCCCTGCGG AGCGAGAACG AGGAACTGGC CCAACAGGTC AAACGGCTGA TCCGGGCGGA AGGAAAGCTC TACGAGTACC AGCAGGTGCT CGACCTCCAG TTGATCGAAT ACAAGGGGCT TTATGACCTG AGCCGGAGGC TGAGCGGGAG CTTCGACATC CAGACCCTGT TCCGCGATAC AGTGCAGTAC GTGGTCCAGC AGCTCGAATA CGAGCGCGCC ATCCTGCTCC GCCGCGAGGA ATCCTTTACC TATAGCGTCT TCGCCCTGGA CGGCTACTAT GATCCCAGCG AAAAGGAGCA GGCCGCCCTT ATCACCATGC GGTACGGCGC CCCCTGCCTC TCCCCCCTTC TTGCCGGCAG AGAGCACGTC ACCTGCAGCG CAACCTCGAC GGAACCCGGA AACGGATGCC GGCGGCGCCT CCTGATGGAC GAGTTCCTGG TATACCCCCT GGGCCACGAC GAGATACCCC ATGCCCTGCT GGTGGTAGGG AACACCTCGG CCAACGCCCC GTTTCACCGG CGGGTGGAGG AGAGCGACCA AGCTCTCTTG AGCATGGGCA ATCTGGTCGG CCTCGTCTCC TCTTTATTGG ACACCCACAT ATTCTTCGAG CGGATGATAG AAGCACGCGA GCAGGAACGT GTCGCCGAGG CGAAGTACCG CAGCCTTTTC GAGAACGCGG CGGAGGGCAT CTTCCGCAGG ACCCCCGAGG GAAAGTACCT GGACGCTAAC CCCGCCCTGG CGCATATGCT GGGCTATGCC TCGCCCGAGG AACTGGTCGC CTCCGTCACC GACATCGGCT CCCAGGTCTA CGTGAATCCC GCCTCATATG CCGAGATGCA GAGGGTGCTG TCGGCGCACG GCAAGGCCGA GGGATTCGAG ACGCAGGTCT ACCGCAAGGA TGGCAGCGTC ATCTGGGTAT CCCTTAGCCT GCGCGCGGTG CGCGACAGCT ATGGGAAGGT CCTCTTCTAC GAGGGGATGT CCGAGGAGAT CACCAAGCGC AAGATCGCGG AGGCAGCCCT GCGCGAGAGC GAACAGAAGT ACCGCCAGTT GAGCGAGGCG CTGGAGCGGC GCGTGAAACA GGCGGTCGAC GAGCTACGCC AGAAGGACAA GATGCTTATC ATGCAGGGGC GGCAGGCTGT GATGGGGGAA ATGCTGAGCA ATATCGCCCA CCAGTGGCGC CAACCGTTGA ACATGCTTGC CTTGCTGGTC CAGGACGTCC AACTGACCCA CAGGCAATCC GGGCTCAGCG ACGACTTCAT CGAGCGGAAC GTCAAAAGGA GCATGGAGAT CATCCAGCAG ATGTCCCGGA CCATCGACGA TTTCAGGTAT TTCTACCGCC CCGACCGGGA AAAGCTGGAG TTCGCGGTGA GCGAGCCGCT GGAAAAAGCG CTGGGATTGT TGGAGGGGAG CTTCAGGACC AACAGCATCG AGATCCAGGT CCTCAAAAGC GGCGAACCCG CCATCAGGGG GTACCTCGGA GAGTTCGTGC AGGTACTGCT AAACATCCTG ATCAACGCGC GCGACGCGCT CATCGCCAGC CACGCCGCTT CGCCGCTCAT CACCGTCAGG CTCCACGAAG AAGGAGGGGA AACGGTGGTG AGCATCGCGG ACAACGCCGG AGGCATCCCC GACGGGATCA AGGAGAAGAT CTTCGAACCC TACTTCACCA CCAAAGGGCC CGACCAGGGA ACCGGCATCG GCCTTTTCAT GTGCAAGACC ATCATCGAGA AGAGCATGAA CGGCAGGCTA ATCGCCAGAA ACAGCGGCGA GGGAGCCGAG TTCGTCATCA CCGTCCCCAA AACTCCCTGA
|
Protein sequence | MKNRIPTDKE TLEQQLAALR SENEELAQQV KRLIRAEGKL YEYQQVLDLQ LIEYKGLYDL SRRLSGSFDI QTLFRDTVQY VVQQLEYERA ILLRREESFT YSVFALDGYY DPSEKEQAAL ITMRYGAPCL SPLLAGREHV TCSATSTEPG NGCRRRLLMD EFLVYPLGHD EIPHALLVVG NTSANAPFHR RVEESDQALL SMGNLVGLVS SLLDTHIFFE RMIEAREQER VAEAKYRSLF ENAAEGIFRR TPEGKYLDAN PALAHMLGYA SPEELVASVT DIGSQVYVNP ASYAEMQRVL SAHGKAEGFE TQVYRKDGSV IWVSLSLRAV RDSYGKVLFY EGMSEEITKR KIAEAALRES EQKYRQLSEA LERRVKQAVD ELRQKDKMLI MQGRQAVMGE MLSNIAHQWR QPLNMLALLV QDVQLTHRQS GLSDDFIERN VKRSMEIIQQ MSRTIDDFRY FYRPDREKLE FAVSEPLEKA LGLLEGSFRT NSIEIQVLKS GEPAIRGYLG EFVQVLLNIL INARDALIAS HAASPLITVR LHEEGGETVV SIADNAGGIP DGIKEKIFEP YFTTKGPDQG TGIGLFMCKT IIEKSMNGRL IARNSGEGAE FVITVPKTP
|
| |