Gene Information |
Locus tag | Gura_2023 |
Symbol | |
ID | 5166094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | - |
Start bp | 2359242 |
End bp | 2362307 |
Gene Length | 3066 bp |
Protein Length | 1021 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640549517 |
Product | multi-sensor hybrid histidine kinase |
Protein accession | YP_001230786 |
Protein GI | 148264080 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
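The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon. Both assumptions are consistent with the values in this record, but the record does not state them. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2359242
end_bp = 2362307

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is the stop codon and
# is assumed not to be counted in the reported protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, codons, protein_length)
```

Under these assumptions the computed gene length and protein length agree with the reported 3066 bp and 1021 aa.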
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0168563 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTCCCC TGCCCGAACT TACCATCGCT CAGTTCAAGT CTCGCATCAG ATTTTTTGTC GTACTCCTCA TCATCGCCGT CATCGGTATT CTGGCTTGGC ATATCGATTC CGAACGCAAC GCCATCATCA CAGCCGCCGA ACTGCAATCA CAAGGCTACT CCCGTGCACT GAGCGAACAT GCCGGCAGTG CCTTTGCCGA GGCTGACCGT GCTTTGAGGG AAGTGGTCAA CGATATTGCC GAACGGGGAG GCATCGACCA CATTGAACGA CGCACGCTGT TCAACATTAT CAGGCGGCAG GGTGGCGACA CGCCGCAGAT AGGCTCCATC TTCCTGACCG ACCGCTCCGG CATCATGGTA GTCAATTCCC TTGAATTTCC ATCAAAACAG ATCAATGTCT CTGATCGTGA GTATTTTATT TTTGACCGGG ATAAACCCGA CACAGGCCTC TTCATCAGCC GTCCGTTAAT CAGCAGACTC GTAAACCGCT GGCGTTTTAC CATGACGCGG CCGCTTACCT ACCCTGACGG CAGGTTCGCC GGTCTTGCGG CAGTTGCCTT CGAGATTAAC TACTTCCACC GTTTCTACAC CTCCATCAGC CTCGGCCCCC GTGGCAAGGT GCAGCTGATT CGAACCGATG GCTCGCCGTT GTTGAGCGAG CCTTTCACAG AGAACGCCTT TGCGGTCGAT TTCAAACGAT CGGCCCTTTT CCGGGATCAC CTTGGCACGA CACAATCCGG CACCTTCCAT GATAATAAAA ACATAGAGGA CCAGTCGCCG CGCATCGTCT CATATCATCG TCTCTCACGC TTTCCGGTCG TGGCAGTTGT CTCGTTGCAC CGGGACGACA TCCTGACGCC ATGGAAACGT AAAGTTACAT ATGAAACAGC CACGACCCTG GGTCTTTGCC TTGCTCTTTT TTTGCTGATG CAACTTCTCT TGCGCCATCT CGACCAGTTG CAGGCGACGC AGAACTCGCT GCGGGAACAG CAGGAACAGG TTCGGATCAA GGCTGCACAG ATCGATTCGG CCAATGACGC CATACTGCTG ATGGATACCG ATGGACGGCT GGTACATTTC AACAACGCCC TCTGCCAGAT GATCGGATAT GACCAGAACG AGCTTCAGGG GGCACTGCTG CACGACTTCG AACCACCGGA GTTTGCAGCC CGCCTCGTAC CCGCAATCAG TTCAATCATG GAGCATGGCG AAGCGATTTT CGAATCCGCA TATCTCACCA AAAGAGGCGC GATCTTGCCC GTTGAGGTCC ATGCCCGGAC AATGGAGAGC GAGGGAACAA AACGTATCCT CAGCATCGTC AGGGACATCA GCGAGCGTAA ACATAGCGAA CTCCGGGAAC AGACGAGACT GCGGATACTA GAAGAGATGG CCACCGGCGT AGACCTTTCT GAACTGCTCA CGAACATAGT CCGTTTTGTG GAGCAAGAAC GAAAGGGGTT GCTCTGCTCC ATATCGATCG CGGATGAAAC CGGCCAGTTC CTGCGCCATG GCGCCGCTCC GAGCCTCCCC CAGTTCTACA ACAAGGCGGT CGACGGAGTA CGGATTGCCG AGGGGATGGG GTGTTGCGGA ACGGCGGCGC ATCGCCGCCA ACGGGTAGTT GTGGAGGATG TGGATGGAGA TCCTTTATGG AAGGGTTTCA GGCCCGTCAG CGAGGCGGGC CTGCGAGCCT GCTGGTCGGA GCCGGTGAAA TCGTCGCAGG GCGAGCTGCT GGGAACTTTT GCCTGTTACC ATCGCGAGCG GCACGCCCCC GACGAAACCG AGATCCAGTT GATCGAGTCT 
GCCGCGCACC TCGCCAGTAT CGCCATTGAG CGTTTCAAGT CGGAAGAACT GAAGAGCCAA CTTGAGGCAC AACTGCATCA CGTGCAGAAA ATCGAGGCGA TCGGCCAACT GGCGGGCGGA ATTGCTCACG ATTTCAACAA CCTCCTGACT CCCATCATCG GCTATGCGGA TATGATCCAG CACAAGCTGC CAGAGGGAGA CCCGTTGTCA GGCAAGGTGA ACGGCATCAC GGCTGCCGCC TTCAAGGCGC GCGACCTGAC TCAGCAACTG CTCAGCTTTG GCCGCAAGCA GATGCTCGAC ATGAAGTCCG TGGATTTGAA CCAGGTACTC GGCAACTTTC AGGACATTCT GCGGCGAACC ATCCGGGAAA ATATCACCAT CGATATCCGC CTGGCATCGG GAGGAATTGC GATCTGGGCA GACCGTGGAC AGATCGAACA GATTCTTCTG AACCTGGTCG TCAATGCCCA GGACGCCATA TCGGGCAACG GTATGATCCT GATGGAAACG GGCCACGTAA TGCTGGACAA TGAGTACACC AGGCTGCATC CCGGCGTGCA GCCCGGCCCC TATGCGCTAC TGGACTTCAG CGACAACGGC TGCGGCATGA ATGATGAAAC TCTAAGCCAT ATCTTCGAGC CGTTTTTCAC CACCAAGCAG GTTGGACACG GCACAGGACT CGGCCTGGCA ACCGTCTACG GCATCATCAA GCAGCATGAA GGATACATTT CCGTCAAGAG CCGTGTCGGA GAAGGAACAA CCTTCAGCAT TTACCTGCCG CTCAGCAGGG AACCAATCAC GACAACGTCT GCCGACTCCG TTGTTACGGC GACTACGGAC GGGGCAACAA CGGGAGAACG AACCATCCTT GTCGTCGAGG ACAATGAGAT GGTCCGCACC ATGACTGTCG AACTGCTGGA ATCATCCGGT TACCGAGTAT TGGTGGCGGA CCGTCCATCT GCGGCTGTAG AATTGATGAA CCGGTATGGC AACAGTGTTG CGCTGCTGGT TTCAGATGTC ATCATGCCGG AAATGAGCGG CCAGGAACTG CACGAAAACC TGCTTGAAAC TTTTCCCGAC CTTAAAGTCC TTTACATCTC CGGCTACACG AACGAACTAT TCGTGCATAG GGGAATGCTT GAGGAAGGGG TCAACTTCTT GCAGAAACCG TTCACCACTG AAAAGCTGCT CAAAGAAGTT CAACGCATCG CAAGTGAGGC AGAAGAACCG GCAAATCAAC TTTCTTTCAA GCTGAATCCT TGCTGA
|
Protein sequence | MLPLPELTIA QFKSRIRFFV VLLIIAVIGI LAWHIDSERN AIITAAELQS QGYSRALSEH AGSAFAEADR ALREVVNDIA ERGGIDHIER RTLFNIIRRQ GGDTPQIGSI FLTDRSGIMV VNSLEFPSKQ INVSDREYFI FDRDKPDTGL FISRPLISRL VNRWRFTMTR PLTYPDGRFA GLAAVAFEIN YFHRFYTSIS LGPRGKVQLI RTDGSPLLSE PFTENAFAVD FKRSALFRDH LGTTQSGTFH DNKNIEDQSP RIVSYHRLSR FPVVAVVSLH RDDILTPWKR KVTYETATTL GLCLALFLLM QLLLRHLDQL QATQNSLREQ QEQVRIKAAQ IDSANDAILL MDTDGRLVHF NNALCQMIGY DQNELQGALL HDFEPPEFAA RLVPAISSIM EHGEAIFESA YLTKRGAILP VEVHARTMES EGTKRILSIV RDISERKHSE LREQTRLRIL EEMATGVDLS ELLTNIVRFV EQERKGLLCS ISIADETGQF LRHGAAPSLP QFYNKAVDGV RIAEGMGCCG TAAHRRQRVV VEDVDGDPLW KGFRPVSEAG LRACWSEPVK SSQGELLGTF ACYHRERHAP DETEIQLIES AAHLASIAIE RFKSEELKSQ LEAQLHHVQK IEAIGQLAGG IAHDFNNLLT PIIGYADMIQ HKLPEGDPLS GKVNGITAAA FKARDLTQQL LSFGRKQMLD MKSVDLNQVL GNFQDILRRT IRENITIDIR LASGGIAIWA DRGQIEQILL NLVVNAQDAI SGNGMILMET GHVMLDNEYT RLHPGVQPGP YALLDFSDNG CGMNDETLSH IFEPFFTTKQ VGHGTGLGLA TVYGIIKQHE GYISVKSRVG EGTTFSIYLP LSREPITTTS ADSVVTATTD GATTGERTIL VVEDNEMVRT MTVELLESSG YRVLVADRPS AAVELMNRYG NSVALLVSDV IMPEMSGQEL HENLLETFPD LKVLYISGYT NELFVHRGML EEGVNFLQKP FTTEKLLKEV QRIASEAEEP ANQLSFKLNP C
|
| |
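The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA, even though the gene lies on the minus strand), and it uses the amino-acid assignments of translation table 11, which match the standard genetic code. The function names `translate` and `gc_fraction` are hypothetical helpers written for this example, not part of any database tool. Only the first three and the last four sequence blocks from the record are used, to keep the example short.

```python
from itertools import product

# Amino-acid assignments for translation table 11, listed in NCBI order
# (bases T, C, A, G; first codon position varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(blocked_seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(blocked_seq.split()).upper()


def translate(blocked_seq: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    seq = clean(blocked_seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def gc_fraction(blocked_seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(blocked_seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 and last 36 nucleotides of the gene sequence in this record.
head = "ATGCTTCCCC TGCCCGAACT TACCATCGCT"
tail = "GCAAATCAAC TTTCTTTCAA GCTGAATCCT TGCTGA"

print(translate(head))  # MLPLPELTIA
print(translate(tail))  # ANQLSFKLNPC*
```

The two translated fragments match the start (MLPLPELTIA) and end (ANQLSFKLNP C) of the protein sequence listed above. Applying `gc_fraction` to the full gene sequence would give the value that the record rounds to 55%.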