Gene Gura_2023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_2023 
Symbol 
ID5166094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp2359242 
End bp2362307 
Gene Length3066 bp 
Protein Length1021 aa 
Translation table11 
GC content55% 
IMG OID640549517 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_001230786 
Protein GI148264080 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0168563 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTCCCC TGCCCGAACT TACCATCGCT CAGTTCAAGT CTCGCATCAG ATTTTTTGTC 
GTACTCCTCA TCATCGCCGT CATCGGTATT CTGGCTTGGC ATATCGATTC CGAACGCAAC
GCCATCATCA CAGCCGCCGA ACTGCAATCA CAAGGCTACT CCCGTGCACT GAGCGAACAT
GCCGGCAGTG CCTTTGCCGA GGCTGACCGT GCTTTGAGGG AAGTGGTCAA CGATATTGCC
GAACGGGGAG GCATCGACCA CATTGAACGA CGCACGCTGT TCAACATTAT CAGGCGGCAG
GGTGGCGACA CGCCGCAGAT AGGCTCCATC TTCCTGACCG ACCGCTCCGG CATCATGGTA
GTCAATTCCC TTGAATTTCC ATCAAAACAG ATCAATGTCT CTGATCGTGA GTATTTTATT
TTTGACCGGG ATAAACCCGA CACAGGCCTC TTCATCAGCC GTCCGTTAAT CAGCAGACTC
GTAAACCGCT GGCGTTTTAC CATGACGCGG CCGCTTACCT ACCCTGACGG CAGGTTCGCC
GGTCTTGCGG CAGTTGCCTT CGAGATTAAC TACTTCCACC GTTTCTACAC CTCCATCAGC
CTCGGCCCCC GTGGCAAGGT GCAGCTGATT CGAACCGATG GCTCGCCGTT GTTGAGCGAG
CCTTTCACAG AGAACGCCTT TGCGGTCGAT TTCAAACGAT CGGCCCTTTT CCGGGATCAC
CTTGGCACGA CACAATCCGG CACCTTCCAT GATAATAAAA ACATAGAGGA CCAGTCGCCG
CGCATCGTCT CATATCATCG TCTCTCACGC TTTCCGGTCG TGGCAGTTGT CTCGTTGCAC
CGGGACGACA TCCTGACGCC ATGGAAACGT AAAGTTACAT ATGAAACAGC CACGACCCTG
GGTCTTTGCC TTGCTCTTTT TTTGCTGATG CAACTTCTCT TGCGCCATCT CGACCAGTTG
CAGGCGACGC AGAACTCGCT GCGGGAACAG CAGGAACAGG TTCGGATCAA GGCTGCACAG
ATCGATTCGG CCAATGACGC CATACTGCTG ATGGATACCG ATGGACGGCT GGTACATTTC
AACAACGCCC TCTGCCAGAT GATCGGATAT GACCAGAACG AGCTTCAGGG GGCACTGCTG
CACGACTTCG AACCACCGGA GTTTGCAGCC CGCCTCGTAC CCGCAATCAG TTCAATCATG
GAGCATGGCG AAGCGATTTT CGAATCCGCA TATCTCACCA AAAGAGGCGC GATCTTGCCC
GTTGAGGTCC ATGCCCGGAC AATGGAGAGC GAGGGAACAA AACGTATCCT CAGCATCGTC
AGGGACATCA GCGAGCGTAA ACATAGCGAA CTCCGGGAAC AGACGAGACT GCGGATACTA
GAAGAGATGG CCACCGGCGT AGACCTTTCT GAACTGCTCA CGAACATAGT CCGTTTTGTG
GAGCAAGAAC GAAAGGGGTT GCTCTGCTCC ATATCGATCG CGGATGAAAC CGGCCAGTTC
CTGCGCCATG GCGCCGCTCC GAGCCTCCCC CAGTTCTACA ACAAGGCGGT CGACGGAGTA
CGGATTGCCG AGGGGATGGG GTGTTGCGGA ACGGCGGCGC ATCGCCGCCA ACGGGTAGTT
GTGGAGGATG TGGATGGAGA TCCTTTATGG AAGGGTTTCA GGCCCGTCAG CGAGGCGGGC
CTGCGAGCCT GCTGGTCGGA GCCGGTGAAA TCGTCGCAGG GCGAGCTGCT GGGAACTTTT
GCCTGTTACC ATCGCGAGCG GCACGCCCCC GACGAAACCG AGATCCAGTT GATCGAGTCT
GCCGCGCACC TCGCCAGTAT CGCCATTGAG CGTTTCAAGT CGGAAGAACT GAAGAGCCAA
CTTGAGGCAC AACTGCATCA CGTGCAGAAA ATCGAGGCGA TCGGCCAACT GGCGGGCGGA
ATTGCTCACG ATTTCAACAA CCTCCTGACT CCCATCATCG GCTATGCGGA TATGATCCAG
CACAAGCTGC CAGAGGGAGA CCCGTTGTCA GGCAAGGTGA ACGGCATCAC GGCTGCCGCC
TTCAAGGCGC GCGACCTGAC TCAGCAACTG CTCAGCTTTG GCCGCAAGCA GATGCTCGAC
ATGAAGTCCG TGGATTTGAA CCAGGTACTC GGCAACTTTC AGGACATTCT GCGGCGAACC
ATCCGGGAAA ATATCACCAT CGATATCCGC CTGGCATCGG GAGGAATTGC GATCTGGGCA
GACCGTGGAC AGATCGAACA GATTCTTCTG AACCTGGTCG TCAATGCCCA GGACGCCATA
TCGGGCAACG GTATGATCCT GATGGAAACG GGCCACGTAA TGCTGGACAA TGAGTACACC
AGGCTGCATC CCGGCGTGCA GCCCGGCCCC TATGCGCTAC TGGACTTCAG CGACAACGGC
TGCGGCATGA ATGATGAAAC TCTAAGCCAT ATCTTCGAGC CGTTTTTCAC CACCAAGCAG
GTTGGACACG GCACAGGACT CGGCCTGGCA ACCGTCTACG GCATCATCAA GCAGCATGAA
GGATACATTT CCGTCAAGAG CCGTGTCGGA GAAGGAACAA CCTTCAGCAT TTACCTGCCG
CTCAGCAGGG AACCAATCAC GACAACGTCT GCCGACTCCG TTGTTACGGC GACTACGGAC
GGGGCAACAA CGGGAGAACG AACCATCCTT GTCGTCGAGG ACAATGAGAT GGTCCGCACC
ATGACTGTCG AACTGCTGGA ATCATCCGGT TACCGAGTAT TGGTGGCGGA CCGTCCATCT
GCGGCTGTAG AATTGATGAA CCGGTATGGC AACAGTGTTG CGCTGCTGGT TTCAGATGTC
ATCATGCCGG AAATGAGCGG CCAGGAACTG CACGAAAACC TGCTTGAAAC TTTTCCCGAC
CTTAAAGTCC TTTACATCTC CGGCTACACG AACGAACTAT TCGTGCATAG GGGAATGCTT
GAGGAAGGGG TCAACTTCTT GCAGAAACCG TTCACCACTG AAAAGCTGCT CAAAGAAGTT
CAACGCATCG CAAGTGAGGC AGAAGAACCG GCAAATCAAC TTTCTTTCAA GCTGAATCCT
TGCTGA
 
Protein sequence
MLPLPELTIA QFKSRIRFFV VLLIIAVIGI LAWHIDSERN AIITAAELQS QGYSRALSEH 
AGSAFAEADR ALREVVNDIA ERGGIDHIER RTLFNIIRRQ GGDTPQIGSI FLTDRSGIMV
VNSLEFPSKQ INVSDREYFI FDRDKPDTGL FISRPLISRL VNRWRFTMTR PLTYPDGRFA
GLAAVAFEIN YFHRFYTSIS LGPRGKVQLI RTDGSPLLSE PFTENAFAVD FKRSALFRDH
LGTTQSGTFH DNKNIEDQSP RIVSYHRLSR FPVVAVVSLH RDDILTPWKR KVTYETATTL
GLCLALFLLM QLLLRHLDQL QATQNSLREQ QEQVRIKAAQ IDSANDAILL MDTDGRLVHF
NNALCQMIGY DQNELQGALL HDFEPPEFAA RLVPAISSIM EHGEAIFESA YLTKRGAILP
VEVHARTMES EGTKRILSIV RDISERKHSE LREQTRLRIL EEMATGVDLS ELLTNIVRFV
EQERKGLLCS ISIADETGQF LRHGAAPSLP QFYNKAVDGV RIAEGMGCCG TAAHRRQRVV
VEDVDGDPLW KGFRPVSEAG LRACWSEPVK SSQGELLGTF ACYHRERHAP DETEIQLIES
AAHLASIAIE RFKSEELKSQ LEAQLHHVQK IEAIGQLAGG IAHDFNNLLT PIIGYADMIQ
HKLPEGDPLS GKVNGITAAA FKARDLTQQL LSFGRKQMLD MKSVDLNQVL GNFQDILRRT
IRENITIDIR LASGGIAIWA DRGQIEQILL NLVVNAQDAI SGNGMILMET GHVMLDNEYT
RLHPGVQPGP YALLDFSDNG CGMNDETLSH IFEPFFTTKQ VGHGTGLGLA TVYGIIKQHE
GYISVKSRVG EGTTFSIYLP LSREPITTTS ADSVVTATTD GATTGERTIL VVEDNEMVRT
MTVELLESSG YRVLVADRPS AAVELMNRYG NSVALLVSDV IMPEMSGQEL HENLLETFPD
LKVLYISGYT NELFVHRGML EEGVNFLQKP FTTEKLLKEV QRIASEAEEP ANQLSFKLNP
C