Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU3419 |
Symbol | |
ID | 2688165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 3762534 |
End bp | 3764858 |
Gene Length | 2325 bp |
Protein Length | 774 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637128114 |
Product | sensor histidine kinase |
Protein accession | NP_954459 |
Protein GI | 39998508 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTCGAT TGGTGGGATT GCGCTGTTTC GTTTGTGTAT TGTCGCGCCT GTGGCCGGTC ATTATGCTCG TTGGGGTGTA TTCCGTTCCG GCAAGCGCCG ACTACGAGGC GCGGGCGGCC AACCACATCC TGGTGCTCCA TTCCTATCAC CCCGGGTATC CGTGGACCGA CCAGGTCATG GCGGGAATCC AGGATGTGCT GTCGAATTCC TTCGAACGGA TTCACATGGA CGTGGAATAT CTCGATGTGA AGCGCCACCG GCAGTCCGCC CAGGTTTCCC GCATGTTCGA CGCGGTGTTG CACCACAAGC TGCGGCAACG GTCATTCAAT CTGGTGATCG CATCCGGTAA CGAGGCGCTC GACTACGCTC TCGCCAATCG TCGTGACCTG TTCCGCGGTG CACCTATCGT ATCCTGCGCA ACCGTCGGCC CCGATCCGCT CCCCACCGCC TCGGCCTGGC ATACGGGCGT CAGGGCCGAT CCCGATTTTA CGGGAGTCAT CAGGCAGGCT CTGGCGCTTC ATCCCGGCAC GACCAGAATG ATCGTCATCG GCGGCACCCG TGAGCTGACG GATCGCCTTA ATACCCAGCG ACTCATGGCG GCAAGCCAGG CATTCGCGGG GCGGGTAACG TTCGATTACT GGAATGACCT GTCGGCCGAG GAGATCATCG CACGACTCCA GTCCCCTGTC CGCGGGGCCG TAATCCTCAT CAACGGGTCG ATTCGCGACC GTTCGGGGAA TCTTCTCTCG TTCCACGAGC AGACAGGCCT GCTGCGGCAG GGGGGGCTTC CCCTCTACAG CTTCTGGAAC GTTTTTCTCG GCGAGGGGAT CGTGGGGGGG CCGTTCGTGG ATGTGCACGA GCAGGGGCGA ATTGCCGCGC GGACGGCACT GCGAGTGCTT CATGGGGAGC CGGTGGCCGA TATTCCGGTG GAGCAATCGG TCATCACCGT ACCGACCTTT GACTACGAGG AACTGCAGCG ACTCGGGATA TCTCCCCGGC ACCTGCCGCC AAAGCACATT CTGGTCAACG GACCCAAGCC GTTCTACAAC CTGAGCAAGT CCCAGTTTCT CTGGTCTGTC GCGATTCTGC TCGGCTCCCT GAGCATCACC CTGCTGCTCA CCTGGAGCAT CCTGCTCCGC CGCCGGGCCG AGCTCAGACT GCGCCAGAGC GAGCAGAACT ACCGGCAGCT TTCCCGGCAG TTCGGGATCA TTCTGGACGG CATCCCCGAC AGTCTCACCC TCATTTCCCA CGACATGAAG GTGGTCTGGT CCAACAAGGT GGCCGGTGAT CCCTTTGGGG CGCCGTTGCG GACCGTGCCC GGCGAATACT GTTGCGAGAT GCTCTACAAC CGCACCACTC TCTGTGATAA CTGTCCGGCG GTGAGGGCTT TCGAGTCGGG CGAGAACGAG GAATCCACCA TTACCACGCC CGACGGCAGG ACCCTTGAGG TGAAGGCATT CCCGGTCAGG GAGGGGTCGG AGACGGTGAG CCACGTCATC ATGCTGGCCA GCGACATCTC CGACAAAGTC CGTCTTCTGG AGGAAACGGT CAGGACGAGC CGGCTCGCAT CGCTGGGTGA GCTGGCAGCC GGCGTGGCTC ACGAGATCAA CAACCCCAAT GCGGTGATCC TGCTTAACGT GGACCTGGTC AAAAAGGTGT GCCGGGAAGC AATCCCCCTG CTGCTGGATC GCGTTGACCG GCAGAGCGAG TTGGCGATCG GGGGGATCCC CTGGTCGGAA ATGAGCGAGG AGCTGCCACT CCTGCTGACG GAGATGGAAG AGGGCGCCGG CCGCATCAAG CGGATCGTGG ATGACCTCAA GGACTTCGCC CGCGGCGACG GCGCCGACCA GTGCGAGCCT GTGGACCTGA ACGAGGCGGT CCGGGCCTCG GTCCGCTTGG TGGGCAATGC CATTAAAAAT GCCACCGACC ATTTCGCGCT GGAGCTGGCC CCTGGCCTCC CTTCCTTCGA GGGAAGCATC CAACGGATCG AGCAGGTAGT GGTCAACCTG ATCATGAACG CCTGCCAGTC CCTCCAGGAC AAAACGAAAG GGATCACCGT CAGCACCGGC TACGACCTTA TCCATGGGGT CTACACGATT CAGGTGCGCG ACGAGGGGCA GGGGATTCCC CCGGAGGTCC TTCCCCGCAT CACCGATCCG TTCTTCACCA CAAAGCGCGA AACCGGCGGC ACGGGTCTCG GTCTGTCGAT CTGCATGCGC ATCGTCAGAA GTTATGGCGG CACACTCGAA TTCCAGTCCG TCCCGGGAGC CGGGACAACG GCTACCCTGT CCCTGCCGGC CGAAAAGGAG GTTATTGCCG CATGA
|
Protein sequence | MFRLVGLRCF VCVLSRLWPV IMLVGVYSVP ASADYEARAA NHILVLHSYH PGYPWTDQVM AGIQDVLSNS FERIHMDVEY LDVKRHRQSA QVSRMFDAVL HHKLRQRSFN LVIASGNEAL DYALANRRDL FRGAPIVSCA TVGPDPLPTA SAWHTGVRAD PDFTGVIRQA LALHPGTTRM IVIGGTRELT DRLNTQRLMA ASQAFAGRVT FDYWNDLSAE EIIARLQSPV RGAVILINGS IRDRSGNLLS FHEQTGLLRQ GGLPLYSFWN VFLGEGIVGG PFVDVHEQGR IAARTALRVL HGEPVADIPV EQSVITVPTF DYEELQRLGI SPRHLPPKHI LVNGPKPFYN LSKSQFLWSV AILLGSLSIT LLLTWSILLR RRAELRLRQS EQNYRQLSRQ FGIILDGIPD SLTLISHDMK VVWSNKVAGD PFGAPLRTVP GEYCCEMLYN RTTLCDNCPA VRAFESGENE ESTITTPDGR TLEVKAFPVR EGSETVSHVI MLASDISDKV RLLEETVRTS RLASLGELAA GVAHEINNPN AVILLNVDLV KKVCREAIPL LLDRVDRQSE LAIGGIPWSE MSEELPLLLT EMEEGAGRIK RIVDDLKDFA RGDGADQCEP VDLNEAVRAS VRLVGNAIKN ATDHFALELA PGLPSFEGSI QRIEQVVVNL IMNACQSLQD KTKGITVSTG YDLIHGVYTI QVRDEGQGIP PEVLPRITDP FFTTKRETGG TGLGLSICMR IVRSYGGTLE FQSVPGAGTT ATLSLPAEKE VIAA
|
| |