Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU2314 |
Symbol | |
ID | 2687347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 2531957 |
End bp | 2534905 |
Gene Length | 2949 bp |
Protein Length | 982 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637127007 |
Product | sensory box histidine kinase/response regulator |
Protein accession | NP_953363 |
Protein GI | 39997412 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.581555 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACGAGG CGGCGAAGGT GGACGCCAGG GCCCAGTTCG AGAAGGATAT CGTGTACCGG CGCTGGAATG CCGGTCACGG CGGAGTATAT GTGCCCGTCA CGGAGCGTGC CCAGCCCAAC CCGTGGCTTG CCGGCCTGAA GGACCGGGAT GTGGAAACCA CCACGGGGAA ACGGCTCACC CTCATCAATC CCGCCTACAT GACACGACAG GTCCACGAGC TCAATTTCGA AAGCTCGGGC ATCCGGGGCC ACATCACGAG CCTCAGACCC ATTCGGCAGG CCAATGCGCC TGACAACTGG GAAACCCGGG CATTGGAATC CTTCGAGCGG GGACAGCAGG AATACTCCTC GGTTGAACTC ATGGACGGTG TCCCCCATCT TCGCCTGATG AGGCCGCTGA TGGTGGAGCA GTCCTGCCTC AGGTGCCATC CCGGCCCGAG CTATTCAGTG GGCGACGTTC GCGGCGGCAT CAGTGTTTCA GTTCCCCTTG CGCCCTATTC GGCCCTTGCC CGGAACAAGA TGGCCGCCAC AGGCGCAGGT CATCTGACGT TCTGGGCGGC AGGGCTCGGG GGCATCATCC TGGCGGCGCG CCGGATCACC CGCGACGACC GCTCTCTTCT CTTGCAGCAG GCACGGCTTG CCGAAAGCGA AGAGCGTAAC CGGCTCCTGT CGGAGGTAGC TCTGGAAGGG ATCGTTATCC ACGACAGGGG CATTGTTCAG GATATGAACG CCAGGTTTGC AAAACTGTTC GGCTACCCGC GGGAGGAGTT GGAGGGAATC AACGTGATAT CCCTGCTGTT CCATCCTGAT GATATCGGTT CCATGATCGA TAAGATGAGC AGGGCACATT CTGAGCCCTA CCAGGTGCGT GGGGTGAAAA AGGACGGAAC GGTCTTTGAT GTTGAGATCG AGGGCTACAA TCTGCAGCAC GGGGACAAGT CCGTCAGGGT CGTGTCGGTG CGGGACATCA CCGAGCGTAG GCGGGCTGAG GAGGCCTTGC GCCAAAGTGA GGCACTCTTC AGGAATCTGT TCGAGCACCA TGCGGCCGTC AAGCTGATCA TTGATCCCGT CAGTGGCGCC ATCGTCGACG CCAATAACGC GGCGGTGAGC TTTTACGGCT GGTCCCGTGA GCAGCTCAGG GCCATGAATA TCCGGGACAT CAACATGCTT TCGCCCGAGG AGCTGGTGAG CGAGCTGGAA AATACCAAAA ATATGGAGCG GATCCACTTC TTTTTCCGTC ACCGCCGGGC GGACGACTCC GTCCGAGATG TTGAGGTGTT CAGCAGCAGG ATCGAGGTGA AGGGAAGGGA GTATCTGCAC TCCATCGTTC ACGACGTTAC CGAGCGCAAA CGTGCGGAGG AAGAGCTCCT TCTCGCCAAG GAACAGGCCG AATCCGCCAA CAGGGCAAAG TCGGAATTCC TCGCCAACAT GAGCCATGAG ATCCGCACTC CCATGAACGG CGTGATCGGC ATGACCGGCC TGCTGCTGGA AACGGGGCTC GCCGACGAAC AGCGGAGATA TGCCGAAATA GTCAGGACCA GCGCCGCATC GTTGTTGCAG GTCATTGACG ATATTCTCGA CTTCTCCAAG ATCGAGGCCG GCAGGCTGGA GATGGAAACC ATCGGTTTCG ACCTGCGCAC CTTGCTTGAT GACCTGGCGG AATCTCTGGC GTTCAAGGCG AATGAAAAGG GGTTGACGTT CACCTGCCTG CTCCGGCCCG AGGTGCCCCG GTTCCTGATG GGCGATCCGG TCCGATTGAA ACAGGTCCTG GTCAACCTCG CGGGGAATGC GCTCAAATTC ACTCACCAGG GCGAAATCTC CGTGGAGGTC GGCTCCCTGA CTAAAACGGG CGACTCGGTC AAGCTCCGTT TTGCCGTGCG TGATACCGGC ATCGGCATTC CGCCCGAAAA AACGGAGCTT CTGTTTGAGA AGTTCACCCA GGCGGATGCT TCGATTACGC GTAAATACGG CGGCACTGGT CTCGGCTTGG CCATCTGCAA ACGGCTGGTG CAGATGATGG GCGGAGAGAT CGGGGTCACC AGCCGGCCCG GCGTGGGCTC CGAATTCTGG TTTACTGCTT CCTTCGGCAC ACAGAACCTC CGGGACTCCT CTCCGGAGCC GGACGGCGGT GACGGTCCGC TGCAGCTGCT GAGCGATCTG GGGCACGACG ATGTCCGTAT CCTGCTGGTC GAAGACAACG CCACCAACCG ACAGGTGGCT CTTGGGATCA TAAAACGGCT CGGCCTGCGG GGCGATGCCG CGGCCAACGG CGCAGAGGCA TTGGACCTGC TGGCGACGAC TCCCTACGAT CTGGTTCTGA TGGATGTTCA GATGCCTGTC ATGGACGGGT TCGAGGCAAC CAGGCACATC CGCGATGTCC GGTCGCCGGT CCTCAACCAC GAGATTCCCG TCATTGCCCT CACTGCCCAT GCGATGAAGG GAGACCGGCG CAAGTGCCTG GACGCGGGCA TGAATGACTA TCTCGCCAAA CCGATCTTCC CTGACGCTCT GGCCGAGATG TTGGTCAAGT GGCTCCCTCG CCGCTGCCGG ATTTCAAACG ACGGTGACGG GGAGCGCGCG AACGCGGTGC CGGCTGTGCC GGATGGAACG CCCGTCTTTG ACCGTCCGGG CCTGGAGGCA CGGCTCATGG GAGACGAATC TCTGGTGACG GAGGTCGTTG CGGCATTCCT GGCTGATATG CCTGACCTGA TCGAACGGCT CAGGGCGGCA GTGGCTGACG GTGAGCCGGC AACCGTTGCC CACTGGGCCC ATACCATCAA GGGGGGGGCC GCCAGTGTCG GGGGAGAGCG GCTGCGGGCG GCCGCGGCAG CCTTGGAGTA TGCGGCGGTT GCGGGAGGCA TGGGCGGTGT CGCCTGCTGC ATGAACATGC TGGAAATCGA ATTCGGCCGA TTATGTGAAA TCATGAGACA AGACTACGTC AAGGAGTGA
|
Protein sequence | MHEAAKVDAR AQFEKDIVYR RWNAGHGGVY VPVTERAQPN PWLAGLKDRD VETTTGKRLT LINPAYMTRQ VHELNFESSG IRGHITSLRP IRQANAPDNW ETRALESFER GQQEYSSVEL MDGVPHLRLM RPLMVEQSCL RCHPGPSYSV GDVRGGISVS VPLAPYSALA RNKMAATGAG HLTFWAAGLG GIILAARRIT RDDRSLLLQQ ARLAESEERN RLLSEVALEG IVIHDRGIVQ DMNARFAKLF GYPREELEGI NVISLLFHPD DIGSMIDKMS RAHSEPYQVR GVKKDGTVFD VEIEGYNLQH GDKSVRVVSV RDITERRRAE EALRQSEALF RNLFEHHAAV KLIIDPVSGA IVDANNAAVS FYGWSREQLR AMNIRDINML SPEELVSELE NTKNMERIHF FFRHRRADDS VRDVEVFSSR IEVKGREYLH SIVHDVTERK RAEEELLLAK EQAESANRAK SEFLANMSHE IRTPMNGVIG MTGLLLETGL ADEQRRYAEI VRTSAASLLQ VIDDILDFSK IEAGRLEMET IGFDLRTLLD DLAESLAFKA NEKGLTFTCL LRPEVPRFLM GDPVRLKQVL VNLAGNALKF THQGEISVEV GSLTKTGDSV KLRFAVRDTG IGIPPEKTEL LFEKFTQADA SITRKYGGTG LGLAICKRLV QMMGGEIGVT SRPGVGSEFW FTASFGTQNL RDSSPEPDGG DGPLQLLSDL GHDDVRILLV EDNATNRQVA LGIIKRLGLR GDAAANGAEA LDLLATTPYD LVLMDVQMPV MDGFEATRHI RDVRSPVLNH EIPVIALTAH AMKGDRRKCL DAGMNDYLAK PIFPDALAEM LVKWLPRRCR ISNDGDGERA NAVPAVPDGT PVFDRPGLEA RLMGDESLVT EVVAAFLADM PDLIERLRAA VADGEPATVA HWAHTIKGGA ASVGGERLRA AAAALEYAAV AGGMGGVACC MNMLEIEFGR LCEIMRQDYV KE
|
| |