Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU2531 |
Symbol | |
ID | 2687804 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 2791476 |
End bp | 2794637 |
Gene Length | 3162 bp |
Protein Length | 1053 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637127221 |
Product | sensory box histidine kinase |
Protein accession | NP_953577 |
Protein GI | 39997626 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.606569 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTCC CCCGCAGTAT CCGCACCAAG GTGTGGCTCT GTGTTCTCGT GGCCTTCGTG GGTTACCTGA TCGCAACCCT TTCCAGTTAC TACTCCAATC TACTTTTTTC CCACAACCTC GACCACCTCA AGCGGGTCGA ATTTCCCCTG GCCCTCAATG GGAACGATAT CCTGAGCCTC TACCGCAAGC AGTTGAAGCT CTACGAGGAT GCCTTTGTCA CCGGCGAGCG GGATGAAGCG CTCAGGGCCA ATGAACTGGG GCGCGAGATC GTGACGAGCC TGGCCCGGGT AGCTGAGCTC ACCCGGGAGG ACACCGAGCA CCGCTATGGC GACGTCGAGG CGCTCAAGTC CCGCTACGAG GCCTATTTCC GGAAGGCGAC GGAGCTGTAT CCCCGGGTGC TCGGCAGCAC CGACACGTTC CTGAGCGGAG AGATCGCCCG TCTCGGGGCG GACGGCCGCT TGATCCTGGT GGACTTCGAG CGGATGTCGC GGGATTACGT CACCTCGGTC GAGCACCAGA TCGAGCGCAA CCGTGCGCTT GCCCGCGATA CCTCGATCTA TCTTTTCGTC CTGTTCGGGA TGGTGGTACT GCTGGCGGCG CCTGCCATCA CCTTTGTGGC CAACCGGCTC CTGATCCGCC CCCTGGAAGA GCTGCGCGGC ATGGTTACCT CCTTTGCCGG GGGCAGTCTG GACCTTTCCG GACTCCCTGA CTACGATGCC GGAGACGAGA TCGGCTCCCT CTGCGCCTCG TTCAGGTCCA TGGTGGAGGG GCTCCAGGAG ACCACGGTCT CCCGCGACTA CGTGGACAAT ATCATCGAGA GCATGAGCGA CTGCCTGATA GTGGTCGACA CCCGCGCCGC CATCCGGCGG GTGAATCGCG CCGCCCTGAC CATGCTCGGC TACGATGAGG AGCAGTTGCT CGGCATGCCG GTCGGTGTCA TCTTCGCCTG GGAACGTGAC AAGGAGCGCC TGCGGACCGA GGGGCTCAGG TGCTTTGTGG AGTCCATCGG CTCCAGTCCG GCCGAATTCA CCTTCCTGAG CAGCGACGAA CGCCGGACCC CGGTACTGCT CTCCGCGTCT CCCATGTTAG GCCGTGACGG CTCGTTCCAG GGATCGGTCT TCGTTGCGGT GGACATCGGG GAGAGGAAGC GGACCGAGCA GGCCCTGCGG GAAAACGAGC AGCACCTGAA GAACATCCTC GATTCCATCC ACGCGGGAAT CCTTGTCATC GACCCCCAGG ATCACCGGAT CGTGGATGCC AACACCTTTG CCCTGGACAT GATCGGGGCG CCCAAGGGGC TCGTGGTGGA CCGGATCTGC CACAATTTCG TCTGTAGCGC CGTTCAGGGC GCCTGTCCCA TCACCGACCT GCTCGAAAAC ATGGACAACT CGGAGCGGCG CCTGCTGCGG TTCGACGGCA CGTCCATCCC GATCCTCAAA TCGGTGGTGC CGGTCACCTA CCAGGGAAGG AAGCTGCTGA TCGAGAGCTT CATCGATATC TCCGAGCGCA AGTGGGCCGA AGAAATGCTC CGCTACCTCA CCGAGGGGAC GGCTTCGGTG ACGGGCGAGG AGTTTTTCCG TTCGCTCGTC CGGCGTCTCT CCTCGGCCCT CGGGACCCGG TTCGCCTTTG TCACCCGCCT GCTGGACGGT TCTCCCGCCC GGATGCGCAC CCTGGCCTTC TGGACCGGCA ACGGCTACTG TCCCACCATG GAGATGCCCC TGAACGGCAC CCCCTGCGAA CGGGTCATCG CTGACAGCGA CATCCTGTTC CATCCCAGCG GCTTGGGACA GCTCTATCCC TCCGCCGAAA CCATGAAGCG AATGGGTGTC GAGAGCTTCC TGGGCGTGCC GCTGTTCGAT TCCCGCGGCA CCGCCATCGG CCACCTGGCG GTTCTGGACG ATAAGCCCAT GCGCGAGGAC GAGGGGCACC GCTCCCTGTT GCGGATATTC GCGGCCCGGG CCGGGGCGGA GCTCGAGCGG ATGAGGTGGG ACGAGGCCCT GCGCGAAAGC GAGACCCGCT ACAAGGATCT CTTCGAAAAC GCCAACGACC TGATCCAGAG CGTGTCACCC TCGGGCGAGA TCCTCTACGT GAACCGGGCC TGGCGCGAAA CGCTCGGCTA CAGCGAGGAG GAGGTGAAGG GGCTCACCTT TGATCAGATC ATCGATCCGG AGTGCATTGG CCACTGCATG GGCGAGTTCC GCAGGGTCAT GGAGGGCCAG AGCCTGGAGG CGGTGGAGGC CCGGTTCATG GCGAAAGATG GCCGCTCCAT CCTGGTGGAG GGGAGTGTCA ACTGTAACGT GGTGGACGGC AAGCCGCTGG CCTCCCGGGG GATTTTCAGG GATATCACCG AGCGGAAGCA CCATGAGGAC ACGCTCCGGA AATACGCGAC AGAGCTTGAG CAGACCAACG AGGAACTGAA GGACTTCGCC TATATCGTCT CCCACGACCT GCGCGCGCCG CTGGTCAGCA TCAAGGGGTT CTCCATGGAG CTGGTTACGG CTATGGATGA GCTCAGGGGG GTCATAGCGG AGCTTTCCCC GAACATCGAG ATGATGACCC GCGAGCGGCT CCGCCGGCTC TTCGAGCAGG ATATCGACGA GGCGGTCGGC TTCATCAACT CCTCGTCCCA GCGCATGGAT ACCCTCATCA CCGCCATTCT CAACCTGTCG CGCCTCGGTC GCAGGGAACT CAAGACGGAA CCGGTCGACA TGGGCGCTGT CGTGCGTTCG ATCCTCGACT CCCTGGCCCA TCAGATCGAG ACGAACCGTA CCGAAGTGGC GATCCGCGAC CTTCCGGTCA TCACCGCGGA CCGGATTGCC ATGGAACAGA TCATGGGCAA CCTGCTGGAC AACTCCCTCA AGTACCTGGA GCCCGGGCGG CCGGGCCGTC TGGAGATCTG GGCCGATGTG GGCGGCGAGG AGTGCGTCTT CCATGTGAAG GACAACGGGC GCGGCATCAG GGAGGAAGAA ATCCCCCGGG TCTTCGAACT GTTCCGCCGC GCCGGCCGGC AGGATGTCCC CGGCGAGGGG ATGGGGCTGG CCTTCGTGAA GACCCTGGTG CGGCGGCTGG GTGGCCGCAT CTGGTGCGAG TCCGAACCGG GCGTGGGGAG CACGTTCAGC TTCACCCTGC CGTCTTCCTT TTCTCACAAA CTCCCGCTAT AG
|
Protein sequence | MKLPRSIRTK VWLCVLVAFV GYLIATLSSY YSNLLFSHNL DHLKRVEFPL ALNGNDILSL YRKQLKLYED AFVTGERDEA LRANELGREI VTSLARVAEL TREDTEHRYG DVEALKSRYE AYFRKATELY PRVLGSTDTF LSGEIARLGA DGRLILVDFE RMSRDYVTSV EHQIERNRAL ARDTSIYLFV LFGMVVLLAA PAITFVANRL LIRPLEELRG MVTSFAGGSL DLSGLPDYDA GDEIGSLCAS FRSMVEGLQE TTVSRDYVDN IIESMSDCLI VVDTRAAIRR VNRAALTMLG YDEEQLLGMP VGVIFAWERD KERLRTEGLR CFVESIGSSP AEFTFLSSDE RRTPVLLSAS PMLGRDGSFQ GSVFVAVDIG ERKRTEQALR ENEQHLKNIL DSIHAGILVI DPQDHRIVDA NTFALDMIGA PKGLVVDRIC HNFVCSAVQG ACPITDLLEN MDNSERRLLR FDGTSIPILK SVVPVTYQGR KLLIESFIDI SERKWAEEML RYLTEGTASV TGEEFFRSLV RRLSSALGTR FAFVTRLLDG SPARMRTLAF WTGNGYCPTM EMPLNGTPCE RVIADSDILF HPSGLGQLYP SAETMKRMGV ESFLGVPLFD SRGTAIGHLA VLDDKPMRED EGHRSLLRIF AARAGAELER MRWDEALRES ETRYKDLFEN ANDLIQSVSP SGEILYVNRA WRETLGYSEE EVKGLTFDQI IDPECIGHCM GEFRRVMEGQ SLEAVEARFM AKDGRSILVE GSVNCNVVDG KPLASRGIFR DITERKHHED TLRKYATELE QTNEELKDFA YIVSHDLRAP LVSIKGFSME LVTAMDELRG VIAELSPNIE MMTRERLRRL FEQDIDEAVG FINSSSQRMD TLITAILNLS RLGRRELKTE PVDMGAVVRS ILDSLAHQIE TNRTEVAIRD LPVITADRIA MEQIMGNLLD NSLKYLEPGR PGRLEIWADV GGEECVFHVK DNGRGIREEE IPRVFELFRR AGRQDVPGEG MGLAFVKTLV RRLGGRIWCE SEPGVGSTFS FTLPSSFSHK LPL
|
| |