Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU0822 |
Symbol | |
ID | 2687230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 882167 |
End bp | 884356 |
Gene Length | 2190 bp |
Protein Length | 729 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637125494 |
Product | sensory box histidine kinase |
Protein accession | NP_951879 |
Protein GI | 39995928 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTCA GGACAAAACT CATTATTCTG GTGTCGATTC TCGTAATCAT CCTGATGGTG GTGACCTCGA GGATCACGCT GGGCTACCTG GAGCGGCACC TCCACGACTC CATCGCCGCC CAGCAGACCG CCACCGTCGT CCACGTGGCC ACGGACATCG ACAGCACCCT CCGCTCCATG CTGGAGTTGC TGACGGCAAG CGCCCGGGTC GTCCCGCCCG CAGCACTGGC CGACCCCTCG GGCGCCAAGG CGTTTCTGGA AAGCCGGACC GGCCTGCGCA CGCTGTTCAA CAATCACCTC TTCGTCGTTG ATGCCGCCGG CAACCTCCTG GCAGAAGTGA CCAACCAGGA AGTCCGACGG GTGGAAAACT TCTCCGGTCA CGCCTTCTTC AACAAAGCCA GGGCAACCCG CAAGCCGCTC ATCTCCGAGC TGACCACCTG CTGCACCGGC ACCGCCAACT TGAGGGAAAT CGTCTTTATC AGCCCCATCC TCGGGAAAAA CGGATCATTC AAGGGAGCCT TGCTGGGGGG AATTGACCTG AGCGAGGAAA ACGCCCTGAG CCGGTTCGGC CGGATCACCG TGGGCAACAA GGGGTTCATC CGCATCATTG ACCGCAACCA TATGGTACTG ATCCATGCGG AACGGGAGCA CACCCTGATC AAGGCGGTTC CTGAAGTTGC GCGCCTGGCC GACGCCGCCC GGGAGGGCTA CGTGGGCACC CGCGAAACAA AGGGGCGATA CGGCGACATC CTTCTGACAT CGGTAGCGAA GCTCGGCAGC AAGGATTGGG TCGTGGCGGC GAGCTATCCG CTCACAGTGG CCTACGAGCC CGTGAAGGTG GTACGCCGGC TGTTCATCAT CAGTACCGTG GTCGCCATCC TCGGTGTTCT GGTAGTGGTC TCGCTCTCCA TGCAGTACCT GACCCGCCCC ATTCTGGCGC TGGAGCGGCA CATCAACGAA CTGAGCGGCA AAAAAGGCAA GGAACGGCTG GTGCCGGTAT CGAACGAGGA CGAGTTGAGC CGGCTCACCG AAACCTTCAA CACTATGCTT GCGGAGATCG ACCGGCAAAC CGAGAGCCTC AGGGAGAGCG AGGACCGTTT CCGGGGGGCT TTCGAGCAGG CGGCGGTGGG CATGGCCATC ATCGACCGCG AAGGGCTGCT GCTCAGGACA AACCGGCGCT TCTGCGACAT TACCGGTCGC CATGACGAGG ATCTGGCGGG GCTCGACTGC CTTACCCTGG TGCACCCGGA GGACCGCAAC GCCACCCGGG AGATCATGCC GACCATGGCC GCCACTGAAG GGGAGCCGTT GACGCGTGAG CTCCGGTTCA CCCATGGCAC GGGCCGGACC GTCTGGGCAA ATACCGCGTT TTCGCCGGTC CGGGGGAGGA GTGGCACCGA CGATTCCTTC ATAATGGTGG TGGAGGATGT CACCGAGCGC AAGCGTGCCG AAGAGGAAAT CCTGCGGTTG AACTCCGACC TGGAACAGCG GGTGGCGGAT CGTACCGCCG CACTGGAGAG CGCCAACCGG GAGTTGGAGG CATTCAGCTA CACTGTCTCC CACGACCTGA AGGCCCCGGC CCGCCATATC TCCGGGATCG TCGACATAGT GCTGGAAGAT TGGGGTGGCT GCATGGAGCC TGCGCACCGC GAGTTGATGG AACGTGTGGC CGCGGCAGCC GGCAGGATGC AGTCCATGAT CGACGGATTG CTGGAACTGA GCCGCGTTGG CAGCGATGAA CTGCGGCGCC AGGAGGTTCG CCCGGCCCAC CTGGCCCGGG AGATCTGCAT CGAGCTGGCC GCGGCAGAGC CGGCGCGGCA GGTGGACTGG AGCGTAAAGG ATGTGCCGCC GGCCAATGCC GATCCCGAAC TCCTCATGAC CGTGCTGGAA AATCTCCTGG GCAACGCCTG GAAGTATACC TCGCGAAACG AACGGGCTGA GGTGGAATTC GGCTGCGAAT ACTGTTCGGG AAAAAACATG TACTACGTAA AGGACAACGG CGCCGGGTTC GACATACGCG AGGCCCAGCG GCTGTTCGCC CCCTTCCAGC GCTTCCATCC GGCATCCGAG TTCGAAGGAA ACGGCATCGG GCTCGCCACG GTCGCCAGGA TCATCCATCG CCACGGCGGC ACCATCCGCG CCGAGTCGGC CCCCGGCCGG GGAGCCACCT TCTATTTCTC CCTCGACTGA
|
Protein sequence | MKLRTKLIIL VSILVIILMV VTSRITLGYL ERHLHDSIAA QQTATVVHVA TDIDSTLRSM LELLTASARV VPPAALADPS GAKAFLESRT GLRTLFNNHL FVVDAAGNLL AEVTNQEVRR VENFSGHAFF NKARATRKPL ISELTTCCTG TANLREIVFI SPILGKNGSF KGALLGGIDL SEENALSRFG RITVGNKGFI RIIDRNHMVL IHAEREHTLI KAVPEVARLA DAAREGYVGT RETKGRYGDI LLTSVAKLGS KDWVVAASYP LTVAYEPVKV VRRLFIISTV VAILGVLVVV SLSMQYLTRP ILALERHINE LSGKKGKERL VPVSNEDELS RLTETFNTML AEIDRQTESL RESEDRFRGA FEQAAVGMAI IDREGLLLRT NRRFCDITGR HDEDLAGLDC LTLVHPEDRN ATREIMPTMA ATEGEPLTRE LRFTHGTGRT VWANTAFSPV RGRSGTDDSF IMVVEDVTER KRAEEEILRL NSDLEQRVAD RTAALESANR ELEAFSYTVS HDLKAPARHI SGIVDIVLED WGGCMEPAHR ELMERVAAAA GRMQSMIDGL LELSRVGSDE LRRQEVRPAH LAREICIELA AAEPARQVDW SVKDVPPANA DPELLMTVLE NLLGNAWKYT SRNERAEVEF GCEYCSGKNM YYVKDNGAGF DIREAQRLFA PFQRFHPASE FEGNGIGLAT VARIIHRHGG TIRAESAPGR GATFYFSLD
|
| |