Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU2401 |
Symbol | |
ID | 2686139 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 2634831 |
End bp | 2637878 |
Gene Length | 3048 bp |
Protein Length | 1015 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637127091 |
Product | sensory box histidine kinase/response regulator |
Protein accession | NP_953447 |
Protein GI | 39997496 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCTCGG CCCCGTATGT TCGAGGAGGT ACCGTGGACT ATACCCTGAA GGATCTTCTG GACATTCCCG GACTGCAGGC GCTGCTCGAT TCTCTGCGGG CGCTTCATCA GCTTGCCTCC TCCGTAATCG ACCCTGAAGG TACCATTCTC GCCGCTTCGG GCTGGCAGCG GCTCTGCACG GAATATCACC GGGCACACCC ACCCTCCCGG CAAAAGTGCA TCGAAAACGA ACTCCGCGTT GGCTCCCAGG CCGGCGAGTC ACTCATGCCT GTCATCCACC GCTGCCCCAT GGGACTTGAG TATGCGGTCA CGCCAATCGT TATCGAGGGA GAGCACCTGG CGAATATCTT TGTCGGCCAG ATTTTCACCT CCTCTCCGGA CGAGTCGTAT TTTGTCCGCC AGGCGCGGCA GTTTGGATTC GACGAGGACG CCTACCTGGC GGAAATGCGG AATGTGCCGA TCTTTGACGA GGAAACGCTT CACTCGTATC TCGCCCTTTT CCGCAGCATT GCCAACATGC TTGCGGAGCA GGGCCTGCAC GTACTGCGAC AGCGCGTGGC CAACGAGGAG CTCCGGAAGA GCGAAAAACG CCACCAGACC ATTCTCCAGG CCGCCCTCGG CGGGCTTTGC ACCATCGACA TGCAGGGCCG GCTGCTCGAA GTCAACGAGG CCTATTGCCA GAAGATCGGC TACAGCCGGC AAGAACTGCT GACCATGAGC ATCTTCGACC TGGAGGCAGC CGAAACGCCC ACCATGACAT CCGCGCACAT GCGGAAGATC ATGGAAGACG GGGAAGGACA TTTCGAATCG CGCCATCGGT GCAAGGACGG AAGCGTTATC GACGTGGAAG TCAACGTCAG GTACTGCCCC GAAGATGGCG GCCAGTTTAT CGCCTTCCTG CGCAACATTA CGGAGCGCAA ACGTGCTGAA GCGCTCCATA GGATGGGGCA GGATATCCTG CTCGCGCTCA ACGAGAATGA AGATATGAAG GAGGCGATCC AGCGGGTTCT GGGTCTGCTG AGATCGGCTA CCGGGGTCGA TGCCGTTGGG ATTCGCCTGC AGGACGGAGA CGATTTCCCC TATTTCTACC AGGAAGGGTT CCCTCCGGAC TTTCTGCAGA AAGAGGACTC CATCGTAGCA AGGAATAAGG ATGGCGGGGT GTGCCGGGAC TCATGCGGCA ACGTCTGCCT GGAATGCACC TGCGGCATGG TGATTTCCGG CAGTACGGAC CACGCCAACC CGCTCTTCAC CCCGGGGGGG AGCGCCTGGA CCAATGATTC GTTTCCCTAC CTCGAAGTGC CGGCCGACCA GGACCCCAGG ACAACTCCGC GCGACGAATG CATCCACCAG GGGTACGCAT CGATCGCCCT GATCCCCATC CGGGCAAAGG GACGAATCGT CGGGCTGCTG CAGCTGAACG ACCGCCGCAA GGGGCGCTTT ACTCGCGAGG GGATCGAAGC CCTCGAAGAC ATCGCCAAGA ACATCGGCGA GGCAATGCTG CGCAAGCAGG CCGAGGAGAA ACTTGTTGCC AGCGAGCGCT TCCTCCGCAT GCTCACCAAT CAGCTGCCGG GGATGGTCGG CTACTGGGAC CGGGATCTCC GCTGTCGCTT CGCGAACGAT GCCTACAAGG AATGGTTCGG CAGGTCGCCG GAGCAGATCA TCGGCCTCAC CGTCCAGGAA TTGATGGGGG AAGAGCTGTT CCGCCTGAGC GAGCCGTACA TCCGGGGCGC CTTGCAGGGG GAACCCCAGA ATTTCGAGCG GGAGCTGGTG AAGCCGAACG GCGAAACGGG CTACACCTGG GCCCAGTACA TTCCCGACAT GGTGAACGAC AAAGCGATCG GCTTCTTCGT GCTTATCTCG GATGTGACCG AACTCAAACG AGCCGAGCAG GAAAAGGCTG CTCTCGAAGT CCAGCTCATG CAGGCCCAGA AAATGGAATC CGTGGGACGT CTGGCCGGGG GTGTGGCACA CGACTTCAAC AACATGCTGA GCGTCATCCT GGGGCATACT GAAATGGCAC TCCTGCGCAT GGACCCCAAC CAACCGACCT ATGCCTCGCT CCGCGAAATC GACAAAGCAG CCCAACGTTC TGCCGACCTT ACCCGGCAAC TCCTGGGCTT TGCGCGGCAG CAAACCGTTT CCCCCAAAGT GCTGAACCTG AATGAAAGCA TTTCGGGGCT GCTCACTATG CTGCACAGGC TGATCGGCGA GAATATCCGC CTTCAGTGGC AGCCGGCGGC GAACCTGTGG CAGGTTCGCA TGGATTCGTC GCAGATCGAC CAGATCATGG CAAACCTGTG CGTTAACGCC CATGATGCCA TTGCCGACGT GGGCGAGATC ATCATTGAAA CAGGGAACTG CACCTTCGAC GAGAGGGACG GTACGATTCA TCCCCATGGG GCAGTCGGTG ATTACGTGAG GATCCGCGTG AGCGACAATG GTTCGGGGAT GGACAAGGAG ACGCGTGCCC ACATCTTCGA ACCGTTCTTC ACCACCAAGG AAGTGGGAAA GGGAACCGGC CTGGGATTGG CGACCATCTA CGGCATCGTC AAGCAAAACA ACGGACTGAT CGATGTTGCC AGCGAGCCGG ACAAGGGAAC GACCTTCTCC ATCTATCTCC CCCGATTCAA AGGAAGCGAG GAGCCGGCGA CGCCGAGAAC GGCAACGTCC TCCTTCAACC GGGGCAATGA AACGGTCCTG CTGGTGGAGG ATGAACCGGC TATCCTGGCG ATGACAACGG AGATTCTGAC CCAACTGGGC TACGCGGTCC TGCAGGCGCC CACCCCCACC GAGGCGTTGC GCATCGGCTG CGAGCACAAG GGGGAGATCC ATTTATTGAT GACGGACGTC ATCATGCCCG AAATGAACGG ACGGGAGCTG GCCAAGAGCC TCTCACAGTT CTATCCCCGG ATCAGGTGCC TGTACATGTC GGGGTACACC TCGGACATCA TTTCCCCCCA CGGCGTGCTC AACGACAGCG TCCACTTCCT CCAGAAACCG TTCGACTTTA CTACCCTTTC GCTAAAGCTG CGCGAGGTGC TCGACTGA
|
Protein sequence | MGSAPYVRGG TVDYTLKDLL DIPGLQALLD SLRALHQLAS SVIDPEGTIL AASGWQRLCT EYHRAHPPSR QKCIENELRV GSQAGESLMP VIHRCPMGLE YAVTPIVIEG EHLANIFVGQ IFTSSPDESY FVRQARQFGF DEDAYLAEMR NVPIFDEETL HSYLALFRSI ANMLAEQGLH VLRQRVANEE LRKSEKRHQT ILQAALGGLC TIDMQGRLLE VNEAYCQKIG YSRQELLTMS IFDLEAAETP TMTSAHMRKI MEDGEGHFES RHRCKDGSVI DVEVNVRYCP EDGGQFIAFL RNITERKRAE ALHRMGQDIL LALNENEDMK EAIQRVLGLL RSATGVDAVG IRLQDGDDFP YFYQEGFPPD FLQKEDSIVA RNKDGGVCRD SCGNVCLECT CGMVISGSTD HANPLFTPGG SAWTNDSFPY LEVPADQDPR TTPRDECIHQ GYASIALIPI RAKGRIVGLL QLNDRRKGRF TREGIEALED IAKNIGEAML RKQAEEKLVA SERFLRMLTN QLPGMVGYWD RDLRCRFAND AYKEWFGRSP EQIIGLTVQE LMGEELFRLS EPYIRGALQG EPQNFERELV KPNGETGYTW AQYIPDMVND KAIGFFVLIS DVTELKRAEQ EKAALEVQLM QAQKMESVGR LAGGVAHDFN NMLSVILGHT EMALLRMDPN QPTYASLREI DKAAQRSADL TRQLLGFARQ QTVSPKVLNL NESISGLLTM LHRLIGENIR LQWQPAANLW QVRMDSSQID QIMANLCVNA HDAIADVGEI IIETGNCTFD ERDGTIHPHG AVGDYVRIRV SDNGSGMDKE TRAHIFEPFF TTKEVGKGTG LGLATIYGIV KQNNGLIDVA SEPDKGTTFS IYLPRFKGSE EPATPRTATS SFNRGNETVL LVEDEPAILA MTTEILTQLG YAVLQAPTPT EALRIGCEHK GEIHLLMTDV IMPEMNGREL AKSLSQFYPR IRCLYMSGYT SDIISPHGVL NDSVHFLQKP FDFTTLSLKL REVLD
|
| |