Gene Information |
Locus tag | GM21_3146 |
Symbol | |
ID | 8138497 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 3649660 |
End bp | 3651225 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644870750 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_003022931 |
Protein GI | 253701742 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 137 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGATA AAGATCTCAA TTACAACTCA GGCACGCGTA AGCAGCAGGA GCCCGGGCGA CTGCAGGGGG ACCTACCCTT CCGGCTGATG GTGGAGCAGG TGAAGGAGTA CGCAGTCGCG CTTTTGGATC CGCAGGGAAA CATCTCCTCG TGGAACGCCG GGGCGGAGCA GATCACCAGG TATCGGGGCA GCGAGGTGAT CGGGAAGCAT TTTTCTCTGC TCCATCCCAA GGAGGAACTC CGTTCGGGGG CGCCGGAAAG GGAGCTCGCC ATAGCACGCT CGCTGGGGAG CCTTGACCTG CAGGGGTGGC GGCTGAAAAA AGACGGCAGC CGCTTCTGGG CTGGAATCAC GCTCACCGCT ATCTATGATC AAGAGGAGAA TCTCGCCGGA TTCGCGCTCT TCGCACACGA CGACAGCGAG AAACGCGCCT CGGACGAGGC GCTGCTTAAA AGCCGCAACA TGCTGGAGCG GCTCTTCGAG ACGGCTCCGG ACGGCATCGT GGTGGTCGAC GGCAACGGCG TCATCCGCAG GACGAACCAG CAGGCGGAGA TCACCTTCGG CTACATGCGG GAGGAGATGC TCGGGCAGCG CATCGAGCTC TTGATTCCGG AGCGGTACCA CAAGCGCCAT CGCCAACACC GGCGCAACTA TTTCGCCGAC CCGCGCGCCC GCAAGATGGG GATCGGCCTC GAACTGTACG GACGCAACAA GGACGGTCAC GAGATCCCGG TGGACATCAT GCTGAACCCC ATCGAGACGC CGGACGGGAC CTGGGTCTTC GCCGTGATCC GCGACATAAC CAGCCAAAGA CAAGGCGAGG CGAAGATACT GGAGTTGAAC CTTGCCCTGA GAAACCAGCT CGAGCAGTTG GGGGCAAGCA ACAGGGAGCT GGAATCCTTC AGTTACTCCG TTTCTCACGA CCTGCGGGCC CCGTTGCGCC ACATCATCGG GTTCGTAGAC CTGTTGAACG CCAAGGCCGC GAACGTTCTG GACGAGAAGA GCCGTCACTA TCTGGAGGTA ATCAGCGACG CGGCCAACAA GATGGGATTG CTGATCGACG ACCTGCTGGC CTTCTCGCGC ATGGGGCGCA GCGAAATGAT GAAGGGGTGG GTAGACCTGG GACTGCTGGT GAGAGAGATA GTGAACGACC TGGAAAGCGA CAGCAAGGAG AGGGAGATCC AATGGGACAT AGCCCCGCTC CCCATCGTGC TGGGTGATGC GGCAATGCTG CGCCAGGTGC TCATCAACCT CGTCGGCAAC GCGGTCAAGT TCACCCGTTC GCGGGAAAAA GCAAGAATTG CCATCGGCGC CATCGACCGG GAGCAGGAAA CGGAGATCTT TGTAAGGGAC AACGGGGTCG GGTTCGACGA GGCTTACGCG AGCAAGCTTT TCGGCCTTTT CCAGCGTCTG CATGCCAATG AGGAGTTCGA GGGAACCGGG GTCGGGCTGG CTATCGTGCA GCGGATCGTA CTGCGGCACG GCGGCAGGGT CTGGGCCGAG GGCGAGGTCG ATGGCGGAGC GACCTTCTGG TTCTCGCTCC CGAAGGGAGT AAACCCGGTA CCCTAG
Protein sequence | MTDKDLNYNS GTRKQQEPGR LQGDLPFRLM VEQVKEYAVA LLDPQGNISS WNAGAEQITR YRGSEVIGKH FSLLHPKEEL RSGAPERELA IARSLGSLDL QGWRLKKDGS RFWAGITLTA IYDQEENLAG FALFAHDDSE KRASDEALLK SRNMLERLFE TAPDGIVVVD GNGVIRRTNQ QAEITFGYMR EEMLGQRIEL LIPERYHKRH RQHRRNYFAD PRARKMGIGL ELYGRNKDGH EIPVDIMLNP IETPDGTWVF AVIRDITSQR QGEAKILELN LALRNQLEQL GASNRELESF SYSVSHDLRA PLRHIIGFVD LLNAKAANVL DEKSRHYLEV ISDAANKMGL LIDDLLAFSR MGRSEMMKGW VDLGLLVREI VNDLESDSKE REIQWDIAPL PIVLGDAAML RQVLINLVGN AVKFTRSREK ARIAIGAIDR EQETEIFVRD NGVGFDEAYA SKLFGLFQRL HANEEFEGTG VGLAIVQRIV LRHGGRVWAE GEVDGGATFW FSLPKGVNPV P
| |
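The length fields in this record can be cross-checked against the coordinates with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the Protein Length excludes the stop codon. The function names (`gene_length`, `protein_length`, `clean_sequence`, `gc_fraction`) are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3649660
END_BP = 3651225


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def clean_sequence(text: str) -> str:
    """Strip the whitespace that splits the sequence into blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print("gene length (bp):", length)
    print("protein length (aa):", protein_length(length))
```

With the coordinates above, the computed values match the Gene Length (1566 bp) and Protein Length (521 aa) fields. Passing the Gene sequence string to `gc_fraction` would give a value to compare with the GC content field.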