Gene Information |
Locus tag | GM21_1273 |
Symbol | |
ID | 8136599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 1483114 |
End bp | 1486128 |
Gene Length | 3015 bp |
Protein Length | 1004 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644868886 |
Product | PAS/PAC sensor hybrid histidine kinase |
Protein accession | YP_003021091 |
Protein GI | 253699902 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
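
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1483114
end_bp = 1486128
gene_length_bp = 3015
protein_length_aa = 1004

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, so N codons encode N - 1 residues.
codons = span_bp // 3
residues = codons - 1

print(span_bp, codons, residues)  # 3015 1005 1004
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.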
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 124 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCACC ATCGCCGGGG GCACCGCCTC CCGCTCGTGG CAGTCACCCT CGCCGTCATC GCCCTCCTCC TGGTCCCGAA AGGGGCGCCG GCACAGGAAA CTGCGACCAT CCAGTTGAAA TGGCTGCATC ATTTCCAGTT CGCGGGGTAC TACGCGGCGC TGGAGAAAGG GTTCTACCGC CGGGCCGGCC TGGATGTGAC CATCAAGGAG GGGGGGCCGA GAACCGAGGT GGAGGATGAG GTCCTCTCGG GCAGGGCTGA CTTCGGCGTG GGAACCTCGG CGATCCTTTT GCGCCGCGCC CGCGGCGAAG ACCTGGTGGT GCTGGGGCAG ATCTTCCAGC ATTCGGCCGC CGTCCTGATC ACCCCCCGCA GCACCGGAAT CCGCTCCATC CCCGACATGG CGCGCCGCAG GTTCATGTAT TCCAACCAGC ACGGCGACAT GCTGACGCTT TTGCTGCAAA ACGGCGTCAA CGAAAAAGAC CTCGTGCAGG TCCCCCACAA CGGCGACCCC CGCGACCTCA TCGGCGGCAA GGCAGACGTC ATGATGGGGT ACAGCTTCAA CGAGCCCTTC ATCCTGGAGC AGGAAGGGAT ACCCTACCTC CTCTTCTCCC CGCTTACCTA CGGCATAGAT TTTTACGGCG ACAATTTCTT CACCACGCGG GCCAACATCG AGGCAAGGCC GGAACTGGTC CGCGCCTTTC GCGAGGCGAC GCTGGAGGGG TGGCGCTACG CGATGGCCAA CAAGTCCGAG GTGGTGGACC TGATCCTCGC CAAGTACTCC CGGAAGAAAA GCCGCGACTG GCTCATGTTC GAGGCGAACC AGATGGAGAC CCTGATCCAG CCCACCCTGG TCGAGTTGGG GTACCAGAAC CCGGAGCGCT GGCGGAACAT CGGGGAGTCG TTCGCAAAAC TCGGCATGGT CCCGCAGAAT TTCAACACCA GCGGCGTGAG CTACGACCCC GCTCCCGGCA AGTACTATCG CGTCATCCTT CAAATACTGC TGGTCTGCGG TTCCGTCATA GCGGTGCTGG TCGTCATCGT GATGAAGTTC AGGCAGTTGA ATGGAACACT CAAGGCCCAG GTGGCCGAGC GCCAGGCGGC GGAGGAGGCG CTCAGGGAGA GCGAGGAGCG GCTGCGCGTC ATCTTCGAGA CCTCGCAGGC CGGCATCATC ATGGTCGACC CCAAAGGGAT CATCCGCTTC GCGAACAAGA GGATGGCCGA GATGTTCGGC TGCCCTCACG ACAAGCTGAT CGGCTCCGAC TACCGAAGCC ATCTCCATCC GGAACAGTGC GAGGTGGGGA GCCAACTCAT GGAGAAGCTG ATCCGCGGAG AAATGGAGCA GGCCTGCACG GAACGCCGCT ACCTGTGCGG TGAGCAGGGG GATTTCTGGG GCTACCTCTC CGGCAGGAGG CTGGAGGCCC CCGACGGCAA GCTGCAGGCG CTGGTCGGGA TCATCTCCGA CATAACCGAC CGCATCAAAG CGGACGAGGC CCGGGGGAAG GCGCTCATGC TGGTGGAGAC CCTCCTGGCC CACTCCCCGA TGGGGATCAC CGTCTTCGAC GGGGAGAGCG GCGCCTGCAT CCTTTTGAAC CAGGCCGCCG CAGGGATCTC CGGCGGCACC AGGGAAGCGC TTCTAGGGCG CGATTTCCGG GGGGTGCAAC CCTGGCGCGA AGCGGGGCTC ATCGCGGCGG CGGAAAAGGT CCTCTCCGAC GGGATCCCCC GCCCCTTCGA GGCGGAACTC CGCGGCTCCC TGGGAAAAGA CGTCATGCTG CGCTGCCACC TTTCCCGGTT CGACCTGGAG 
GGAAGAGCGC ACCTGCTGGT GCTGGAGCAG GATGTCACCG AGGAGATGCG CCTGGAGCGG GAGAACAAGC GGATCGAGGC GCAGATGCTG AACATGCAGA AGCTGGAGAG CCTCGGGGTG CTGGCGGGGG GAATCGCCCA CGACTTCAAC AACATCCTTA CCGGGATAGT GGGTAACATA AGCTTCGCCC AACTGGCGCT CCCCGCGGCC CACAAGGCGG CGGCGCCGCT TCTGAAGGCC GAGAAGGCCT GCCAGAGAGC GGCGGAACTC GCCTCCCAGC TTTTAACCTT CGCCCGGGGG GGGCAGCCGA TCAAGAAGGC GTTCTCCGTC AAGCCGCTGG TCGGGGAATC GCTCTCGCTG GTTCTGCGCG GCACCAACGT CAAGGGGGTC ATCGACATCG CCGACGATCT TTGCGTCATC GAGGCGGACG AGGGGCAGAT AAACCAGGCT TTCAACAACA TCATCATCAA CGCCGTGCAC GCCATGCCGG GGGGGGGAAC CCTCACCATA GCGGGCGAGG ATGCCGTGAT GGAAGCAGGC AACCGCTTCG GCCTCGCGCC GGGCCCCTAT GTCCGGCTGA GTTTCAGCGA CCAGGGATGC GGCATCCCCG AAGCGGACAT AGAAAGGATC TTCGACCCGT ACTTCACCAC CAAGACCAGC GGCAGCGGCC TGGGGCTTGC ATCGACCCAC TCCATCATCG CCAGGCACGG CGGCATGATC CTCGTGGACT CGGTTCCGAA AAAAGGAAGC ACCTTCATTA TCTACCTCCC CTCCACCGGG AATTCGGTGG CGGAAGAGGC GGGGCAGGAC AAGGCCGAGC GCTTGCACGG GGGGGGACGG ATGGTTGCGG TGATGGACGA CGAGGAGATG ATCAGGGACC TTACCCGCGC CATGCTGGTC GAGCTCGGTT ACCGTGTGGA GGTATGTTGC GACGGCGCCG AGGTGGTCGA ACTCTATCGA GCCGCCTGCG CCCGCGGGGA ACGCTACTCA GCCGTAATCA TGGACCTCAC CGTCCCCGGG GGGATGGGGG GCAAGGATGC GGCGCTCCGG ATCCTGGAGC TCGACCCGAA GGCGCGGCTG ATCGTTTCCA GCGGCTACTC CAACGACCCC GTCATGTCCG AGCACGAAAG CTTCGGCTTC TGCGCCACGC TGGTCAAGCC TTACACCGCC GACGACATCG CCAGGGTGCT GGGGGAGGCG ATTAACGGCA ATTGA
|
Protein sequence | MNHHRRGHRL PLVAVTLAVI ALLLVPKGAP AQETATIQLK WLHHFQFAGY YAALEKGFYR RAGLDVTIKE GGPRTEVEDE VLSGRADFGV GTSAILLRRA RGEDLVVLGQ IFQHSAAVLI TPRSTGIRSI PDMARRRFMY SNQHGDMLTL LLQNGVNEKD LVQVPHNGDP RDLIGGKADV MMGYSFNEPF ILEQEGIPYL LFSPLTYGID FYGDNFFTTR ANIEARPELV RAFREATLEG WRYAMANKSE VVDLILAKYS RKKSRDWLMF EANQMETLIQ PTLVELGYQN PERWRNIGES FAKLGMVPQN FNTSGVSYDP APGKYYRVIL QILLVCGSVI AVLVVIVMKF RQLNGTLKAQ VAERQAAEEA LRESEERLRV IFETSQAGII MVDPKGIIRF ANKRMAEMFG CPHDKLIGSD YRSHLHPEQC EVGSQLMEKL IRGEMEQACT ERRYLCGEQG DFWGYLSGRR LEAPDGKLQA LVGIISDITD RIKADEARGK ALMLVETLLA HSPMGITVFD GESGACILLN QAAAGISGGT REALLGRDFR GVQPWREAGL IAAAEKVLSD GIPRPFEAEL RGSLGKDVML RCHLSRFDLE GRAHLLVLEQ DVTEEMRLER ENKRIEAQML NMQKLESLGV LAGGIAHDFN NILTGIVGNI SFAQLALPAA HKAAAPLLKA EKACQRAAEL ASQLLTFARG GQPIKKAFSV KPLVGESLSL VLRGTNVKGV IDIADDLCVI EADEGQINQA FNNIIINAVH AMPGGGTLTI AGEDAVMEAG NRFGLAPGPY VRLSFSDQGC GIPEADIERI FDPYFTTKTS GSGLGLASTH SIIARHGGMI LVDSVPKKGS TFIIYLPSTG NSVAEEAGQD KAERLHGGGR MVAVMDDEEM IRDLTRAMLV ELGYRVEVCC DGAEVVELYR AACARGERYS AVIMDLTVPG GMGGKDAALR ILELDPKARL IVSSGYSNDP VMSEHESFGF CATLVKPYTA DDIARVLGEA INGN
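
The gene and protein sequences above can be compared with a short translation check. This is a sketch only: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and it assumes the gene sequence is listed in coding orientation, which is consistent with it beginning with ATG and ending with TGA even though the gene lies on the minus strand. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so the standard assignments are used here.

```python
# Standard NCBI codon ordering over the bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks that group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon.

    Note: this does not handle alternative start codons (e.g. GTG or TTG),
    which table 11 would read as methionine at the first position.
    """
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three ten-base groups of the gene sequence listed above.
prefix = "ATGAACCACC ATCGCCGGGG GCACCGCCTC"
print(translate(prefix))  # MNHHRRGHRL
```

Passing the full gene sequence to `translate` should give a string comparable with the protein sequence field, and `gc_percent` on the same input can be compared with the GC content field.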