Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2778 |
Symbol | |
ID | 8138121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3226226 |
End bp | 3228496 |
Gene Length | 2271 bp |
Protein Length | 756 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644870381 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_003022570 |
Protein GI | 253701381 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 117 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGGCCA AGCAAGTCAT GAAGATGCCA TCATCGCTGC AGAGCTTTGC ACGACAGGTT ATTGCCACTA CTCTTCTCAT CAACCTTATT GTCCTGGGCG TCACGCTATG GTCGCTGCGC GACAGTCGGA TGCATTACGA GGAACGGATT ACCTTGAATA CCGTGAACCT CTCTCTGGTT CTGGAAAAGT ACTTGAACGG CGTTATGGAA AAAGTGAACT TGACGCTTCT GGCCCTGGGC GATGAAGTAG AGCAGCAACT GACTGCCGGG CGCCTTGACG GCGACCGCTT GAATGCGCAG ATGAAACGGC AGCTTTCTCG ACTCCCGGAG GTGGATGGCA TTCGCATGAC TGACGCCCAA GGCCGGGTAA TATACGGAAC CGGTATGACC CCAGGGGCAC ACCCCAGCGT TGCCGACCGC GATTACTTCC ATTATCTGCG CAGCAACCCA ACTGCTGGTC TTGTCATCTC CAAACCGCTT ATCAGCCACA TCAGCGGCAA GAATGTTGTC GTCGTCGGCC GACGGATTAA TCGATCAGAC CATTCTTTTG GCGGGGTAAT ATACGTTGCA ATAATTGTCG AGCACTTTGC AACGCTATTT TCTACCATAA ATGTCGGTCC TCATGGCGCG ATCACACTCA CTGACACCAA GAGTATTGTC ATCGCCCGCT CCCCGGTGCC TGGGCATACT GGCAGTTACC TTGGTAAAGA GCTGAAATCG GCAGGGCTGA AGAAGTTGAT TGATGAGGGG CTGACAGTGG GAAGCTACCG AAGCAAATCA GCGCTTGACG GGGTGCAACG CACTATCACC TTCCGGGTAA TCAATGGGTA CCCGCTCCTG GTCTTTGTAG GGATGGCGCC TTCCGATTAC CTGCACGAGT GGCGATTGGA TGCTTTGAAG ATGGGGGGAC TGGTCGTCTC TTTCATGCTG ATAAGCATAA TCACCAGTAG GCTGATCTAT GAAAGGCGGA AACGCGAGAA ACTGGCAGAG GCGGAATTGT ATCAACATAA GGTATATCTG GAGAGCATCG TGGTGCAACG GACCTCCGAC CTTGAAACCA GGAACCGGGA GTTACAGGAG TCGGAGGGGG TGCTGAAAAC CATCTTGGAC AATGTCTATG ACGCTATCGT CATCCATGAC GCTTCAGGGC GGATACTGCA GGTGAACAGG CGCTGGCGCG AGATGTATGG TGTCTCTGAA AACGAGTCGG AGACTCTCAC CATTGCCGAT TTCTCCACTG ATCCGCCACC ACCTGAGGAA CTCGCTTCCT TGTGGGGACA TGTCCTCGCC GGGAACTCAA ACTTCTTCGA GTGGCCCGCC CGCCGGCCGC ATGATGGCTC CAAAGTCTGG GTGGAAGCCT TCCTTTGTCC GATAAGATTG AAGGAGCAGA ATCTGATCAT GGGGTGCGTA CGAGACATAA CCGAGCGCAA GGCAACTGCA CAGGAACTAC AGAAGTATCG GAATCATCTC GAGGATCTTG TACAGGAGCG GACTGAGGAG TTGGCAAGAG CTGTCGAGAA GACGCGCAGG GAAACGGAGC AGCGGATTGC TGCGGTCGAG GAACTGCGAC AAAAGGAGCG GCTTCTCATC CAGCAGAGCC GATTGGCTGC AATGGGGGAG ATGATGGGCA ATATTGCACA TCAGTGGCGC CAGCCGCTAA ACATTCTGGG TCTCATCGTC CAGGAGCTCC AGATATGCCA CCAGAAGGGC ACTTTGGATA ATAAGCTCGT CAATACCCTG GTACCGAAAG CGATGAAGGT GATCGCACAT ATGTCGCAGA CGATTGACGA TTTCCGCAAC CTGCTGAGCC CCGACACGTC AAGAACCGTC TTCAGTGTCA ACGAAGTTGT TGAAAGGGTC CTGTCAATTA TGATTCTTGA GGCAAAGGTG GATGTCATCG CAGAGGAGGA GTGCTTCGCG GAGGGTGCTA GAAACGAGTT TTCACAGGTC ATTATAAATG TTCTGGCCAA CGCGAACGAT ATCTTCCGGG AGCGGCAAGT TTCGGCATCG CGGATCATCA TCCGAATTCT GCCTCAGGAC CTCAAGTCGG TGGTAACCAT CGCCGACAAC GGCGGCGGGA TCCCTGAGGA AATAATGTGC AAGATCTTCG ATCCATATTT CACCACCAAG GCGCCCGACA GGGGTACCGG CATAGGCCTG TTCATGTCGA AGACCATCAT CGAACAGAGG ATGAAGGGGG CGCTGTCAGC CCGCAATACA GCTGAAGGCG CAGAGTTCAG GATTGAGGTT CCTGCAGGTA CCAAGCCCTA G
|
Protein sequence | MGAKQVMKMP SSLQSFARQV IATTLLINLI VLGVTLWSLR DSRMHYEERI TLNTVNLSLV LEKYLNGVME KVNLTLLALG DEVEQQLTAG RLDGDRLNAQ MKRQLSRLPE VDGIRMTDAQ GRVIYGTGMT PGAHPSVADR DYFHYLRSNP TAGLVISKPL ISHISGKNVV VVGRRINRSD HSFGGVIYVA IIVEHFATLF STINVGPHGA ITLTDTKSIV IARSPVPGHT GSYLGKELKS AGLKKLIDEG LTVGSYRSKS ALDGVQRTIT FRVINGYPLL VFVGMAPSDY LHEWRLDALK MGGLVVSFML ISIITSRLIY ERRKREKLAE AELYQHKVYL ESIVVQRTSD LETRNRELQE SEGVLKTILD NVYDAIVIHD ASGRILQVNR RWREMYGVSE NESETLTIAD FSTDPPPPEE LASLWGHVLA GNSNFFEWPA RRPHDGSKVW VEAFLCPIRL KEQNLIMGCV RDITERKATA QELQKYRNHL EDLVQERTEE LARAVEKTRR ETEQRIAAVE ELRQKERLLI QQSRLAAMGE MMGNIAHQWR QPLNILGLIV QELQICHQKG TLDNKLVNTL VPKAMKVIAH MSQTIDDFRN LLSPDTSRTV FSVNEVVERV LSIMILEAKV DVIAEEECFA EGARNEFSQV IINVLANAND IFRERQVSAS RIIIRILPQD LKSVVTIADN GGGIPEEIMC KIFDPYFTTK APDRGTGIGL FMSKTIIEQR MKGALSARNT AEGAEFRIEV PAGTKP
|
| |