Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0361 |
Symbol | |
ID | 8135668 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 435895 |
End bp | 437838 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644867978 |
Product | PAS/PAC sensor hybrid histidine kinase |
Protein accession | YP_003020200 |
Protein GI | 253699011 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.00025473 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATGAGA TCCGTGACAA CCATAGCTTC GCCTTTTTGG AGCAGAAGGT CGAGGAGCGA ACCCGCGAAT TGAAACAGGA GATCCAGGAG CGCCTGCGGG CCGAAGGGCA ACTGGCCGAG GCGCGGGACC ACTACCTGAA CATCCTGGCC GAGGCCCCCG CGCTGATCTG GCGTGCGGAC ACGCAGGCCA AGTGCGACTG GTTCAACAAC ACCTGGCTTT CTTTTACCGG CCGCACCATG GAAGAGGAAT ACGGCGACGG CTGGGCGCAG GGGGTCCACA CCGACGACCT GGAGCGCTGC GTTGCCATCT GGCTCGGGGC GTTTCACCGG AAGGTCCCGT TCGAAATGGA GTACCGGCTG CGACGCCACG ATGGCGTATT CCGCTGGATC CTGGACATCG GCCGCCCCTT CTCCGGGCTC GACGGCAGCT TCGCCGGGTA TATCGGTTAC TGCTTCGACA TCACCGACCG CAAGGGGGCG GAGATGGAGC TGATAGTGGC AAGGGAGGCC GCCGAGGCTG CCAGTAAAGC GAAGACCGAG TTCCTGGCCA ACATGAGCCA CGAGATCCGC ACCCCGATGA ACAGCATCAT CGGGATGACG CAGCTATTGG CCTACACGGA GCTTTCAGCC GAGCAGAAGG AGTACGTCGA CGGAATCCTC ACCTCTTCGG AGGGGCTCCT TGCCATCATC AACGACATTC TCGATCTTTC GAAAGTGGAG GCAGGGAAGA TAGAACTGGA GTCGCGGAAT TTCAGCCTCA GACAGAACAT CAACGAAATC ATCAGGACCC AGACGGCGGC GGCCCACGAA AAGGGGCTCC AACTTAAAGT CTTCATCCCG GAAGAGATAC CAGACGCACT GGTCGGCGAT CGGCTGAGGC TAAAGCAGGT GCTACTCAAC ACAATCGGCA ACGCCATCAA GTTCACCGCA AGCGGGAGCA TTGCCGTAAC GGTAGCGCTG GCGGAAAAAC AAGAGGACGC GGCTCGCCTT ACATTCAGCA TCGCCGATAC CGGCATCGGG ATAGCGCCGG AGTCCCTGGA CCGCATTTTC GCGCCTTTCG CGCAGGAAGA CACCTCCACC ACCAGGAGGT ACGGCGGCAC CGGTCTTGGG CTTTCCATCA GCACGAAGCT GGTCCGGCTC ATGGGGGGAC GGATCTGGGC GGAAAGCCGC AAGAACACCG GCAGCACCTT CCACTTTGAG ATTCCCTTCC GGCTCTGCGC CAAGAAGGCC CGGCCCAAGG CGTCCTTAAA TACCGCAAGG AAAGTCGTTT GGGACGGGGC CAAGCTCCGC ATCCTGCTGG CCGACGACCA GGAGATGAAC CGCAACATCA TGGAGAAACT CCTGGGGCGG CTGGGGCACG AACTGGAGAG TGCTGCGGAT GGCGGCGAGG CGCTGGCAAA ATGGGAAAAG GGGGACTTCG ACCTCATCCT CATGGATGTC GAGATGCCTG GTATCGACGG CACGGAAGCG ACCCGCGTGA TCCGGGAGAC CGAGACCCCG CTGGGAAAGC GGACCCCCAT CATCGCCCTC ACCGCCCATG CGCTTAAGAA CCACCAGGAG ATGCTCCTCC GAGTAGGCTA CGACGGGTAC GTACCCAAAC CCGTGGAGAT GTCCACGCTG TTACGGGAGA TGAGGCGCTG CTTGCGCCTG CCCGAGCTTG ACACAGCAGG CACAAGCGAA GCTGTCGACG GAGTCCCCAC GACTCCCGCG GCAAACCAGC CTGGCATAGT GGACAGGCAG CAACTTGCAG ATATACTCAG TTCAATCAGT TCGCTGTTGC GGCAAAGGAA CATGAAGGTC CTGGACCAAG TAAACGACCT CTCCGCACTG ATTCCCGGGT CACCTCTTCT AGAACAGTTA AACCAGCAGA TACGGCATTT TGATTGCCAA AGCGCACTAA ATACAGTCAG AGAAATCCTT CTAGATTACG ACATCCACTC CTGA
|
Protein sequence | MDEIRDNHSF AFLEQKVEER TRELKQEIQE RLRAEGQLAE ARDHYLNILA EAPALIWRAD TQAKCDWFNN TWLSFTGRTM EEEYGDGWAQ GVHTDDLERC VAIWLGAFHR KVPFEMEYRL RRHDGVFRWI LDIGRPFSGL DGSFAGYIGY CFDITDRKGA EMELIVAREA AEAASKAKTE FLANMSHEIR TPMNSIIGMT QLLAYTELSA EQKEYVDGIL TSSEGLLAII NDILDLSKVE AGKIELESRN FSLRQNINEI IRTQTAAAHE KGLQLKVFIP EEIPDALVGD RLRLKQVLLN TIGNAIKFTA SGSIAVTVAL AEKQEDAARL TFSIADTGIG IAPESLDRIF APFAQEDTST TRRYGGTGLG LSISTKLVRL MGGRIWAESR KNTGSTFHFE IPFRLCAKKA RPKASLNTAR KVVWDGAKLR ILLADDQEMN RNIMEKLLGR LGHELESAAD GGEALAKWEK GDFDLILMDV EMPGIDGTEA TRVIRETETP LGKRTPIIAL TAHALKNHQE MLLRVGYDGY VPKPVEMSTL LREMRRCLRL PELDTAGTSE AVDGVPTTPA ANQPGIVDRQ QLADILSSIS SLLRQRNMKV LDQVNDLSAL IPGSPLLEQL NQQIRHFDCQ SALNTVREIL LDYDIHS
|
| |