Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1280 |
Symbol | |
ID | 8136606 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1493617 |
End bp | 1496763 |
Gene Length | 3147 bp |
Protein Length | 1048 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644868893 |
Product | methyl-accepting chemotaxis sensory transducer |
Protein accession | YP_003021098 |
Protein GI | 253699909 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 0.0818053 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGTGGT TTCACGATTT GAAGCTGGGC GCGAAACTGA TGACGGGGTT TATTTCCGTC GCCCTCATAG CCGCGGTGAT CGGCTATGTC GGCATAATGA AGCTGAAGGC AATCGACGAG GCTGACATAA GGCTGTACGA AAAGATCACT ATTCCTATCT CTCAGCTCCA GAAGATCTCC ACCTCCTTCC AGAGGATCCG CATCAATATC CGCGACATGG TCGCGGCGAC CGACGCCGGC GAGCGCAAGA GCTCCGGTGA CGGCATCGTC CAGTTGCGCG AAGAGATCAA CAAGACTTCC GCCGAGTTCG AGAAGACCAT CATGACCGAG GAGGAGCGCA AGCTCTTCGC CGAGTTCGAG CGGGCCCACG GGGTGTACGG CCCGCGGCTG GACGAGATGG TCCAAATGGC CAAGGCGAAC CGGACCCAGG AGGCGCACGC GCTGATGATG GGGGACGGCG CCAAGGGGGC CAAGGAGGAG CAGGCAGCTA TCGACGCCCT GGTGGAAGCG AAACTCGCCC AAGCGAAGCT CACCGCCGAC GGCAACAAAG CCCTTGCCCA GAGCGCCACC ACCTTCTTGC TGGCCCTGCT CGCGGCCGGG GTGCTCCTGG CGGTCGCCTT GGGGTACTTC ATCACCAAGC TGGTGCAGGG ACAGCTGGGA GCCGACCCCA AAGAGGTGAG CGAGGTGGCG AGCCGAGTCG CGAAGGGCGA CATGTCGGTC GCCATCGATC TCGCCGGGAA GAAGGGGGAC AGCGTGATGG CCGCCATGCA CTCCATGGTC GAGTCGATCA AGGCGCTGGT CGCCGACACG GCGCTGCTCT CCGAGGCGGC CGTGGCCGGA AAGCTCGCCA CCCGCGCCGA CGCAGGCAGG CACGAAGGTG ACTTCAAGAA GATCGTGCAG GGGGTGAACG ACACGCTGGA CGCGGTGATA GGACCTCTGA ACGTGGCGGC GGAGTACGTG GACCGGATCT CGAAGGGGGA CATCCCGCCG AGAATCACCG ACAGCTACAA CGGCGACTTC AACGAGATCA AGAACAACGT GAACCAGTGC GTGGAGACCT TGAACGCGCT GATCTCGGAC ATGAACAACA TGTCAGCCCA GCACGACCTG GGGGACATCG ACGTGCAGAT CGAGGCCGAG AGCTACCAGG GGGTCTACCG GCAGATGGCG GCCGGGGTTA ACAACATGGT GAACGGCCAC ATCGCGGTGA AGAAGAAGGC GATGGCCTGC ATCGGCGAGT TCGGGCGCGG CAACTTCGAG GCGCCGCTGG AGCAGTTCCC CGGGAAGAAG GCCTTCATCA ACGACACCAT CGAGCAGGTG CGGGTGAACC TGAAGGCGCT GATCGCCGAC GCCGACATGC TGGCCGAGGC CGCCGTGGCC GGAAATCTCG CCACCCGCGC CGACGCAGGC AGGCACCAGG GGGACTTCAG GAAGATCGTG CAGGGGGTGA ACGACACGCT GGACGCGGTG ATCGGGCCGC TGAACGTGGC GGCGGAGTAC GTGGACCGGA TCTCGAAGGG GGACATCCCG CCGAGAATCA CCGACAGCTA CAACGGCGAC TTCAACGAGA TCAAGCAGAA CCTCAACAAC TGCATCGACA TCATGAACAA CCTCCTTGCC GAGGCGGACA AGGTGGTCGA GGCTGCGGCG GAGGGGAGGC TCGACGAACG GGCGAACCCG CAACTTTTTG TCGGCGGCTG GAAGGAACTC GTCGTCGGGG TGAACAACAT CGTCGCGAAC ATCGTAAATC CCCTAATGGT AACGGCAGAT TATGTCGATA AGGTGGCAAA AGGCGTGATC CCGCCGACGA TCGTGACGGA GTACAAGGGG CAGTACAACA TCATCAAGGA GAACCTGAAC GCTGTGGTGA AGATGATGAA CGAGCTCCTG GAGCAGACGG ACATCATCAT CAAGGCCGCG GCCGACGGCG ACCTGGACCG CCGCGCCGAC GCCTCGCTCT TCGTCGGGGG GTGGAACAAG CTGGTGGCAG GGGTGAACGA CACGGTGACC AACATCGTCA ACCCCCTCAT GGTGACGGCC GACCACGTGG ACCGCATCGC CAAGGGGGAC ATGCCACCGA CGATCACCGC CGAGTACAAG GGGCAGTACA ACCTGATCAA GAACAACCTG AACGTGCTGA TCGAGGCGAT CAACAAGATA ACGGACGCGG CCAAGGAGGT CTCCAACGGG AACCTCATGG TGAGCCTGAA GGAGCGTTCG GCGCAGGACG AGCTGATTCA GGCGCTCTCC GCCATGGTCG GGAAGATCAC CGAGGTGGTG ACCGAGGTGA AGCAGGCGGC CGACAACGTG GCCAGCGGGA GCGTGCAGCT CTCCGCCAAC GCCCAGTCCA TGTCGGAAGG GGCCTCGCAG CAGGCGGCGG CGGCCGAGGA AGCCTCCTCC TCCATGGAGG AGATGTCGGC CAACATCAGG CAGAACGCGG ACAACGCCAT GCAGACGGAG AAGATCGCGG TGAAGTCGTC CGCCGACGCT CAGGAAGGGG GCAAGGCGGT CGCCGAGACG GTGCAGGCGA TGAAGGACAT CGCCGGGAAG ATCTCCATCA TCGAGGAGAT CGCGCGGCAG ACCAACATGC TGGCACTGAA CGCGGCCATC GAGGCGGCGC GCGCGGGCGA GCACGGCAAA GGGTTCGCGG TGGTGGCCTC CGAGGTGAGA AAGCTCGCCG AGAGAAGCCA GGTGGCGGCG GGGGAGATCT CGGAGCTTTC GGTTTCCAGT GTCGAGGTGG CCGAGAAGGC CGGGGAGATG CTTTCCGGCA TCCTTCCCGA CATTCAGAAG ACGGCGGAGC TGGTGCAGGA GATCAACGCC TCCAGCAAGG AACAGGACAC CGGCGCCCAG CAGATCAACA AGGCGATCCA GCAGCTGGAC CAGGTGATCC AGCAAAACGC GTCCGCCAGC GAGGAGATGG CCTCCACCGC GGAGGAGCTC TCTTCCCAGT CGGCGCAGCT GCAGTCGACC ATAGCCTTCT TCAGGGTCGA CAACTCCGCC GGAGCGCATC AGGCGGCGAA GCATCTCACC AAGTCCGTCT CGAAGGGTGC CGCTAAGGTC AAGCAGGCGC CGGTAACCAA GCGCGCCATC GGGCACGACC TGGCCATGTC CGATCCGGAG CAGGACACGG AGTTCGAAAG GTTTTAA
|
Protein sequence | MRWFHDLKLG AKLMTGFISV ALIAAVIGYV GIMKLKAIDE ADIRLYEKIT IPISQLQKIS TSFQRIRINI RDMVAATDAG ERKSSGDGIV QLREEINKTS AEFEKTIMTE EERKLFAEFE RAHGVYGPRL DEMVQMAKAN RTQEAHALMM GDGAKGAKEE QAAIDALVEA KLAQAKLTAD GNKALAQSAT TFLLALLAAG VLLAVALGYF ITKLVQGQLG ADPKEVSEVA SRVAKGDMSV AIDLAGKKGD SVMAAMHSMV ESIKALVADT ALLSEAAVAG KLATRADAGR HEGDFKKIVQ GVNDTLDAVI GPLNVAAEYV DRISKGDIPP RITDSYNGDF NEIKNNVNQC VETLNALISD MNNMSAQHDL GDIDVQIEAE SYQGVYRQMA AGVNNMVNGH IAVKKKAMAC IGEFGRGNFE APLEQFPGKK AFINDTIEQV RVNLKALIAD ADMLAEAAVA GNLATRADAG RHQGDFRKIV QGVNDTLDAV IGPLNVAAEY VDRISKGDIP PRITDSYNGD FNEIKQNLNN CIDIMNNLLA EADKVVEAAA EGRLDERANP QLFVGGWKEL VVGVNNIVAN IVNPLMVTAD YVDKVAKGVI PPTIVTEYKG QYNIIKENLN AVVKMMNELL EQTDIIIKAA ADGDLDRRAD ASLFVGGWNK LVAGVNDTVT NIVNPLMVTA DHVDRIAKGD MPPTITAEYK GQYNLIKNNL NVLIEAINKI TDAAKEVSNG NLMVSLKERS AQDELIQALS AMVGKITEVV TEVKQAADNV ASGSVQLSAN AQSMSEGASQ QAAAAEEASS SMEEMSANIR QNADNAMQTE KIAVKSSADA QEGGKAVAET VQAMKDIAGK ISIIEEIARQ TNMLALNAAI EAARAGEHGK GFAVVASEVR KLAERSQVAA GEISELSVSS VEVAEKAGEM LSGILPDIQK TAELVQEINA SSKEQDTGAQ QINKAIQQLD QVIQQNASAS EEMASTAEEL SSQSAQLQST IAFFRVDNSA GAHQAAKHLT KSVSKGAAKV KQAPVTKRAI GHDLAMSDPE QDTEFERF
|
| |