Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3453 |
Symbol | |
ID | 8138820 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 3989571 |
End bp | 3991169 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644871068 |
Product | methyl-accepting chemotaxis sensory transducer |
Protein accession | YP_003023233 |
Protein GI | 253702044 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 0.0235494 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATAA GCAAGAAGAT TCTGATCAGC AACGTGGGCA TGGTGCTCAT CGCGACCATT ACCACTTCGG CTATTTCGCT CTACGTCACC AAGAAGGAGA TCACGCGCCA GGTCAACGTT TCACTCGGCT CGCGGATCAA TGCTTTCCGC GAACTAATCA GCAGGAACGA TGGCAGCATA CTGCTGGTGG ACGGCAAGCT GCAGGCGAAC GGGGTGACGC TGGACGGCGA CAACGCCCTT ACCGACAGGA TGAAAGAGAT CTTCGGGGGG GAGGCGACCA TCTTCAGGTA CGACGTCCGG GTCGCGACCA CGATCAAGAA GGAAGACGGC GCCCGCGCGG TCGGCACCAG GCTCCAGGGG CCCGCACGCG AGGCGGTCAT CGATCGCGCG GCTCCCTACC AGGGGGAGGC AAGCATTCTG GGGGTTTCGC ACTTCGCCTC ATATCTCCCC CTGAAAGACG GCAACGGCAA GGTGATCGGC GCTCTCTTCG TGGGCGAGAA AAAGTCCGAG TACCTTGCGG TTTTCGACCG CCTGAAATAC CTGATCCTCG CCCTGTCCGC ACTGCTCGGT GCAGTCCTGG CTCTGGCTGG ATACCTGGCT CTGCACAAGG CGCTGATGCC GTTGCGAGAG TTGATCAGGA CTTTGCAGGA TGTAGCGGAA GGGGACGGCG ACCTCACCCA CCGCCTAAAC GAATCGACCG ATGAGATAGG TACCGCCAGC CGCTATTTCA ACCGATTCAT CGACCGGGTC CATACAATCG TGCAGACGGT GGCCGACAAC GCGAACTCCG TGGCAAGCGC GAGTTCCGAG CTGCACTCCA GCACCGAGAG GCTTGCCGAC ACCACTGAGG CCGTAGCCGT GCAGACAGAA ACCGTTTCGA CCGCAGGGGA GGAGATGGCT GCCACTTCCG CGGACATCTC CAAGAACTGC CTGAGCGCGG TCGACAGCGC CCAGCGAGCC TGCGAGATGG CGCGCTATGG CTCCGCCGAC GTCGAGCGCA CCATCGACGG AATGAAGCTC ATCAACGAGA AGGTGCGAGC CACCTCTGAG AGCGTCGGCA ATCTGGGGGT AAAGTCGGAA CAGATCGGCG ACATCATCGG CACCATCCAG GACATCGCGG ACCAGACCAA CCTCCTCGCC TTGAACGCGG CGATAGAGGC GGCTCGCGCC GGGGAGCAGG GGCGCGGCTT TGCGGTCGTC GCAGACGAGG TGCGCCGGCT AGCCGAAAGG ACCACCAGCG CCACCAAGGA GATCGAGGTC AACATAAGGT CGATCCAGGA AGAGACCGCC CGGGCGGTGC AAGTAATGCA CGAAAGCGCC AGGGAAGCTG CCAAGGGGGC CGAAGATTCC ATCAAATCCG GTGAGAGTCT GGAGGAAATT CTGAAACAGG TCAACGAGGT GACGCTGCAG ATAGGGCAGA TCGCAACGGC TGCCGAGGAG CAGAGCGCGA CCAGCCGCGA GATCAGCAAT AACGTGCACC AGATCACAGG GATCATTCAG GGCGCAGCCA GGGACAACCG TGCATCCATG TCGACTGCGG ACGAGTTGAA CCGGCTCTCG GAGAGTTTGA AGCTGCAGAT CTGCAGATTC AGGTACTAA
|
Protein sequence | MNISKKILIS NVGMVLIATI TTSAISLYVT KKEITRQVNV SLGSRINAFR ELISRNDGSI LLVDGKLQAN GVTLDGDNAL TDRMKEIFGG EATIFRYDVR VATTIKKEDG ARAVGTRLQG PAREAVIDRA APYQGEASIL GVSHFASYLP LKDGNGKVIG ALFVGEKKSE YLAVFDRLKY LILALSALLG AVLALAGYLA LHKALMPLRE LIRTLQDVAE GDGDLTHRLN ESTDEIGTAS RYFNRFIDRV HTIVQTVADN ANSVASASSE LHSSTERLAD TTEAVAVQTE TVSTAGEEMA ATSADISKNC LSAVDSAQRA CEMARYGSAD VERTIDGMKL INEKVRATSE SVGNLGVKSE QIGDIIGTIQ DIADQTNLLA LNAAIEAARA GEQGRGFAVV ADEVRRLAER TTSATKEIEV NIRSIQEETA RAVQVMHESA REAAKGAEDS IKSGESLEEI LKQVNEVTLQ IGQIATAAEE QSATSREISN NVHQITGIIQ GAARDNRASM STADELNRLS ESLKLQICRF RY
|
| |