Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0039 |
Symbol | |
ID | 8135338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 51174 |
End bp | 52721 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644867656 |
Product | methyl-accepting chemotaxis sensory transducer |
Protein accession | YP_003019884 |
Protein GI | 253698695 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | [TIGR00058] hemerythrin family non-heme iron proteins [TIGR02481] hemerythrin-like metal-binding domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 0.0381098 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTTAT TCTCTCGCTT CATAACGATC AACTTGATCG CTGTCGCCGC CACGATTGGC GCAGCCGTTG CCATGGGCAG CGGAGTTGGT CTGTTTGCCG CCGGCGCCGT GATAATCCTC CTCTCCGCCG TGGCCTACGG TCTGTCGTCA CGTGGGGAAA CCAAGGCGTT GGAGGAAATG GCCGTCGCCC TGGAAAGCGC CGCGGCTGGC GATCTTTCCT ACCGGGTCAC GGCCGGCGGC AACGGCGAGA TAGGACGCAT AGGGGCCGCA TTCAACACCA TGATGGGCGA CTGGAACAAG ACCATGCACA AGTTCTTCAC CGTCACCGAT CTGGTACGCG ATTCCGTCGC CCTGGTGAGC GCGACTAACG ACGCCATGGC TGCCGCGGCC GAGGACGTCG CGCTGCAGGC CTCCACCATA GCTACCGCCA GCGAAGAGAT GTCCGCCACC TCCGGCGACA TCGCGCGCAA CTGCCTCTAC GCGGCCGAGA ACGCCCACAG GGCCACCGAG GAGACCACCT CCGGCGCGGA GATCGTCAGT AACAGCGCAA GGCTCATGGA GAACATCGCC CAGCGCGTGA TGGCCACCTC CAGCTCCGTC GCAGGCCTTG GCGAGCGTTC CGACCAGATC GGCGCCATAG CCGGGACCAT CGAGGACATC GCCGACCAGA CGAACCTGCT CGCCTTGAAC GCTGCCATCG AGGCCGCCCG CGCCGGCGAG ACCGGCCGTG GCTTCGCCGT CGTCGCCGAC GAGGTGCGCG CCCTTGCCGA GCGGACCACC CGCGCCACCA AAGAGATCGA CGCCATGATC AAGTCGATCC AGACCGAGAC CAGAGCCGCC GTCGGCTCCA TGGGAGAAGG GGTCGAGCAG GTGAACCAGG GGACCGCCGA AACCTGCCGC TCCGGCGAGG CGCTGAACGG CATCCTCAGA ATGATCAACG ACCTGACCAT GCAGCTCTCC CAGATCGCCA CGGCGGCCGA GGAGCAGACG GCGACCACCC ACGAGATCAC CAGCAACATC CAGATGATCA CCAACGTGGT CAACAGCAAC GTGGAAAGCG CCCGCGACAC CAGGGCGGCC ACCGGGAAGC TGGTCCAGCA GGTGGACGAG CTGCACCAAC TGGTGTCGCA CTTCCAGCTC TCCGACGCCA TGGTCTGGGA CCAGAGCTTC GCCACCAGCA TCGGCACCTT CGACGATCAG CACAAAAAGC TCTTCGCCAT GGTGAACGAA CTGAACCAGG CCATGCAGCA CAAGCGGAGC AAGGAGGCGA TCGGATCGGT CTTGAACCGC CTGATCGAGT ACACCGGCAG CCACTTCGCC GCCGAGGAAG AGGTCTTCCG CAAGACCGGC TACCCCGAGG AAGAAGCCCA CGTCAGGGCG CACCGGGACC TGGTGCAGCA GGTAGTGGCG CTGCAGCAGA AATTCAACGC CGGCGAGACC CTCCTTACCC ACGACGTCAT CGAATTCCTG CAGAACTGGC TGGTGAAGCA CATCAAGGGG ACCGACGTCC GCTACACCTC CCACCTGACC AAGGCGGGGG TCCGTTGA
|
Protein sequence | MSLFSRFITI NLIAVAATIG AAVAMGSGVG LFAAGAVIIL LSAVAYGLSS RGETKALEEM AVALESAAAG DLSYRVTAGG NGEIGRIGAA FNTMMGDWNK TMHKFFTVTD LVRDSVALVS ATNDAMAAAA EDVALQASTI ATASEEMSAT SGDIARNCLY AAENAHRATE ETTSGAEIVS NSARLMENIA QRVMATSSSV AGLGERSDQI GAIAGTIEDI ADQTNLLALN AAIEAARAGE TGRGFAVVAD EVRALAERTT RATKEIDAMI KSIQTETRAA VGSMGEGVEQ VNQGTAETCR SGEALNGILR MINDLTMQLS QIATAAEEQT ATTHEITSNI QMITNVVNSN VESARDTRAA TGKLVQQVDE LHQLVSHFQL SDAMVWDQSF ATSIGTFDDQ HKKLFAMVNE LNQAMQHKRS KEAIGSVLNR LIEYTGSHFA AEEEVFRKTG YPEEEAHVRA HRDLVQQVVA LQQKFNAGET LLTHDVIEFL QNWLVKHIKG TDVRYTSHLT KAGVR
|
| |