Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3968 |
Symbol | |
ID | 8139342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4553009 |
End bp | 4554538 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644871584 |
Product | methyl-accepting chemotaxis sensory transducer |
Protein accession | YP_003023742 |
Protein GI | 253702553 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.00000000439597 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTAAGG CGGTCAACCA GGCGGTGAAG GAGTTGTCCG AGGCGATGAT GGCGGGGAAG CTCGACGTGC GCGCGGACCT CAAGGGGCTC AAGGGGGAAG ACGCGGAAAC GGTCCGGCTC ATAAACGGCA TGATCGACGC CCTGATCGCC CCCATGAGGC TCGCCGGCGG CGCGCTGCGG GAGATCGCCC ACGGCAACCT CCCCCCTTTC GTCATCGACG AGTACCAGGG GGAGTTCCAC CAGATCAAGC AGGACATAAA CACCCTTTTG GCCATCCTCT ACGGCATCCA CGCCGAGGCG GTGCACCTGA CCAACAGCAT CGGCGAGGGG AAGCTGAAGA CCCGGGGGAA CGACTGGGAT TACCAGGGGG TCTGGAGGGA GCTGATCGCG GGGTTCAACG GGACGCTCGA CGCGGTCATC GCCCCTATCC GCGAGGCGGG AGAGGTGCTG GAGCGCCTGG CGCGCTACGA CCTGAAAAGC AGGATGAGCG GGAAGTACCG CGGCGAGCAC GCCGCGATCC GCAAGGCGAT GAACTCGACG GCGGTTGCGT TGAACGACGC CATAGCCCAG GTCGGCGAGG CGGTAGGGCT CGTCTCCGAC GTGGGGCGGC GCATTACCAG CGTCAGCTCC TCCTTCGCCC TTGGGGCCAG CGAACAGAGC AAGGAGCTGG GGGAGACCTC GGTAAGCCTG ACGCAACTTT CCCGGAGCGC CGCCCAGAAC GCGCGGAGGT CGAAGGAGGC TCATGCCGAC GCCAAGAAGG CGACCGACGC CATGCGCCTG GCCAAGGAGG CGATGGGGCG GATGCTGGCG TCCATGGACG AGATCAGCGC TGCTGCCGAA AGCACCGTCT CCATAGCCGG GGAAATAGAC GGCATCGCCC AGGAGACCGG CGTCCTGGCG TGGAGCACCG TCGAGAAGGC GGCCCGCATG AGAATATCCG CGGGTGGGTT CGGTGTCGTG GCCCAGGAGA TCCGCAAGCT TTCCCGGCAG TGCTCCCAGA CGGCGAACTC CATGAAGGAG TTCGAGAAGA AGCTGGGTGC GGAGCACCAG GAGGAATTCG GCGCCCTGAT CGCGAGCCTG TTGCAGATCG CCAGATTCTC GAACCTGTTG GGGGTGAACG CCGCCGTCGA AGCGGCCCAC GTCGAGGGAG CCGGCAACGA GTTCCAGGCG ATGACCGACG AGATACACAC CCTGGCGGTC AGGTCGGCCG ACGCGGCGAA AAGTACCGGG ACGCTCACCA AGTCCTCCCA GGACCTGGCG CGGCAAGGGG TGGTGCTTTC GCGCGAGATC GACCTGGAGC TGGAAGGTGC TGTGGAGGCG GCGCAGGCGA TAGCCCGTTT CGCCGACGAA ATCCTGGCCG GCATCGAGGG GCAGACGGCC AGGATCGAGG AGATAAACGC GAGGGCGGTC CACATAACCG GTGTCACCGA GAAGAATGCC TCCGGCGCGG CCGACTCGCT CGTGGCGGCG CAGGAGCTAG AGGCGCAGGT CGCCAAGCTC TCCACCATGG TGAACCGGTT CAGCTTCTGA
|
Protein sequence | MSKAVNQAVK ELSEAMMAGK LDVRADLKGL KGEDAETVRL INGMIDALIA PMRLAGGALR EIAHGNLPPF VIDEYQGEFH QIKQDINTLL AILYGIHAEA VHLTNSIGEG KLKTRGNDWD YQGVWRELIA GFNGTLDAVI APIREAGEVL ERLARYDLKS RMSGKYRGEH AAIRKAMNST AVALNDAIAQ VGEAVGLVSD VGRRITSVSS SFALGASEQS KELGETSVSL TQLSRSAAQN ARRSKEAHAD AKKATDAMRL AKEAMGRMLA SMDEISAAAE STVSIAGEID GIAQETGVLA WSTVEKAARM RISAGGFGVV AQEIRKLSRQ CSQTANSMKE FEKKLGAEHQ EEFGALIASL LQIARFSNLL GVNAAVEAAH VEGAGNEFQA MTDEIHTLAV RSADAAKSTG TLTKSSQDLA RQGVVLSREI DLELEGAVEA AQAIARFADE ILAGIEGQTA RIEEINARAV HITGVTEKNA SGAADSLVAA QELEAQVAKL STMVNRFSF
|
| |