Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU2372 |
Symbol | |
ID | 2688091 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 2598947 |
End bp | 2601130 |
Gene Length | 2184 bp |
Protein Length | 727 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637127063 |
Product | methyl-accepting chemotaxis protein, putative |
Protein accession | NP_953419 |
Protein GI | 39997468 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.204529 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCAAGG ATATGAAACT CGGTTACCGG CTCATAGGCT CCTTTGCCAT CATGGCCGCC ATCGTTGCTG TCACCGGCTT CATCGGCATC CGTTCCATCG GGATGGTTGG AAACAGGGTG TCGGATCTCA TGCAGACCCG CGCGGATCAG CAGAAGCTGG CGTTGCAGCT CCAGGCCGCC GAGCGCACCA GCCGCGTCGC CCTGCTGGAG GCGATGATGG GGCACGTGGA TACCAGGATC CTGGCCGCGA ACGTGGAGTC CTATCGCAAG AACCGCGACA TCTTCAGACG CTACAGCAAC GCGCTGCTCA AGGGCGATCC GGCCCTGGGC ATCCGGCCGG ACTCCGTCGA TACCGTCATG GAAGAGCATG CCAAGGCGCT CCTCGACACC TGGGCCGAGT ATGAAAAGGT GGCGGACAGG ATCATTGCCT ACAAGTCCGG GGTCCTGAGC GGTTCGGTGT CTCCTTCGGT GCTCGTCGAG ACCAGGCTTA TCTCCGAGTT GTCCGGGGCA AGCGAATTCG TGGCCCGCGA CATCGACGAC CTCATCGAGA CGGTCAAGGG GCTGATGCAG GTGGTGGGGC AGGAAACGCG CCAGATCCGG GCCTCGGTCA GCATTACCTT CGTCATTGTC ATCATCGGGG CTGCCGTGCT GGCCTTCGTT TTCGGCGTCG TGGCCACCCG CAACATCATC CGGCGGGTGA ACATGATGGT GACGGCGCTG AACAAGGGAG CGGAAGGGGA TCTGACCGTC CGGGTGACCA CCGATGCCAC GGACGAGCTG TCGCTGCTCG GCCGGGATTT CAACATCATG CTGGAAATGC TGGGAGAGCT GGTCCGCAAG GTGAACCGCT CCCTGGTGGA GGTGGGGCAG GTCTCCGCCA ACATCTTCGA GGCTTCGCGC CGGGTCATGG CCGCGGCGGA AGTCCAGGCC GAGGGAGTGT CCCTGACCTC GTCGGCCGTT GCCGAGATCA ATACTTCCAT CAAGGAGGTC TCCCGCGGCG TCGACGGCCT TTCCCTTTCC GCCTCGGAAA CCTCGTCGTC GATCCTCGAG ATGGCCGCCA GCATCGAAGA GGTGGCCGTG AACGTGGACT CACTGGCCCA GGCCGTTGAT GAGGTAAGCT CCTCGGTGAT GGAGATGGCC GCGTCCATCA AGCAGATCGC CAACAGCGTC GTGAGCCTCC AGGATGTGAC CACCACCACT GCCTCTTCCG TGGCCGAGAT GGATAGCTCG ATCAGGCAGG TCGAGAAGAA TGCCATGGAG ACCGCATCCA TTTCAGAGGG GGTGCGGCGG GACGCGGAGA TGGGCAAGGT CTCCGTCGAA GCGACCATCG CCGGTATCAA CGAGATCAAA CGCTCTTCCC GGATCACCTC GGAAGTGATC GAAACCCTTT CCGTCCGGGC CACCGACATC GGCGCGATTC TGTCGGTCAT CGACGAAGTG GCCGAGCAGA CCAACCTGCT GGCCCTGAAC GCCGCCATCA TTGCGGCCCA GGCGGGCGAA CACGGCAAGG GTTTCGCCGT GGTGGCCGAC GAGATCAAGG AGCTGGCCGA GCGGACCACC AGCTCCACTC GCGAGATCGC CCAGCTGATC AAAGGGGTCC AGGATGAAAC CGCCCGGGCC GTGGAGGCTA TCGAGCTGGC CGAGAAGAGC ATCGCCGACG GCGAGGCCCT GTCCCAGAAG TCGGGCGAAG CCCTGGCCAA GATCGTTACC GGCGTCCAGG GAGCCACGGC CCAGGTGGAG AGCATTGCCC GGGCCACCAT GGAGCAGGCC AAGGGAAGCC AGATGATCCG TAGCGCCATG GAGCGGGTTT CGGACATGAT CGCCCAGGTG GCCGGCGCCA CCCGGGAGCA GGGCAAGGGG AGCGACATGA TCATGGCCGC CGCGGAGCGG ATGAAAGGGC TTACCTCCCA GGTCAGGACC TCAACCCGGG AGCAGAGCAA GGTGGGTGCG TTCATCGCCC GTTCCACCGA GAACATCACC GACATGATTC AGCAGATCAA GCGGGCCTGC GACGAGCAGT CGCGGGGCAG CGATCAGATC ATCCGCGCCG TGGAGGACAT CCAGGAGTCG ACCTCGACCA ATCTCGGCTC GGCCCGGATG ATGGACGATG CGGTTTCACG GCTGTCCCGC CAGTTGGAAG CCCTTGAGCG GGGTATGAGC AGTTTCAAGG TGGAGAATCG GTAA
|
Protein sequence | MFKDMKLGYR LIGSFAIMAA IVAVTGFIGI RSIGMVGNRV SDLMQTRADQ QKLALQLQAA ERTSRVALLE AMMGHVDTRI LAANVESYRK NRDIFRRYSN ALLKGDPALG IRPDSVDTVM EEHAKALLDT WAEYEKVADR IIAYKSGVLS GSVSPSVLVE TRLISELSGA SEFVARDIDD LIETVKGLMQ VVGQETRQIR ASVSITFVIV IIGAAVLAFV FGVVATRNII RRVNMMVTAL NKGAEGDLTV RVTTDATDEL SLLGRDFNIM LEMLGELVRK VNRSLVEVGQ VSANIFEASR RVMAAAEVQA EGVSLTSSAV AEINTSIKEV SRGVDGLSLS ASETSSSILE MAASIEEVAV NVDSLAQAVD EVSSSVMEMA ASIKQIANSV VSLQDVTTTT ASSVAEMDSS IRQVEKNAME TASISEGVRR DAEMGKVSVE ATIAGINEIK RSSRITSEVI ETLSVRATDI GAILSVIDEV AEQTNLLALN AAIIAAQAGE HGKGFAVVAD EIKELAERTT SSTREIAQLI KGVQDETARA VEAIELAEKS IADGEALSQK SGEALAKIVT GVQGATAQVE SIARATMEQA KGSQMIRSAM ERVSDMIAQV AGATREQGKG SDMIMAAAER MKGLTSQVRT STREQSKVGA FIARSTENIT DMIQQIKRAC DEQSRGSDQI IRAVEDIQES TSTNLGSARM MDDAVSRLSR QLEALERGMS SFKVENR
|
| |