Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_3303 |
Symbol | mcpG |
ID | 3722005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007494 |
Strand | + |
Start bp | 363644 |
End bp | 365248 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640072972 |
Product | putative methyl accepting chemotaxis protein, McpG |
Protein accession | YP_354812 |
Protein GI | 77465309 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.627808 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGACTCT CGATCCGCAA CAAGCTGGGG GCGAGCTTTC TCGTCGTCTT CGCCATGGCG GGCACGACCA ACCTTTATGC GCTCTACACG ATCAACAGGC TGAACGACGA CATGGCCATC CTCGTCACCC GCGATTTCGA GCGCGTGCGG CTGGCCGAGC GGCTGAATGC CGAGCAGCTG CGCATGAAAA GCGCGATTCG CGACCACATG CTCTCGCCCC CCGCCGGCAA GGCTGCCGAG AAGGCAGAGG TGGCTGCGGC GCGGGCGAAT ATGCGCGAAA GCTTCGATTC GCTCGTGGCG ACGTCGCCGG AGGCGGACCT GGCGGCGCTG GCCGAGTACG AGGATCTCTG GACCCAGTCG ATCCGCGTCA ACGACAAGGC CCTGCTCCTG TCGGAGGAGG GCCGGACCGA AGAGGCGCGC AAGCTCCTCT GGGGCGACTA TTTCACCCGC CTGCAGGCCA GCCGGATGGA GATCATGGAG GGGCTGCGCG ACCGGAGCCT CGTCGCGCTG AAGGCGACCC AGGCCGAGGC GGCCGAGGCG CGTCGCCTGA CGGTGCTGCT GATGGCGAGC TCCATCGCGG CGGGAACTCT GATCGCCTTC CTCGCGGCGG GCTGGCTGAT CCTCTCGATC GGGCGCGGCC TCGGCCGGGC GCTGGCGCTG GCGCGGCGGG TGGCGGACGG GGACCTGACC GAGACGGCCG TGCTGCGGGG CAATGACGAG ATCACCGACC TTCTCCGCGC GCTCAATGCG ATGGTGGTGC GGCTGCGCGA TGTGGTGGGC CGCGTCACCG CCGCCGTGGG CGACGTGGCC TCGGGCAGTG CCGAGGTCGC CGCCACGTCC GAGCAGTTGA GCCAGGGCGC GAGCGAACAG GCGGCCGCGA CCGTGCAGGC CTCGGCCTCC GTCGAGGAAA TCGCCGCAAC TGTCCGGCAG AGTGCGGAAA ATTCCGGCCA GACCGAGCGG ATGGCGCGGT CTTCGGCCGA AGCGGCCCGG CAGAGCGGCG CGGCCGTCTC GGATGCCGTA TCGGCGATGC GCGGCATCGC CGAGCGGATC CATGTCGTGC AGGAGATCGC CCGCCAGACG GACCTTCTGG CGCTGAATGC CGCGGTCGAG GCGGCGCGGG CGGGCGAGCA CGGGCGGGGC TTTGCCGTGG TGGCCACGGA GGTCCGCCGG CTGGCGGAGC GGAGCCAGGC CGCCGCGGCC GAGATCTCGG ATCTGTCCTC CGCCACCGCG CGCTCGGCGG TGGCCGCGGG CGGGATGATC GAAAACCTCG TGCCCGACAT CGCGCGGACG GCCGACCTTG TTTCCGACAT TTCTTCGGCT TCGCAAGAGC TTGCGGCGGG TGCTTCACAA GTGAGCCAGG CCCTTCACCA GCTCGATACG GTGACGCAGC AGAACACGGC CGCGGCGAGC GAGCTTTCGG GCCGGGCCGC GGATCTGTCG GAACGGTCGG AGGATCTGCG CGCGGCCGTC AGCTACTTCC GCACCGGCGG TCCCGCGGTG CAGGCGCCGC TCAAGGCCGA GAGAACGACC CTGATGCAGA GGGACTCCGA TGCACGGCTT GAGGAGGATC TGGACGCGGC CTTCGCGCGA GCGCGCGTCG CGTGA
|
Protein sequence | MRLSIRNKLG ASFLVVFAMA GTTNLYALYT INRLNDDMAI LVTRDFERVR LAERLNAEQL RMKSAIRDHM LSPPAGKAAE KAEVAAARAN MRESFDSLVA TSPEADLAAL AEYEDLWTQS IRVNDKALLL SEEGRTEEAR KLLWGDYFTR LQASRMEIME GLRDRSLVAL KATQAEAAEA RRLTVLLMAS SIAAGTLIAF LAAGWLILSI GRGLGRALAL ARRVADGDLT ETAVLRGNDE ITDLLRALNA MVVRLRDVVG RVTAAVGDVA SGSAEVAATS EQLSQGASEQ AAATVQASAS VEEIAATVRQ SAENSGQTER MARSSAEAAR QSGAAVSDAV SAMRGIAERI HVVQEIARQT DLLALNAAVE AARAGEHGRG FAVVATEVRR LAERSQAAAA EISDLSSATA RSAVAAGGMI ENLVPDIART ADLVSDISSA SQELAAGASQ VSQALHQLDT VTQQNTAAAS ELSGRAADLS ERSEDLRAAV SYFRTGGPAV QAPLKAERTT LMQRDSDARL EEDLDAAFAR ARVA
|
| |