Gene Information |
Locus tag | Rsph17029_3810 |
Symbol | |
ID | 4898561 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 941938 |
End bp | 943623 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640114414 |
Product | methyl-accepting chemotaxis sensory transducer |
Protein accession | YP_001045662 |
Protein GI | 126464549 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
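The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 941938
end_bp = 943623
gene_length_bp = 1686
protein_length_aa = 561


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions the span 941938 to 943623 covers 1686 bp, which is 562 codons, or 561 amino acids once the stop codon is excluded.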
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.296432 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGATGAAGG CCTCCATGAA ACTGACCATA AAGATCAAGC TCGCCATGAC CTTCCTTCTG GTGTTCCTGC TCATGGGGGC AGGCACCATC TTCGGGCTCC TGGACCTGCG CTCCGCGAAC CAGACGCTGC GGTCCATCGT GGACGTCCAG GCCGCCCGGG TCGAAGCTGC CAGCCGGCTC GAGATCCAGC AGAGCGAGTT CAACGTGGTG CTGCGCGATT ATGTGACCGC CCCGACGGCG ACCGAGCGCG CCAAGCTCAA GCAGGACATC ACCCGGATCC GCGCCGAAAT GAGTGCCAGC ATCGAGCGGC TCCAGACACT CGCGGATGCC GAGGGAGAGA AGCTGATCGC CGCCTATGCC GATCAGCGCA AGGCCGCCGC CGCCTTGAAC AATCGGGTCT TCGCGCTGGC GGACGCGGGC GATGTCGCCG GCGCCTCCCG CCTGCTGGCG GGCGAGTCCC GCGCGGGCAT GGTGAAGCTC GCGGCGAACC TCGAGACCTT CCGCACGCTC TACAAGGACC AGATGACCGA GGCCACGGCC GAAGCGGATC GCGAGCTGAC GGAGAGCCTC GTCAATCTGT CGGCGCTCGC GCTGGCGGGC ATTCTCGCCG GAACCATCGC GGCCACCTTC CTCATCCTCT CCATCGGGCG CAGGCTCGGC CGGGCGCTGG CGCTGTCGCA GCGCGTCGTG CAGGGGGATC TGACCACCCT GGCCGAGGAG CGCGGCTCGG ACGAGATCGC GCAGCTCCTC AAGGCCAACA ACGCCATGAT CGTGAAACTG CGCGAGGTTC TGGGGCGGGT GTGGCTCGCC ACCGACCAGG TGGCGGCGAA CAGCCAGACC ATGGCCGCCA CCTCCGAACA GCTGTCGCAG GGCAGCAGCG AACAGGCGGC CTCCACCGAA GAGGCCTCGG CCTCGGTCGA GGAGATGGCC GCCAACATCC GCCAGACCGC CGACAGTGCG GGTGAGACCG AACGGATCGC CGCAAAATCG GCCGAAGATG CGCGGGCCTC GGGCGATGCC GTGCGCGAGG CGGTGGCAGC GATGGCCTCC ATCGCCGACC GCATCCTCAT CGTGCAGGAG ATCGCGCGCC AGACCGACCT TCTCGCCCTC AATGCCGCGG TCGAGGCCGC GCGGGCGGGC GAGCACGGGC GCGGTTTCGC CGTCGTGGCC AGCGAGGTCC GCAAGCTCGC CGAGCGCAGC CAGGCGGCGG CGGCCGAGAT TTCCGCGCTG TCGGCGCGCA CGTCCGGCGT GGCGGCCACG GCGGGCGAGA TGCTCCAGCG GCTCGTGCCG GACATCGAGC GCACGTCCGG CCTCGTCTCC TCGATCTCCG TCGCCTCGCG CGAACTGTCG ACGGGGGCGC AGCAGGTGGC GCTCGCCATC CAGCAGCTCG ATCAGGTCAC CCAGCAGAAC AGCACCGCCG CTGAAGCTCT GGCGAGCGGG GCGGGTGAAC TCTCCGTCGA GGCGGACCAG CTGAAGGAGG CGGTGGGCTT CTTCCGCACC GGAGAGACAC AGGCCCCGGT CGCGCCGCAG ACGCGCGCGG TTGTGCCGCA CGCACGGCCT TCGGCGCGCG CCCTGCCTCA GCCGTCCCTG CGGCCGGTGC GGTCCTCCAA GGGCTTCGAC TTCGACATCG GCGAGAGCGA GTTCGACGAG CTCGACGCGG CCTTCCAGCG CGCGGGCACC CGCTGA
Protein sequence | MMKASMKLTI KIKLAMTFLL VFLLMGAGTI FGLLDLRSAN QTLRSIVDVQ AARVEAASRL EIQQSEFNVV LRDYVTAPTA TERAKLKQDI TRIRAEMSAS IERLQTLADA EGEKLIAAYA DQRKAAAALN NRVFALADAG DVAGASRLLA GESRAGMVKL AANLETFRTL YKDQMTEATA EADRELTESL VNLSALALAG ILAGTIAATF LILSIGRRLG RALALSQRVV QGDLTTLAEE RGSDEIAQLL KANNAMIVKL REVLGRVWLA TDQVAANSQT MAATSEQLSQ GSSEQAASTE EASASVEEMA ANIRQTADSA GETERIAAKS AEDARASGDA VREAVAAMAS IADRILIVQE IARQTDLLAL NAAVEAARAG EHGRGFAVVA SEVRKLAERS QAAAAEISAL SARTSGVAAT AGEMLQRLVP DIERTSGLVS SISVASRELS TGAQQVALAI QQLDQVTQQN STAAEALASG AGELSVEADQ LKEAVGFFRT GETQAPVAPQ TRAVVPHARP SARALPQPSL RPVRSSKGFD FDIGESEFDE LDAAFQRAGT R
| |
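The sequences above are printed in space-separated blocks of ten characters, so the spaces have to be stripped before any length or composition check. The following is a minimal sketch of how the GC content field could be recomputed from the gene sequence; the helper names are hypothetical, and only the first two blocks of the gene sequence are used here as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence listed above.
prefix = "ATGATGAAGG CCTCCATGAA"
print(len(clean_sequence(prefix)))
print(gc_percent(prefix))
```

Passing the full gene sequence to `gc_percent` in place of the prefix gives a value that can be compared with the 70% reported in the Gene Information table. The short prefix on its own is not representative of the whole gene.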