Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_0965 |
Symbol | |
ID | 4895264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 994885 |
End bp | 996462 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640111551 |
Product | hypothetical protein |
Protein accession | YP_001042848 |
Protein GI | 126461734 |
COG category | [N] Cell motility |
COG ID | [COG1360] Flagellar motor protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.238986 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACTGG TGCGGCGCGG CGGACGGCGC GAGGTCAACA TCTGGCCGGG CTTCGTCGAT GCGATGACCG CGCTGCTCCT TGTCCTCATG TTCGTGCTGA CGATCTTTCT CGCGATCCAG TCCCTCCTGC GCGAGACCAT CACGACCCAG GACAGCGAAC TCGACAGCCT CACCGGCCAG CTGGCCGATC TGGCCGATGC GCTGGGGCTC GAGCGGCAGA AGAGTGCGGA CCTCACCGCC GCAGTCGATG CGGCGCGTGC CGAGGGCGAG CGGCAGGCCG CGACCATCGC CACCCTGACC GCCACCCTCT CCGCCCGCGA GGGCGAGCTG GCCGCCGCGC AGGGCCGCAT CGCCTCGTTC GAAGAACAGG TGGCGACCCT TCTGGCGGAC CGTGACCGCG CCCGGGGTGA GACCGCCCGG CTCACCGCCT CGGTCGAGGA ACTGGAAGCG GCCCGCAAGA CGCTCCTCTC CGAGCAGGAG GCGGCACAGC TTGCGCTGGC GCAGGCGCGC TCCGAGATCG ACGCGCAGAC CGAGGCCGCC CGCCTCGCCG CCGCCCGCCG CGAGGCGCTC GATGCGCTGG TGGCCGATCT GCGCGGCAAG GTCAGCGAGA CCGAGAAGAA GCTCTCGGAC GAGGAGGCCG CCCGGCTGGC CGACGCCGCG GCAGCCGAGG CCCTGCGCGA AAGGCTCAAG AATTCCGAGA CCGAACTGAC GGCCATGACG CTCGCGCTCG AGGAGCAGCG GCGGAAGGCC GAAGAGACGC TCACCCTGCT CGCCGCGGCC AAGACCGATG CGGCGCGGGC GGTGAGCGAG GCCGATCAGC GCGCCGCGGC GCTTGCCGCC GCACGCGAGG CGCTGAATGC GCGCGAAGGC GAGGGGGCCG AGGCCGCCCG TCGCGTGGCG CTCCTGAACG AGCAGGTCGC GGCGCTGCGG GCGCAGCTCG GCTCGCTGCA GGGCCTCCTC GAGGCCGCCG AAGCGCAGGA CGCGGCCAAC AAGGTGCAGC TCCAGAGCCT CGGGACCCAG CTCAACTCCG CTCTGGCGCA GGTCGCCTCC GAGCAGAAGC GCCGCGCCGA ACTGGAAGAG GCCGAGCGGC TGCGCCTCGA GGCCGAAAAC AAGGATCTGG CGCGCTTCCG CTCCGAATTC TTCGGCCAGC TGCGCCAGGT CCTCGCGGGG CGCGAGGGCG TCCGCGTGGT GGGCGACCGC TTCGTCTTCT CCTCCGAAGT CCTGTTCGAG CCGGGCTCGG CCGAGCTCGC GCCCGAGGGC CGCGCGCAGA TCTCGGGGGT GGTGCAGACG CTGAACGAGA TTCGCACGCA GATCCCCGAG GGCATCGACT GGATCATCCG CGTGGACGGG CACACGGACA ACGTGCCCCT GTCCGGCTTC GGCGCCTTCC GCGACAACTG GGAGCTGAGC TCCGCCCGCG CGCTCTCCGT CGTGCGCTTC ATGCAGCAGA GCCTGGGCTT CCCGCCCTCC CGCCTCGCCG CCACGGGCTT CGGCGAATAC CGCCCCGTCA CGCCGGGCGA CAGTCCCGAT GCGCGGGCGC AGAACCGCCG GATCGAACTG AAGCTGACCG AGCGCTAG
|
Protein sequence | MGLVRRGGRR EVNIWPGFVD AMTALLLVLM FVLTIFLAIQ SLLRETITTQ DSELDSLTGQ LADLADALGL ERQKSADLTA AVDAARAEGE RQAATIATLT ATLSAREGEL AAAQGRIASF EEQVATLLAD RDRARGETAR LTASVEELEA ARKTLLSEQE AAQLALAQAR SEIDAQTEAA RLAAARREAL DALVADLRGK VSETEKKLSD EEAARLADAA AAEALRERLK NSETELTAMT LALEEQRRKA EETLTLLAAA KTDAARAVSE ADQRAAALAA AREALNAREG EGAEAARRVA LLNEQVAALR AQLGSLQGLL EAAEAQDAAN KVQLQSLGTQ LNSALAQVAS EQKRRAELEE AERLRLEAEN KDLARFRSEF FGQLRQVLAG REGVRVVGDR FVFSSEVLFE PGSAELAPEG RAQISGVVQT LNEIRTQIPE GIDWIIRVDG HTDNVPLSGF GAFRDNWELS SARALSVVRF MQQSLGFPPS RLAATGFGEY RPVTPGDSPD ARAQNRRIEL KLTER
|
| |