Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A3097 |
Symbol | |
ID | 3836543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 3564389 |
End bp | 3567190 |
Gene Length | 2802 bp |
Protein Length | 933 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637827212 |
Product | hypothetical protein |
Protein accession | YP_428179 |
Protein GI | 83594427 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | [TIGR02917] putative PEP-CTERM system TPR-repeat lipoprotein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.681672 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACGGT CACTGAAACG GTTCTGGTCG ACGACAAGCG CCCTCGCCCT GGTCCTGGCG CTCGCCGCCT GCAACGACGA GGCCAAAGAC AAGGCGCGGG CCTCGGCCAA GGCCTATGAC AGCGCTATCA CCGCTTGGCG GGCCGGCGAC CAGAAGGCGG CGCTGATCCA TGTCAGCAAC GCCCTCAAGG CGGCGCCCGA CAACCGCGAC GCCAAGATCC TGATGGGCGA GATCACGCTC TCGGGCGGCG ATGTCTGGTC GGGCGAAAAG CTGCTCAAGG AGGTCCGCGA CGGTGGGGCG CCGGCCGAGA CATGGCTGCG TCCCCTGGCC AAATCCCTAA TCCTTCAGCA GAAATTTGAC GAAGCCCTGA CTCTTGCCCG CTCCATCCCG GAAACCGAGC TGGTCGGCGC GGTAAAGACC ATCGAAGGCC TCGCCCTGTT TGGCAAAGGC CAGGACGGGG CGGCGCGGCT GGCCCTGGAC AAAGCCCTCG ACATCGACCC GGCCGACAAG GACGCCCTGA TCGGCGCGGC CCAGATCGAA ACCATGGCCG GCCATCAGGA CGCCGCCCGC GCCCTGCTCG CCCGAGCGGC GGCGGCGGCG CCCGATGATG TCGATGTGCT GGTGGCCCAG GCCGATACCG CGCTCAGCGC CAATGATCCG GCGGCGGCCG AGGGGCTTTT CTCCCAGGCC GCGGCCCGTC TGCCGCTCAA CCCGCTGATC CGCCTGTCGC TGGCCCAGGC CCAGATCGAG GCCGGCAAGA ACGCCGAGGC CCGCCAGACG CTCAATACGG TGCTGGCCGA TATTCCGGCC CACCCCTGGG CGCTGTACCT GCGCGGCCTC ACCGCCTATC GCACCAACGA TATGACCGCC GCCGACAAGG ATCTGACCGC TGCCCTGGCC TTGGCCAAGA CCCTGCGCCC GGCGATTTTC CTGGCCGGGG TGGTCAAATA CAACATCGGC GAATACGAAC AGGCCTCGCG GCTGCTTGCC GGGCTGACCG AGACCGAGGG CAAAAGCAAC AAGACCGCCG ATGCCGTGCG CGCCGCCGCC CTTTTGAAAC TGGGCCGCGA CGACGAAAGT TATCGTCTGC TCCGCCCCTT GGCCGCCGGT GACGGCGAAA CCGCCGATCT TTACGCGATG GCCGCCGTCG CCGCCCAAGG AGCCGCCGCC ATGGCCGACA GCGAAGGCTA TTACCAGAAA GCCGTTGTTC TGCGCCCCGA CGACCCCGCC TTGCTGACCA ATCTCGGCAT AGTCAAACTG GCGCGCGGCG ACACCACCGC CGGCGAAGAC ACCTTGAACC GGGCGGCCGA GCTTGAAGGC GACGACAAGA AGGCGCTGCT GCTGCTGTTC TCCTCGTTGC TGCAGAAAAA GGACTTCGAC AAGGCCGAGG CCCTGGCCGG GGACACCAAG CGGAAATATC CCGATCGCGC CTGGGGCTGG ACGATGGACG GCATGATCCA GGCCAGCCGG GGCGACACCG CCATGGCGCG CGCCGCCTTC GAAACCGCCG TTCGCAAAGA GCCAACGGCC GGTGACGCCG TGCGCAATCT GGCCCTGACC GCCCTGCAGT CGGGCGATAC CGAAGGCGCC CGCGGCGTGG TCGAAGGCTA TCTGCGGACT AATGCCGGGG ATTCGGCGAT GGCGATGATC GCCGCCGCCG TGGCCAATAA GCGCAATGAT CTGGTCGCCG TCGAAAAATG GCTGCGTCAG GCCCTGGAGC GCGATCCGGC CAACATGGAG GCCGCCAGCA ATCTGGCGTC GCTGCTGACC TCGACCTCCC GTCCCCAGGC GGCAATCATC GTCGCCCAGG ACGCCCTGCG GATTTCTCCC GATACTCCGG CGGTGATGGA AGCCCTGGGC AAGGCCCAAT TGCTGATCGG CGATTATACG GCGGCGGCCG ATGTGCTGCG CCGGGCGGTG GCGATCAAAC CCAGCGGGCT GACCTATTAT CTGCTGGCGA CCGCCTATCT CAATCTCAAC GACCCGCCCC GCCTCAAGGA GGCCCTGGAG TCGGTGGTCA AGCTGCAGCC CGATCATGTC GATTCGCGGG TGATGCTGGC GACCATGATC GTTGATGGCG GGTCTTTGAG CGACGCCAAG ACCGCCGTCG AGGGCGTGAC CACGGATTTC CCGGGCGATC CCCGGGCCCA GGAGGTTCGC GCCCGCTATC TGGCCAAGGC CGAGGGGCCG GCCAGCGCCA TCACCTTCCT TGAAGCCTCG CTGACCGATC CCAACACCCG TCCGCGCAAT CTCGTCATGC TGCTGGCCTC GGCCTATAAT GAAAAGGGCG ATGGCGCCAA GGCAAGCAGC CTGCTTGAAG ACTGGGTGGC CAAGAACCCC GATGACTACC CCGGCCGCCT GTCGCTGGCG ACCCAGCAGA TCGCCGCCAA TCAGTTGGAA AAGGCCAAGA CGACCCTGGA AAAAGGGCTG GAGCGGGTTC CCAACGACTG GATCGCCCGC AACAACCTTG CCGAGGTGAT GCTCCGTCTT GGCCAGACCT CGGCCGCCTA TGACCAGATC GTCATCGCCC GACGCTCGGG GGGACCACAG CCGGCGCTGC TTGATACCGA GGGCCAGATC CTGCTCAAGA TGGGCAAGGC CAGCGAGGCC GTCGAGATCC TGCGGCTGGC GACGATCGAT CGCAACGCCC CGCCCACCTA TGGCCTGCAC CTCGCCCAGG CCCTGGCGGC GGCGGGCAAG AAAGACGAGG CCGGCGAGCG TTTGCGGGCC TTGCTCGACA AGAACAACGC CTTCGAGGGG GCCGAACAGG CCCGGGCGCT TTTATCCGAG TTGCAGGGAT GA
|
Protein sequence | MARSLKRFWS TTSALALVLA LAACNDEAKD KARASAKAYD SAITAWRAGD QKAALIHVSN ALKAAPDNRD AKILMGEITL SGGDVWSGEK LLKEVRDGGA PAETWLRPLA KSLILQQKFD EALTLARSIP ETELVGAVKT IEGLALFGKG QDGAARLALD KALDIDPADK DALIGAAQIE TMAGHQDAAR ALLARAAAAA PDDVDVLVAQ ADTALSANDP AAAEGLFSQA AARLPLNPLI RLSLAQAQIE AGKNAEARQT LNTVLADIPA HPWALYLRGL TAYRTNDMTA ADKDLTAALA LAKTLRPAIF LAGVVKYNIG EYEQASRLLA GLTETEGKSN KTADAVRAAA LLKLGRDDES YRLLRPLAAG DGETADLYAM AAVAAQGAAA MADSEGYYQK AVVLRPDDPA LLTNLGIVKL ARGDTTAGED TLNRAAELEG DDKKALLLLF SSLLQKKDFD KAEALAGDTK RKYPDRAWGW TMDGMIQASR GDTAMARAAF ETAVRKEPTA GDAVRNLALT ALQSGDTEGA RGVVEGYLRT NAGDSAMAMI AAAVANKRND LVAVEKWLRQ ALERDPANME AASNLASLLT STSRPQAAII VAQDALRISP DTPAVMEALG KAQLLIGDYT AAADVLRRAV AIKPSGLTYY LLATAYLNLN DPPRLKEALE SVVKLQPDHV DSRVMLATMI VDGGSLSDAK TAVEGVTTDF PGDPRAQEVR ARYLAKAEGP ASAITFLEAS LTDPNTRPRN LVMLLASAYN EKGDGAKASS LLEDWVAKNP DDYPGRLSLA TQQIAANQLE KAKTTLEKGL ERVPNDWIAR NNLAEVMLRL GQTSAAYDQI VIARRSGGPQ PALLDTEGQI LLKMGKASEA VEILRLATID RNAPPTYGLH LAQALAAAGK KDEAGERLRA LLDKNNAFEG AEQARALLSE LQG
|
| |