Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A0195 |
Symbol | |
ID | 3833886 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 237870 |
End bp | 243644 |
Gene Length | 5775 bp |
Protein Length | 1924 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637824273 |
Product | hypothetical protein |
Protein accession | YP_425287 |
Protein GI | 83591535 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGACAGT ATTCGTGGGC CAGCGCCGCC GCCAATCTCG TTTTTGGCTA TCGCCCGGTG TGGACGCCGC AGGACACCGC GGCGCCCTAT CCCTATTTCG AACCGCCGGT GGCGGCGCTG TCGAGCGTGA CGCCGAACGT GACCCCCCAC GATGTCGACT ATTGGGCCCC CTATCCCACC ATCACCTTGC ATGACTCCGC CAGTTCGCTG CTTTCCGGCC TCAACGCCTA TCGCGTCACC GGCTTGCAGA AGGGAGTGGT CGTCCAGCCC GGTCCGCTCA TGCTCAGCGA TGCCGATGGC GCGAAGGCGC TCAAGCAGGC CGGCGACTGG ATCGGTCTCG ATCTCGCCAA AGCCCGCAGC TACATGCTGG TGACGACGCG GCGCGCCAAC GCCACCTTGC GCCATCCCTA TTTCGCCGAG CCGGGGGAGG CGGACCGCGA CCACTATCTG ACGGCCGAGG CTTTGGCGCT GATCGCGGCG CTCAAGCCGA TCGATGCCGC CCAGTTTGTC GCGCCCTTTT CCCATGCCGC CATCAGCCAG GGCGACGCCG CGTCCTATCT CGCCGCCATC ACCGCGCTCG GCTCGCATGT CGTCGCCAGC GAAGTCTTCG GCGATGTGCT GTTCCAGGTC TTCGCCTTTG ACGCCGAGCA TTTCCGCGTG ATCGAGGCGG CCTTTCAGCG TCAGGCGACC CTGCAAGCCG ACGGCCATCT GGCGGTGACG GGCGCGCCGG CGGCGAGTTG GGCCTATTGG ACCACCCCGG TCACCGACAA CGGCTTTGGC TATGTGTCCG AGTACGGCGC GCTGCGTTTG CAAAGCCGCG ACCCGGCCCT CGACGCCGCC CTCACGGAGG GGAAATGGGC GAGCCCTTAT GTTCCCAAGG ACATCGCCTC GATCTTCGCC TATGCGACGG ACTACCGGCT TATCGAGGAT TTCGAGTCGG TCGTGCCGAG CGCCCTGGTG CTGGCGCCGC TGGCCGCCTT GATCTCCAAC CCCTATGTCG CCGGCCCGTG GGACCGGATC GTCAAGGGCG CCCTGTTGCA GAAATGGGGC GATGCGGTGC GCATTCCCCA GCCCTCGCCG GTCGATATCG ACTGGTCCAC CTTCTTTCCC GACGCCAACG CCTCCTGGGC CTCGGGTATC GTCACCCCGA CCATCGACGT CTATCAGGAA CTGGTCGATA TCTCTCGGGT GTCCTTCCTC GGCGCCGATG TGGTCGGCGC GAATTTCCAG ATGAACTCCT TTACCTGCTT TTCCCAGGTG CTGGAGGCCA ATGTGCCGGC GGGCAGCGGT CCGGTGATGC TGCCAAGCGA TCGCGTGGTG CTGGTCGCCG GAACCATCGA CGCCAGCCGG TCGATCGAAA CGCCGGTTCT CACCATGAGC GAGGCGGCGC TTCTTGGTCT TACCGTGGTG TGCGACACCA TGTTCGGCGC CGTGCTTTTC GCCGCCACGA CGCGGTCGGG TGCGGTGATC CGCAAGGCGG CGCTCGATGG CCTGCTCTTT GAAACCACCT CGGTCCAGCC CGCGACGGGG CGCGCCGGAG TGGCCATCGC CGGCGCGCTC GCCAACACGC CGTCGCCCGA GCTTGTGCTC AAGCTCAAGG GCAGCATCGG CATGTCGGTG ACGGCGGCTG AGTCGCTCCT GCATGCCCGT AGCGACGACG CCGAGCGGAT CCGCGGCATA GAGCAGGCCT ATCTGCGCTG GCTCGCCTCC ATCATCCCGG CCGATACCGA GGATCTCGAT CTGGCCTATT CGCGCTCGCA CGCCCTCTAC ATCGCCGAAA ACGTGGCGAC CTTCGGCACC GACATCGTCT TCGTGCCCTA TGTGACCTAT GAAACCTACC GACCCTACGT GTCGAGCATG ATCACCCAGG CGCAGACCCT GTCGCAGACG ATCATCGCCA ACGAAATCAT GATCCGCGAC ACGATCAACA GTTATAAGGT CATGACCTCG CTGGCCGATC TCAACGACAA CGTCAAGCGG ATCGGCGGCG TGCTGACGGG GTATTTCAAG GCGCTGGCCG ATGGCCAGAA GGCCATGAAC GGCTATTACG ACCAAGTGCT CGACCAACTT GGCCAGCAGC AGGCGGAAAC GCTGAAGGAG ATCGCCACTC TCGGCGCCAA GCTTGATGCC CAGCGGGTGG TGATCAGCAA CACCGGCGAT CCCCCCGGCA TCGTTCAGAC CTTCCAAGAC GACTACGCCG ATTATGCGCG CGATCAGGTC GCCGCCTGCG TGGTTTCGGT GGTGGAGGGC CTGTTCACGG CCGGGCTCTC CATGGCCGCC ATTCCCAGCG AAGCGGCGGG CGGTGTGCTG AAGGCGCTGA AGGCGTTGAA AGACACCTAT GACAAGATCA AGGCCATCGT CGCCACGCTG GAAAAGCTGC AGTCGGCGCT GAAGACGATG AACAAGGTGG CCAGTCTCAA CGACCTGTCC AAGACCATCG CCGCCGCCGG CAAGCTCAAT GAATTGCAGA TGCCGACGCA ACTGGAGTTC CAGACCCTCG CCGAAAACGT CCGGGCGGCG CTCACCAATG TGCCCGACAG CGGCCGCCTC AATCAGGACA AGGCCAACCT CATCGCCGCC GTCAACACGC TTTCCAATAT CGGCAGCGCA CTGGTTCAAA CACAGTCGCG GTCGGTGGCC CTGGCGATGG AGATCGTCAA TCAAAACCGG CTGAAGACCA TCAATGGCCA ACAGCAGGCG TCGATGTCGG CGCTGAGCGA TAAGCTCAAT CTGAACAGTG TTTCCCGCCC GCCCGACATC AACAGCATCG ATCTCATCGG CACGACCGGC CCGTTGCAGT TCCAGCTCAA ACAAGTGCTT TCGGTTCTCG CCAACACGCT GGAGCTCCAG GACGGCGCCG TTCAATATGA ATATTTCGGC GATCCGGCGA TCATCACCTC ATTTTCGCTG CAGAACCTGC TGGCGGTGGT GTCGCAGCAG GACCTCAATG CCATCCACGG CATCGGCAAC CTCAATCCGC CGCCGCAGGA TGTCGCTGAT CCCATCACCT TCCGCATCAC CGACGTGCCG GCGAGCCGGC TGACCGGCGG TAATACCTTC CGCTTCCAGA TCGATCTCGC CGCCGTTCCC TTCCTGACCT ATGACATGGT CCGCGTGCGC CGCGTGGTGC CGCGCATCGT CCAGGGCATC CGCTCCAGCG CCTCGGGGCG CTATGAACTG GCCTTCTCGA CATCGGCGAA CCCGTTCATG GATCGGGCCT ACAACCGCAA GGCCCGGCTT TTCGCCACGG ATTTGCGCAA GTTCGGCCCC TATGTCTACG ATATCTCCTC TGGCGCCCTG ATCAGCGGTG GTTCGGAGGG ACCGTTTGAC AACCGGGTGA CCCAAGTCAC GCCCTTTGCC GAATGGGACA TCGCCCTGCC GGCCGACCGG GCCAGCAACA AGGACCTTGA AACGCGCGTC CTTCTCGATA TCGAGGTCGA TTTCTATATC ACCGCCCATT ACGACGACCC GATTCGCCGC CAGCGCGTGG TGCGGGCGGC CAATCGCAAG CTGCTGGCGG CCAATGCGCT GACGGCCAAT GCCCTGGCGG CCGCCGGCGA CAGCGGTCCG ACCCTCGCCT CCTTGCAAAC CCAGATGTAC CAGAACCAAG AGGTGCTCCA GGGCTGGGAT GCGGCGTTCA CCACGTTGGT GGGGCCGGTC AACGCCTTCT TGTACTGGCA GTTCAACCAG ATGACGGGCG GCACCAATCA GATGGCCGTC GCCAGTTACT ACTGCGATAA CGTCATCCCC TTTGGCAAGC TGGCGCTGAC CACCGTAACC CAATTGGCGT TTTCGCTGAG CAATCCGCTG GTGCAGTTCA TTCCCGGCAA CGACAGCGTG ACCGTGATTC AGACCATCAT CAGCGGCACG ATCAAGACCG GCTCCATGCA GGTGGACAAG AAAACCTTCG TGCCCGCGCA GTGCAGTCTG CCCGCCGACC CGGTGACCTT CACCGCCAAC CCGAGCGACA GCACCCTGAC GCTGAGTGTC AGCCCGGTGT TTGCCGAAGG CATGGTGGTG ACGCTGGGCT CCACCGGCAC GCTGCCCGCG CCCTTGCAAG GGGGCGACCA GACCTATTGG CTCGTCGCCC TGAAGACGGT GGGCAAAGTG ACGACGGTCC AAGTGTCAGA GACCGCCGGG GGCAAGCCGA TCGTGCTGAC GAATACCGGG GCGGGCACGC ACACCATTTC GCCCGCCATC GACTGGAACG ATCCTTATGA GATTGACGTT TCGAAAAACC CCTATGTCCG CGGCAGCGTG CCGCTGGCCC AGGTGGAAGG CGTGGTGACG CCGCCCGATG GTCAGGGCAG CAAGGATGAC ACCCGCACGG TGCTGCTTGA TTTCCCGTCG GGCTCGTTCA CCTTCCAGCA GTTCAGCGTC AACCCGCCGG ACTGGGACCC CGAGAAACAC GGCACCCAGA TTTCCAATGC GCTGGCCAAT TATTTCGCCT TCCACGAAAT CAAATACGTC GTGCAGACGA TCAACCTCAA AAACCTCAAT GCCGATACGC ATCTGACCCC GAGCCTCTTC AAGCTTCACG CCGGCACCAC CCGCGCCGGG AACAACATTC TGCAAATGCA GATCGTGACG ACATCGAAGA AACCGCAATC GACCGGTCTA GTTCAGGTCG ATGAGCCGGT GCCCTATAAT CCCGCCAATC CGGTGCCGGG GGGCAGCGAT TTCACCGCCT CGCTGATCCT GAGTTCCAAG CTCACCTTCG AGCATGTTTT CGTCGCGAAC TTCAACCAGG GATCGGCCAA CCTCCAGGCC AAGGCGGTGG CTCCGGCCCA GGGTTATCAA ACCTATACCG CCAAGATCGC CAGCGGTACG GTCGCGGCGA ACGTCGATTT CAAGAGTGAA TATGACGTTC ATGGAACACG GGTGAAGTAC CGGATCTCGG CCTCGGGCAA TACGATCAAT TGGGATCTGG CCGGGCTGGA GTTTAAGCCT TCGGAACACG GCGGCTATGA TCTTTATTAC TCAAACGGCG ATGCCACCAA GCCGGAGGGG GGAACGACGG TTGCCTTCCA ATACAGCCAA TGGATCCCGC CTTACTCGTA CGACATGGTC TATGTCCCGG GCTATTGGAC CGATTGGGAC GACAACTCGG CCATCGCTTA TGTCACCATG ACCGGCAGCT ATCCACTGAA GACCACGGGG CAAGGGTTGG CGCAGGTGGT CGAGTTCGCC AACACCAATC CGAGCGTGAC CTTCTCCAAG GCCTCCAATC TTACGCCGCA GGGACCGTGC GACTGCAACG ACAACGACCT GAAGATCGCC CTGCTCAATG CGCTCGGCGC GGCGGTGCCG GCCAAGTTGA AGGCCAGCAT CGAGAAGGTG CAGTTCAAAT CGATTTCGGT TCTGGCCCTG GAGTCCTTGC TGTTTCCGGC CGATCAACTG GTCAACATGC GCGACGCCTC GGTGCCGGGC GACCTGCTGG TGGTCGGCAG TTTCTATAAC AAGGTGCGCA AGACGGCCGC CGCCTATGAC GTCACCCTTT CGGCTTCCTC GGGGGCCAAG GGCGTGTTCG GGACGACCGC TTTCCAAAAC GGCCAGGGCA ACGGCAGCGC CACGATCAGC GGTCTGCCCA AGGCCTTCTC GTTCCAATAC GGGCCGATCG AGCCCGCCCT GGGAGGCATG GTGACCTATA CGGTCGATAT CGAGGCCGGC ACCATCAACC CGGGCACCCT TCTGCTTGTC GTGGTGCAGC CCGACGTCGA CAAGGCTCCC AAGCAGGTGG TGCTTCTGCC GCCGGGCTTT GGGGTGGGAA CCTGA
|
Protein sequence | MGQYSWASAA ANLVFGYRPV WTPQDTAAPY PYFEPPVAAL SSVTPNVTPH DVDYWAPYPT ITLHDSASSL LSGLNAYRVT GLQKGVVVQP GPLMLSDADG AKALKQAGDW IGLDLAKARS YMLVTTRRAN ATLRHPYFAE PGEADRDHYL TAEALALIAA LKPIDAAQFV APFSHAAISQ GDAASYLAAI TALGSHVVAS EVFGDVLFQV FAFDAEHFRV IEAAFQRQAT LQADGHLAVT GAPAASWAYW TTPVTDNGFG YVSEYGALRL QSRDPALDAA LTEGKWASPY VPKDIASIFA YATDYRLIED FESVVPSALV LAPLAALISN PYVAGPWDRI VKGALLQKWG DAVRIPQPSP VDIDWSTFFP DANASWASGI VTPTIDVYQE LVDISRVSFL GADVVGANFQ MNSFTCFSQV LEANVPAGSG PVMLPSDRVV LVAGTIDASR SIETPVLTMS EAALLGLTVV CDTMFGAVLF AATTRSGAVI RKAALDGLLF ETTSVQPATG RAGVAIAGAL ANTPSPELVL KLKGSIGMSV TAAESLLHAR SDDAERIRGI EQAYLRWLAS IIPADTEDLD LAYSRSHALY IAENVATFGT DIVFVPYVTY ETYRPYVSSM ITQAQTLSQT IIANEIMIRD TINSYKVMTS LADLNDNVKR IGGVLTGYFK ALADGQKAMN GYYDQVLDQL GQQQAETLKE IATLGAKLDA QRVVISNTGD PPGIVQTFQD DYADYARDQV AACVVSVVEG LFTAGLSMAA IPSEAAGGVL KALKALKDTY DKIKAIVATL EKLQSALKTM NKVASLNDLS KTIAAAGKLN ELQMPTQLEF QTLAENVRAA LTNVPDSGRL NQDKANLIAA VNTLSNIGSA LVQTQSRSVA LAMEIVNQNR LKTINGQQQA SMSALSDKLN LNSVSRPPDI NSIDLIGTTG PLQFQLKQVL SVLANTLELQ DGAVQYEYFG DPAIITSFSL QNLLAVVSQQ DLNAIHGIGN LNPPPQDVAD PITFRITDVP ASRLTGGNTF RFQIDLAAVP FLTYDMVRVR RVVPRIVQGI RSSASGRYEL AFSTSANPFM DRAYNRKARL FATDLRKFGP YVYDISSGAL ISGGSEGPFD NRVTQVTPFA EWDIALPADR ASNKDLETRV LLDIEVDFYI TAHYDDPIRR QRVVRAANRK LLAANALTAN ALAAAGDSGP TLASLQTQMY QNQEVLQGWD AAFTTLVGPV NAFLYWQFNQ MTGGTNQMAV ASYYCDNVIP FGKLALTTVT QLAFSLSNPL VQFIPGNDSV TVIQTIISGT IKTGSMQVDK KTFVPAQCSL PADPVTFTAN PSDSTLTLSV SPVFAEGMVV TLGSTGTLPA PLQGGDQTYW LVALKTVGKV TTVQVSETAG GKPIVLTNTG AGTHTISPAI DWNDPYEIDV SKNPYVRGSV PLAQVEGVVT PPDGQGSKDD TRTVLLDFPS GSFTFQQFSV NPPDWDPEKH GTQISNALAN YFAFHEIKYV VQTINLKNLN ADTHLTPSLF KLHAGTTRAG NNILQMQIVT TSKKPQSTGL VQVDEPVPYN PANPVPGGSD FTASLILSSK LTFEHVFVAN FNQGSANLQA KAVAPAQGYQ TYTAKIASGT VAANVDFKSE YDVHGTRVKY RISASGNTIN WDLAGLEFKP SEHGGYDLYY SNGDATKPEG GTTVAFQYSQ WIPPYSYDMV YVPGYWTDWD DNSAIAYVTM TGSYPLKTTG QGLAQVVEFA NTNPSVTFSK ASNLTPQGPC DCNDNDLKIA LLNALGAAVP AKLKASIEKV QFKSISVLAL ESLLFPADQL VNMRDASVPG DLLVVGSFYN KVRKTAAAYD VTLSASSGAK GVFGTTAFQN GQGNGSATIS GLPKAFSFQY GPIEPALGGM VTYTVDIEAG TINPGTLLLV VVQPDVDKAP KQVVLLPPGF GVGT
|
| |