Gene Rru_A3097 details


Gene Information

Locus tag: Rru_A3097
Symbol:
ID: 3836543
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 3564389
End bp: 3567190
Gene Length: 2802 bp
Protein Length: 933 aa
Translation table: 11
GC content: 67%
IMG OID: 637827212
Product: hypothetical protein
Protein accession: YP_428179
Protein GI: 83594427
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID: [TIGR02917] putative PEP-CTERM system TPR-repeat lipoprotein
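
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; both are assumptions that happen to agree with the listed values, not something the record states. The function names are hypothetical.

```python
def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count, assuming the CDS ends in exactly one stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(3564389, 3567190))  # 2802, matching Gene Length
print(protein_length(2802))           # 933, matching Protein Length
```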


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.681672
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACGGT CACTGAAACG GTTCTGGTCG ACGACAAGCG CCCTCGCCCT GGTCCTGGCG 
CTCGCCGCCT GCAACGACGA GGCCAAAGAC AAGGCGCGGG CCTCGGCCAA GGCCTATGAC
AGCGCTATCA CCGCTTGGCG GGCCGGCGAC CAGAAGGCGG CGCTGATCCA TGTCAGCAAC
GCCCTCAAGG CGGCGCCCGA CAACCGCGAC GCCAAGATCC TGATGGGCGA GATCACGCTC
TCGGGCGGCG ATGTCTGGTC GGGCGAAAAG CTGCTCAAGG AGGTCCGCGA CGGTGGGGCG
CCGGCCGAGA CATGGCTGCG TCCCCTGGCC AAATCCCTAA TCCTTCAGCA GAAATTTGAC
GAAGCCCTGA CTCTTGCCCG CTCCATCCCG GAAACCGAGC TGGTCGGCGC GGTAAAGACC
ATCGAAGGCC TCGCCCTGTT TGGCAAAGGC CAGGACGGGG CGGCGCGGCT GGCCCTGGAC
AAAGCCCTCG ACATCGACCC GGCCGACAAG GACGCCCTGA TCGGCGCGGC CCAGATCGAA
ACCATGGCCG GCCATCAGGA CGCCGCCCGC GCCCTGCTCG CCCGAGCGGC GGCGGCGGCG
CCCGATGATG TCGATGTGCT GGTGGCCCAG GCCGATACCG CGCTCAGCGC CAATGATCCG
GCGGCGGCCG AGGGGCTTTT CTCCCAGGCC GCGGCCCGTC TGCCGCTCAA CCCGCTGATC
CGCCTGTCGC TGGCCCAGGC CCAGATCGAG GCCGGCAAGA ACGCCGAGGC CCGCCAGACG
CTCAATACGG TGCTGGCCGA TATTCCGGCC CACCCCTGGG CGCTGTACCT GCGCGGCCTC
ACCGCCTATC GCACCAACGA TATGACCGCC GCCGACAAGG ATCTGACCGC TGCCCTGGCC
TTGGCCAAGA CCCTGCGCCC GGCGATTTTC CTGGCCGGGG TGGTCAAATA CAACATCGGC
GAATACGAAC AGGCCTCGCG GCTGCTTGCC GGGCTGACCG AGACCGAGGG CAAAAGCAAC
AAGACCGCCG ATGCCGTGCG CGCCGCCGCC CTTTTGAAAC TGGGCCGCGA CGACGAAAGT
TATCGTCTGC TCCGCCCCTT GGCCGCCGGT GACGGCGAAA CCGCCGATCT TTACGCGATG
GCCGCCGTCG CCGCCCAAGG AGCCGCCGCC ATGGCCGACA GCGAAGGCTA TTACCAGAAA
GCCGTTGTTC TGCGCCCCGA CGACCCCGCC TTGCTGACCA ATCTCGGCAT AGTCAAACTG
GCGCGCGGCG ACACCACCGC CGGCGAAGAC ACCTTGAACC GGGCGGCCGA GCTTGAAGGC
GACGACAAGA AGGCGCTGCT GCTGCTGTTC TCCTCGTTGC TGCAGAAAAA GGACTTCGAC
AAGGCCGAGG CCCTGGCCGG GGACACCAAG CGGAAATATC CCGATCGCGC CTGGGGCTGG
ACGATGGACG GCATGATCCA GGCCAGCCGG GGCGACACCG CCATGGCGCG CGCCGCCTTC
GAAACCGCCG TTCGCAAAGA GCCAACGGCC GGTGACGCCG TGCGCAATCT GGCCCTGACC
GCCCTGCAGT CGGGCGATAC CGAAGGCGCC CGCGGCGTGG TCGAAGGCTA TCTGCGGACT
AATGCCGGGG ATTCGGCGAT GGCGATGATC GCCGCCGCCG TGGCCAATAA GCGCAATGAT
CTGGTCGCCG TCGAAAAATG GCTGCGTCAG GCCCTGGAGC GCGATCCGGC CAACATGGAG
GCCGCCAGCA ATCTGGCGTC GCTGCTGACC TCGACCTCCC GTCCCCAGGC GGCAATCATC
GTCGCCCAGG ACGCCCTGCG GATTTCTCCC GATACTCCGG CGGTGATGGA AGCCCTGGGC
AAGGCCCAAT TGCTGATCGG CGATTATACG GCGGCGGCCG ATGTGCTGCG CCGGGCGGTG
GCGATCAAAC CCAGCGGGCT GACCTATTAT CTGCTGGCGA CCGCCTATCT CAATCTCAAC
GACCCGCCCC GCCTCAAGGA GGCCCTGGAG TCGGTGGTCA AGCTGCAGCC CGATCATGTC
GATTCGCGGG TGATGCTGGC GACCATGATC GTTGATGGCG GGTCTTTGAG CGACGCCAAG
ACCGCCGTCG AGGGCGTGAC CACGGATTTC CCGGGCGATC CCCGGGCCCA GGAGGTTCGC
GCCCGCTATC TGGCCAAGGC CGAGGGGCCG GCCAGCGCCA TCACCTTCCT TGAAGCCTCG
CTGACCGATC CCAACACCCG TCCGCGCAAT CTCGTCATGC TGCTGGCCTC GGCCTATAAT
GAAAAGGGCG ATGGCGCCAA GGCAAGCAGC CTGCTTGAAG ACTGGGTGGC CAAGAACCCC
GATGACTACC CCGGCCGCCT GTCGCTGGCG ACCCAGCAGA TCGCCGCCAA TCAGTTGGAA
AAGGCCAAGA CGACCCTGGA AAAAGGGCTG GAGCGGGTTC CCAACGACTG GATCGCCCGC
AACAACCTTG CCGAGGTGAT GCTCCGTCTT GGCCAGACCT CGGCCGCCTA TGACCAGATC
GTCATCGCCC GACGCTCGGG GGGACCACAG CCGGCGCTGC TTGATACCGA GGGCCAGATC
CTGCTCAAGA TGGGCAAGGC CAGCGAGGCC GTCGAGATCC TGCGGCTGGC GACGATCGAT
CGCAACGCCC CGCCCACCTA TGGCCTGCAC CTCGCCCAGG CCCTGGCGGC GGCGGGCAAG
AAAGACGAGG CCGGCGAGCG TTTGCGGGCC TTGCTCGACA AGAACAACGC CTTCGAGGGG
GCCGAACAGG CCCGGGCGCT TTTATCCGAG TTGCAGGGAT GA
 
Protein sequence
MARSLKRFWS TTSALALVLA LAACNDEAKD KARASAKAYD SAITAWRAGD QKAALIHVSN 
ALKAAPDNRD AKILMGEITL SGGDVWSGEK LLKEVRDGGA PAETWLRPLA KSLILQQKFD
EALTLARSIP ETELVGAVKT IEGLALFGKG QDGAARLALD KALDIDPADK DALIGAAQIE
TMAGHQDAAR ALLARAAAAA PDDVDVLVAQ ADTALSANDP AAAEGLFSQA AARLPLNPLI
RLSLAQAQIE AGKNAEARQT LNTVLADIPA HPWALYLRGL TAYRTNDMTA ADKDLTAALA
LAKTLRPAIF LAGVVKYNIG EYEQASRLLA GLTETEGKSN KTADAVRAAA LLKLGRDDES
YRLLRPLAAG DGETADLYAM AAVAAQGAAA MADSEGYYQK AVVLRPDDPA LLTNLGIVKL
ARGDTTAGED TLNRAAELEG DDKKALLLLF SSLLQKKDFD KAEALAGDTK RKYPDRAWGW
TMDGMIQASR GDTAMARAAF ETAVRKEPTA GDAVRNLALT ALQSGDTEGA RGVVEGYLRT
NAGDSAMAMI AAAVANKRND LVAVEKWLRQ ALERDPANME AASNLASLLT STSRPQAAII
VAQDALRISP DTPAVMEALG KAQLLIGDYT AAADVLRRAV AIKPSGLTYY LLATAYLNLN
DPPRLKEALE SVVKLQPDHV DSRVMLATMI VDGGSLSDAK TAVEGVTTDF PGDPRAQEVR
ARYLAKAEGP ASAITFLEAS LTDPNTRPRN LVMLLASAYN EKGDGAKASS LLEDWVAKNP
DDYPGRLSLA TQQIAANQLE KAKTTLEKGL ERVPNDWIAR NNLAEVMLRL GQTSAAYDQI
VIARRSGGPQ PALLDTEGQI LLKMGKASEA VEILRLATID RNAPPTYGLH LAQALAAAGK
KDEAGERLRA LLDKNNAFEG AEQARALLSE LQG
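
The sequences above are laid out in space-separated blocks of ten, so they need cleaning before any computation. The sketch below shows one way to strip that layout, compute a GC fraction, and translate a CDS. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical, and the code assumes an uppercase-able A/C/G/T sequence with no ambiguity symbols.

```python
from itertools import product

# Codon order T, C, A, G with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(block):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases; assumes a non-empty sequence."""
    seq = clean_sequence(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGGCACGGT CACTGAAACG GTTCTGGTCG"))  # MARSLKRFWS
```

Applied to the full gene sequence, `translate` can be compared against the listed protein sequence, and `gc_fraction` against the GC content field.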