Gene Rru_A3157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A3157 
Symbol 
ID3836603 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp3648198 
End bp3649415 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content67% 
IMG OID637827272 
Producthypothetical protein 
Protein accessionYP_428239 
Protein GI83594487 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.842057 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAAGCC CGCAATCGAC GCCAGACGAT CACTCCCCCA AGAACACATC CGAAGGAAAA 
CGGCGCCGTT CTCCGGCGGC GTCTGCCCTG TGGATCGTTG TTTTCCTGCT TTTTCTGATC
TGCGTCGGGC TTTCGCTTTA TCTGTTTTTC AGAACCCCCG AAATCACCCG GAAAACGGCC
GCCGTCGCCA CCCCGGGCGC GACCCAGGAC ACCGGCGAGC GCGCGGCTTA TCTCGCCCAG
CAGATCGCCC TGCGCCGGGT CGAGTTAGGC GACCTTGTCG CCGCCATCGA TCCGCCGCGC
TGCGTCGCCC CGGCCCAAAT CGACGAAAGC CGCCTGCGCC TTTTGCGCCA GCAGCGCGGC
GACGCCCTGT CCCTTTGGCG ATCCTTGAGC ACCGGGGGCA AAACCGGGGC CACCCAGGCG
ATCCCCGACA CCATCCCCGA ACCGGCGGTG GCCACGGCTT TGCCCGGGGC GACGCCGCCG
CTTGCCCCCG CCACGCCGCA AGCCGCCCCT GGGGAAAGCG CACCGACGAT GGGGATCGGG
CCGTTGCGCG ATCGGCTGGA AAAGGCCTCG GTCATCGTCC TTGGCCTGCC CAAGGGCGAG
GCCGACGGCC TGATGACCGG CACCGGTTTT TTCATCGCCG ATAACCTTGT CGTGACCAAC
CGCCATGTGA TCGACGCCGC CGATCCGGCC AAGCTGTTCA TCACCAGTTC CAGCCTGGGC
AAGATGGCCC AGGTGTCGCT GATCGCCACC ACCGTCTCGG CGGTTCCGGG GGAAGCCGAT
TACGCGGTGC TGGGAACGGG AAGCGTGCGC GCCCCGGCCA CCCTCGCCCT GTCGCTCGAC
GCCGGCAAGC TGACGCCGGT GATCGCGGCG GGCTATCCCG GCATGGCGCT GCTGGGCGAT
CAGGGCTTCC AAAAGCTGAT CCAGGGCGAT CTGTCCTCGG CCCCCGATCT CAACATGAAC
CGGGGCGAGG TCCGCTCGGT GCGCCCGGTC GGCGCCATCG TCCAGATCAT CCATACCGCC
GATGTTCTGC AAGGCTACAG CGGCGGTCCG CTGCTTGACA CCTGCGGCCG GGCGATCGGC
GTCAACACCT TCATCCAGGT CGATCGCGAT CAGGCCGCCA AGCTCAACAG CGCCCAGAAG
GTTGATACCC TGCTGGCCTT CCTGCAAAAA AAGGGCATCA CCCCGGCCCT CGACAGCCGC
GCCTGTCAGC CCGGCTGA
 
Protein sequence
MSSPQSTPDD HSPKNTSEGK RRRSPAASAL WIVVFLLFLI CVGLSLYLFF RTPEITRKTA 
AVATPGATQD TGERAAYLAQ QIALRRVELG DLVAAIDPPR CVAPAQIDES RLRLLRQQRG
DALSLWRSLS TGGKTGATQA IPDTIPEPAV ATALPGATPP LAPATPQAAP GESAPTMGIG
PLRDRLEKAS VIVLGLPKGE ADGLMTGTGF FIADNLVVTN RHVIDAADPA KLFITSSSLG
KMAQVSLIAT TVSAVPGEAD YAVLGTGSVR APATLALSLD AGKLTPVIAA GYPGMALLGD
QGFQKLIQGD LSSAPDLNMN RGEVRSVRPV GAIVQIIHTA DVLQGYSGGP LLDTCGRAIG
VNTFIQVDRD QAAKLNSAQK VDTLLAFLQK KGITPALDSR ACQPG