Gene Rru_A2662 details

Gene Information

Locus tag: Rru_A2662
Symbol:
ID: 3836097
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 3092389
End bp: 3093864
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 69%
IMG OID: 637826768
Product: peptidase S1C, Do
Protein accession: YP_427746
Protein GI: 83593994
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
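
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3092389
end_bp = 3093864
gene_length_bp = 1476
protein_length_aa = 491

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein length.
expected_aa = span // 3 - 1

print(span, span % 3, expected_aa)  # prints: 1476 0 491
```

Under these assumptions the span matches the listed gene length, is a whole number of codons, and gives the listed protein length.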


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGATCAAAG CGGGGGAGGA CCGGATGGAC CAATCGGGGC GGTGGGCCAG GGGCGTTCTG 
GCCGGGGCGA GCGTCGCGGT CTTGCTGGCG TTCGGGGGAC CGGCGGCGGG GGCGGCCGAA
CAGGTGCCAG GGGCCGGGAC GGCCGAACAG GTGCCGATGG CGGCGGCCGA GATCAAGCTG
ACCTTCGCGC CGGTGGTCCG CGCCGTCGCT CCGGCGGTGG TCAATATCTT CAGCCAGCGC
GTCATCACCG AGTCGCAAGT TCCGCCGATG TTCGCCGACC CGCTGTTCCG CCGCTTCCTC
GAGGAGCGCG GCATGCTGGG CAAGCCGCGC GAGCGGGTTC AGCGGTCCTT GGGCTCGGGG
GTGATCTTGC GCTCCGAGGG CGTGGTGGTG ACCAATGCCC ATGTGGTCAA TGGCGCGAGC
GAAATCACCG TGGCCCTCAA CGATCGCCGG GAATTCCCCG CCGAACTGGT CGGGCTTGAT
CCGCGGGCCG ATCTGGCGGT GCTGCGGATC AAATCCGATA CGCCGCTGCC GTCGCTGGCC
CTGGCCGACG GCGAGCCGCC CGAGGTTGGC GATCTGGTTT TGGCCATCGG CAATCCCTTT
GGCGTCGGTC AAACGGTCAC CAGCGGCATC GTTTCGGCCC AGGCGCGGAC CACGGCGGGA
ATTTCCGATT ACCGCTTCTT CATTCAGACC GACGCGGCGA TCAACCCCGG CAATTCCGGC
GGCGCCCTGG TCGATCTGAG CGGCCGGCTG GTCGGCATCA ACACCGCCAT TTATTCGCGC
GACGGCGGCA GCGTCGGCAT CGGCTTCGCC ATTCCGGTCG AAATGGTGCG CTCGGTGGTC
GAGGGGATCT TGGAAGACGG CAAGGTCCGC CACCCCTGGC TGGGCGCCGA TGGCCAGTCG
GTGACGACCG AACTGGCAAG CCACATGGGC TTGGATCGCC CGCTGGGCGT CGCCATCACC
GATGTGGCCA AGGGCGGGCC GGCGGCCAAG GCCGGGCTTG CCGAAGGCGA TGTCATCCTG
GCGCTTGATG GCCGGCCGGT GTTCGAGGGG GAAACCCTGC GCTATCGCAT CGCCACCCAC
CGCCCCGGCG ACAAGGTGGT GTTGGGGATC CGCCGCGACG GCAAAGATAC CACCCTGGCC
GTCACCCTGG AGGCGCCGCC GGAGGATCCG CCGCGCGATA CCACCTTGCT CAAAGGCTCC
CATCCGTTCG ACGGGGCGGA GGTTTCCAAT ATGAATCCGG CCCTGGCCGA AGAACTGGGG
TTGGAGAAGG CCTCCCGGGG GGTGACGGTC ACCGGCCTGG GCGATGGCGT CGCCGCGCGG
ATCGGCATCC GCCCCGGGGA CCGGGTCATC GAGGTCAACG GCAAGGCCAT TGACGCGGTT
AGGGACCTGA AGGCGGCGAT CAGCAAACCC AGCCGGGTGT GGCAGGTGGT GGTCGATCGC
GACGGTCGGG CGGTGCGGCT GGTGCTCGGC GGCTGA
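
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any calculation. The following is a minimal sketch of how a GC percentage could be computed from such text. The helper names `clean` and `gc_percent` are hypothetical, and the record does not say how its own GC content figure was derived or rounded.

```python
def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq_text: str) -> float:
    """Return the percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in seq)
    return 100.0 * gc_count / len(seq)


# First printed line of the gene sequence, copied from the record.
first_line = "ATGATCAAAG CGGGGGAGGA CCGGATGGAC CAATCGGGGC GGTGGGCCAG GGGCGTTCTG"
print(len(clean(first_line)), round(gc_percent(first_line), 1))
```

Passing the whole gene sequence text to `gc_percent` instead of `first_line` gives a figure that can be compared with the GC content field.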
 
Protein sequence
MIKAGEDRMD QSGRWARGVL AGASVAVLLA FGGPAAGAAE QVPGAGTAEQ VPMAAAEIKL 
TFAPVVRAVA PAVVNIFSQR VITESQVPPM FADPLFRRFL EERGMLGKPR ERVQRSLGSG
VILRSEGVVV TNAHVVNGAS EITVALNDRR EFPAELVGLD PRADLAVLRI KSDTPLPSLA
LADGEPPEVG DLVLAIGNPF GVGQTVTSGI VSAQARTTAG ISDYRFFIQT DAAINPGNSG
GALVDLSGRL VGINTAIYSR DGGSVGIGFA IPVEMVRSVV EGILEDGKVR HPWLGADGQS
VTTELASHMG LDRPLGVAIT DVAKGGPAAK AGLAEGDVIL ALDGRPVFEG ETLRYRIATH
RPGDKVVLGI RRDGKDTTLA VTLEAPPEDP PRDTTLLKGS HPFDGAEVSN MNPALAEELG
LEKASRGVTV TGLGDGVAAR IGIRPGDRVI EVNGKAIDAV RDLKAAISKP SRVWQVVVDR
DGRAVRLVLG G
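
The protein sequence can be compared against a translation of the gene sequence. The sketch below builds a codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in their permitted start codons, which this sketch does not model). The function name `translate` is hypothetical, and the code assumes the input starts in frame at the first base.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G at each position).
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate in frame from the first base and stop at the first stop codon."""
    seq = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGATCAAAG CGGGGGAGGA CCGGATGGAC"))  # prints: MIKAGEDRMD
```

The printed residues match the first block of the protein sequence listed in this record. Translating the full gene sequence the same way allows a residue-by-residue comparison.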