Gene Rru_A1987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A1987 
Symbol 
ID3835411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp2296765 
End bp2297928 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content66% 
IMG OID637826086 
Productpeptidase M42 
Protein accessionYP_427074 
Protein GI83593322 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID[TIGR03106] hydrolase, peptidase M42 family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCGTC TGGCGATCGA TACGGACTAT CTGGCCCGGA CTCTGGTTCG CTTGTTAGCC 
ACCCCCAGCC CGACCGGCTA TACCGATACC GTCGTCCGCG AAACCTGTGC CGAGTTGGAA
AGCCTGGGCC TGACCCCGAC CCTGACCCGA CGGGGGGCGG TTTGCGTGGT GTTGCGCGGA
CGGGAAGCCC GGCCGGCCCG CGCCATCGTT TCCCATCTCG ACACGCTGGG CGCTCAGGTC
AAGCAGCTCA AAGACAACGG CCGCCTGGAA CTGGTGCCGA TCGGCACCTG GTCGGCGCGC
TTCGCCGAGG GGGCGCGGGT CACGGTGTTC ACCGATCGCG GCGCGGTTCG CGGCACGATT
TTGCCGCTGA AGGCCTCGGG CCACATCTTT AACGAAGAGA TCGACAGCCT GCCGATCGGC
TGGCCGATGA CCGAGTTGCG GGTCGATGCC CGGGTTCATA GCAAGGCCGA TCTGATCGCC
CTGGGTATCG AGGTTGGCGA TATCGTCGCC ATCGACCCCC AGCCGGAATT CCTAGCCAAC
GGCTATATCG TGTCCCGCCA TCTTGATGAC AAAGCCGGGG TGGCGCTGAT GCTCGCGGCT
TTGAAGGCCC TGACCGCCCA TAACGAACCG CCGCCCGTCG ATGTCCATTT CATCTTCACC
ATCGCCGAGG AAGTCGGCGT CGGCGCGTCT TCGGCGCTGA CCGATGACGT CGCCTCGGTG
GTCGCCGTCG ATAACGGCAC CAGCGGACCC GGCCAGAACT CGGCCGAATT CGGCGTTACC
ATCGCCATGG CCGACCAGAC CGGCCCCTTT GATTATCATC TGACCCGGGC GCTGATCCGG
CTCTGCCGCG ACGAGGACAT CATCTTTCGC AAGGATGTGT TCCGCTACTA CCGCTCCGAC
GCCGCCTCGG CGGTGGTCGC CGGCCACGAT GTGCGCAACG CGCTGGTCAC CTTTGGCATC
GACGCCTCGC ATGGCTATGA GCGCATCCAT ATGCACGCCC TGCGGTCGGT GGCCGAATTG
CTGAGCGCCT ATGCGCTGAG CCCGGTGGAG ATCCGCCGAG ACGCCGTGGA GACCGCCCGC
GGTCTGGCCG GCTTCACCCG CCAGCCACCC CCCGAACCCA TCGCCGAGGA CCTAGCCGTT
TCGCAAGAGG GCCCTCTTGC GTGA
 
Protein sequence
MTRLAIDTDY LARTLVRLLA TPSPTGYTDT VVRETCAELE SLGLTPTLTR RGAVCVVLRG 
REARPARAIV SHLDTLGAQV KQLKDNGRLE LVPIGTWSAR FAEGARVTVF TDRGAVRGTI
LPLKASGHIF NEEIDSLPIG WPMTELRVDA RVHSKADLIA LGIEVGDIVA IDPQPEFLAN
GYIVSRHLDD KAGVALMLAA LKALTAHNEP PPVDVHFIFT IAEEVGVGAS SALTDDVASV
VAVDNGTSGP GQNSAEFGVT IAMADQTGPF DYHLTRALIR LCRDEDIIFR KDVFRYYRSD
AASAVVAGHD VRNALVTFGI DASHGYERIH MHALRSVAEL LSAYALSPVE IRRDAVETAR
GLAGFTRQPP PEPIAEDLAV SQEGPLA