Gene Rsph17029_2117 details


Gene Information

Locus tag: Rsph17029_2117
Symbol:
ID: 4895395
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2245038
End bp: 2246018
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 68%
IMG OID: 640112711
Product: peptidase U32
Protein accession: YP_001043992
Protein GI: 126462878
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
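
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function and variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2245038
end_bp = 2246018
reported_gene_length = 981      # bp
reported_protein_length = 326   # aa


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length):
    """Residue count for a CDS whose final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```

Under those assumptions, both comparisons agree with the reported values of 981 bp and 326 aa.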


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.871657 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.950378 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACTCG TCTGCCCTGC CGGCACCCCT GCCGCGCTGC GCGCCGCCGT GGAAGCCGGT 
GCCCATTCCG TCTATTGCGG CTTTGCCGAC GAGACGAATG CCCGCAACTT CCCCGGCCTG
AACTTCTCGC CGAAGGAACT GGCCGAGGGC GTGGCCTTCG CCCACAAGCA TGGCGCGCAT
GTTCTGGTCG CGATCAACAC CTTTCCGCGG GCGGGAGACG AGTCGCTCTG GCATCGCAAC
ATCGCCGCAG CCGAAGCCGC GGGCGCCGAT GCGGTGATCC TCGCCGACAT GGGGCTTCTG
GCCTATGCCG CGAAGAACCA TCCGAACCTG CGGCGGCACC TCTCGGTGCA GGCGGCGGCG
GCCAACCCGG ATGTCATCAA CTTCTACAGC CGCGAGTTCG GCGTGAAGCG CGTGGTGCTG
CCGCGTGTGC TGACCGTGGC CGAGATCGCC GCGATCAACA AGGAGACGCC CGAGGTCGAG
ACCGAGGTCT TCGTCTTCGG CGGTCTCTGC GTCATGGCCG AGGGGCGCTG CTCGCTCTCG
TCCTATGCCA CCGGAAAGTC GCCCAACATG AACGGTGTCT GCTCGCCCGC GACAGAGGTG
CAGTATGTCG AGGAGGGCGA CGAGCTCGCC GCGCGCCTCG GCGAGTTCAC CATCCACCGC
GTGGGCAAGG ATCAGCCCGC GCCCTATCCG ACGCTCTGCA AGGGCTGCTT CACCTCGGGC
GATCAGGTGG GCCACATCTT CGAGGATGCG GTCAGCCTCA ACGCGCAGGA CATCCTGCCC
CAGCTCGCCA AGGCGGGCGT CACCGCGCTG AAGATCGAGG GGCGGCAGCG CTCGCGGTCC
TATGTCGCGC AGGTGGTGCG CAGCTTCCGC GCCGCCGTCG ATGCGCTGGC CGCGGGCCAG
CCCATGCCGC AGGGGGCGCT GGCCGCCCTC TCGGAAGGGC AGGCGACCAC GACGGGCGCC
TATGCCAAGA CCTGGAGGTA A
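
The GC content reported for this gene can in principle be recomputed from the sequence above. The helper below is a hedged sketch, not the database's own method: `gc_fraction` is a hypothetical name, and it assumes the sequence contains only unambiguous A, C, G, and T bases separated by whitespace as displayed here.

```python
def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide string.

    Whitespace (spaces, newlines) between the 10-base blocks is ignored.
    Assumes only unambiguous A/C/G/T characters are present.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# Example on the first 10-base block of the gene sequence.
first_block = "ATGGAACTCG"
print(gc_fraction(first_block))
```

Passing the full gene sequence as a single string, then multiplying by 100 and rounding, gives a percentage that can be compared with the value in the record.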
 
Protein sequence
MELVCPAGTP AALRAAVEAG AHSVYCGFAD ETNARNFPGL NFSPKELAEG VAFAHKHGAH 
VLVAINTFPR AGDESLWHRN IAAAEAAGAD AVILADMGLL AYAAKNHPNL RRHLSVQAAA
ANPDVINFYS REFGVKRVVL PRVLTVAEIA AINKETPEVE TEVFVFGGLC VMAEGRCSLS
SYATGKSPNM NGVCSPATEV QYVEEGDELA ARLGEFTIHR VGKDQPAPYP TLCKGCFTSG
DQVGHIFEDA VSLNAQDILP QLAKAGVTAL KIEGRQRSRS YVAQVVRSFR AAVDALAAGQ
PMPQGALAAL SEGQATTTGA YAKTWR
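
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below uses no external libraries and builds the codon table from the usual compact TCAG ordering. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the start codons it permits; because this gene begins with ATG, no alternative-start handling is included, which is a simplifying assumption. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate a nucleotide string in frame 1, stopping at the first stop codon.

    Whitespace between sequence blocks is ignored. Alternative start
    codons are NOT converted to M (an assumption that suits ATG-initiated genes).
    """
    cleaned = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last stretches of the gene sequence shown above.
print(translate("ATGGAACTCG TCTGCCCTGC CGGCACCCCT"))  # MELVCPAGTP
print(translate("TATGCCAAGA CCTGGAGGTA A"))           # YAKTWR
```

The two fragments reproduce the first ten and last six residues of the protein sequence; translating the full 981 bp sequence the same way allows a comparison of all 326 residues.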