Gene Rsph17025_2386 details


Gene Information

Locus tag: Rsph17025_2386
Symbol:
ID: 5083619
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 2434982
End bp: 2435962
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 70%
IMG OID: 640483949
Product: peptidase U32
Protein accession: YP_001168580
Protein GI: 146278421
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
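
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; neither assumption is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2434982
end_bp = 2435962
gene_length_bp = 981
protein_length_aa = 326

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons = span // 3
residues = codons - 1

print(span, span % 3, residues)  # prints: 981 0 326
```

Under these assumptions the span matches the listed gene length, the length is a whole number of codons, and the residue count matches the listed protein length.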


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.904238
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.103464
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACTCG TCTGCCCTGC CGGCACCCCC GCCGCGCTGC GCGCCGCCGT GGAAGCCGGT 
GCCCATTCCG TCTATTGCGG CTTTGCCGAT GAGACGAACG CCCGCAACTT TCCCGGCCTG
AACTTCTCGC CGAAGGAACT GGCCGAGGGC GTGGCCTTCG CCCATCGCCA CGGAGCCAAG
GTGCTCGTCG CGATCAACAC CTTCCCGCGG GCGGGCGACG AATCCCTGTG GCACCGCAAC
ATCGCCGCCA CCGAGGCGGC GGGCGCGGAT GCGGTGATCC TCGCCGACAT GGGCCTCCTG
GCCTACGCCG CGAAGAACCA TCCGAACCTG CGCCGGCACC TGTCGGTGCA GGCGGCCGCC
GCCAACCCCG ACATCATCAA CTTCTACAAC CGCGAGTTCG GCGTGAAGCG CGTGGTCCTG
CCGCGCGTGC TGACCGTGGC CGAGATCGCC GCGATCAACC GCGAGACGCC CGAGGTCGAG
ACCGAGGTCT TCGTCTTCGG CGGCCTCTGC GTCATGGCCG AGGGGCGCTG CTCGCTCTCG
TCCTACGCCA CCGGCAAGTC GCCGAACATG AACGGCGTCT GCTCGCCCGC GACCGAGGTG
GAATATGTCG AGGAGGGCGA CCAGCTCGCC GCGCGCCTCG GCGACTTCAC CATCCACCGC
GTCGGCAAGG ACCAGCCCGC GCCCTATCCG ACGCTCTGCA AGGGCTGCTT CACCTCTGGT
GATCAGGTGG GCCACATCTT CGAGGATGCG GTCAGCCTCA ACGCGCAGGA CATCCTGCCC
CAGCTCGCCA AGGCGGGCGT CACCGCGCTG AAGATCGAGG GGCGGCAACG CTCGCGGTCC
TACGTCGCGC AGGTGGTGCG CAGCTTCCGC GCCGCCGTCG ATGCGCTGGC CGCGGGCCAG
CCGATGCCGC AGGGGGCGCT GGCCGCGCTC TCGGAAGGGC AGGCGACCAC GACGGGCGCC
TATGCCAAGA CCTGGAGGTA A
 
Protein sequence
MELVCPAGTP AALRAAVEAG AHSVYCGFAD ETNARNFPGL NFSPKELAEG VAFAHRHGAK 
VLVAINTFPR AGDESLWHRN IAATEAAGAD AVILADMGLL AYAAKNHPNL RRHLSVQAAA
ANPDIINFYN REFGVKRVVL PRVLTVAEIA AINRETPEVE TEVFVFGGLC VMAEGRCSLS
SYATGKSPNM NGVCSPATEV EYVEEGDQLA ARLGDFTIHR VGKDQPAPYP TLCKGCFTSG
DQVGHIFEDA VSLNAQDILP QLAKAGVTAL KIEGRQRSRS YVAQVVRSFR AAVDALAAGQ
PMPQGALAAL SEGQATTTGA YAKTWR
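
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of one way to do that and to compute a GC percentage for a nucleotide string. The helper names join_blocks and gc_percent are hypothetical, and how the database itself derives its GC content field is not stated in the record. Only the first line of the gene sequence is used here as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks that group the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGAACTCG TCTGCCCTGC CGGCACCCCC GCCGCGCTGC GCGCCGCCGT GGAAGCCGGT"

dna = join_blocks(first_line)
print(len(dna))         # six blocks of ten bases
print(gc_percent(dna))  # GC percentage of this line only, not of the whole gene
```

Passing the full gene sequence through the same two helpers gives a value that can be compared with the GC content field in the Gene Information section.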