Gene Rsph17025_1397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_1397 
Symbol 
ID5083071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp1425016 
End bp1426860 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content67% 
IMG OID640482955 
Productpeptidase M10, serralysin-like protein 
Protein accessionYP_001167599 
Protein GI146277440 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2931] RTX toxins and related Ca2+-binding proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.11813 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.552003 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTGT CATCCCTGTT CCCGTGGGCG GTCGGACCTG CCGATTGGCC GAAGGCCGTG 
GCCGCGACCG TCTCGCGGTA TCTGGTGCCC TCAGCCGATG AGGGGCCGGA GCTTCTCTGG
CCAGCCGGAA AATCCGATCG GGCAGGGGAC CGGGGCGGTC CTGACCTCAA GGCGGTGGCC
GCGCCCGGAC CCGAGGAGGA TGGCCCGGAT GGCCGCTTGC ACGAGGCGAG GGTTTTTGAC
CGGGGCCTGT TCAGCGGGCC TGCGGATGAG GCTTCCGCAT CCACAGGCCG TGAGCCTGCG
GCAATCTCGG TCCAGCGGGA CGAGGCCTTC GCCTTCGCGC TCGGTCCGTC GGCAACGTCC
GGCACGTCGT TCGCGCCATC CTTCGGCTCC GTGCAGCAGA TCGCGACGCA GCTCGTGAGC
GGTTACTGGC AGTGGAAAGG CGCGCCGGCC CGCGCCTACG ACGTGAAGCC GGGCGACACG
CTCAACGTGG ACTTCTCGGG CCTCACCGCC GACGGCAAGC GTCTCGCGAC CCTTGCGCTG
CAGAGCTGGT CGGACGTGAC CGGCATCCGC TTCAACACCA ACCCGGCGCG GCTGGCGACG
GTCCATATCA CGATGGATGA TGTCGAACCC GGGGCAAGCA CGTCCACGAC GTACATGGGC
AGCAAGATCC TGAAATCCCA TGTCAACATC GGCACGAAAT GGCTCTCGGA CTACGGGACC
GACGTCAACA GCTATTCGCT GCAGACCTAC ATTCACGAAC TTGGCCACGC GCTGGGTCTG
GGCCATCCGG GGAACTACAA CGGCTGGGCC AACTACGGCA CCGACAACCT GTTCCTCAAC
GACAGCTGGC AGGCCACGGT CATGTCCTAT TTCAGCCAGA CCGAGAACAC CACCATCAAG
GCCGACCGGG CCTTCGTCGT CACGCCGATG ATCGCGGACA TCGTGGCGGT CCAGATGATC
TACGGCACGC CCAGCGGCCT CCGCGCGGGA AACACCACCT ACGGCGACAA CTCGAACGCC
GGCGGCATGT ATGACCGGAT CGCGAGCCTG CCGGGGCGCG AGGTCGCCTG GACCATCTAC
GATCAGGGCG GCCGCGACAT CCTCGACCTG CGGTCCGCGA CGGCGGCGCA GGTGATCGAC
CTCCGGCCGG GCAGCGTCTC GAACGTCTAT GGCGCCGTGG GCAACCTCTC CATCGCGCAG
GGAACGGTGA TCGAGGTGGC GCGGGGCGGC AAGGGCAATG ACGTGATCAT CGGGAATTCG
GCCCACAACG ACCTGAGCGG CGGAGGCGGC TCGGACACGC TGCGCGGCGG GGCGGGCAAT
GACATCTACC GCGTGGACGG AGGCGACCGC GTGATCGAGC TTGCGGGCAA CGGCATCGAC
CGCGTGATCT CGAGCGCGAG CTTCCAGCTT GGCGCGCATG TCGAGAACCT GACGCTGACA
GGGAACCGCG CGATCAACGG CACGGGGAAC GATCTGGCCA ATGTGATCGT CGGCAATGGC
GCGGCCAATG TGCTGACCGG GCGGGGAGGG GCCGACAGTT TTGTCTTCTC CACCGCCCTC
GGCGGCGGAA ATGTGGACCG GATCACCGAT TTCAACGTGG TCGATGACAC GATCCGGCTG
GACGACGCGA TCTTCCGGGC CCTTGCACCG GGGCGCCTCG CAGGCTCTGC CTTCAGCGCG
AACGCGGCCG GCCGGGCGAA GGATGCGACC GACCGGATCA TCTACGAAAC CGACACCGGC
TCGCTCTGGT ACGACGCGGA CGGCACGGGC GGCGCGGCGC CGATCCGGTT CGCCGTGCTC
GCGCCGGGCC TCTCCGTCAC GGCGGCGGAC TTCCTCGTCA TCTGA
 
Protein sequence
MNLSSLFPWA VGPADWPKAV AATVSRYLVP SADEGPELLW PAGKSDRAGD RGGPDLKAVA 
APGPEEDGPD GRLHEARVFD RGLFSGPADE ASASTGREPA AISVQRDEAF AFALGPSATS
GTSFAPSFGS VQQIATQLVS GYWQWKGAPA RAYDVKPGDT LNVDFSGLTA DGKRLATLAL
QSWSDVTGIR FNTNPARLAT VHITMDDVEP GASTSTTYMG SKILKSHVNI GTKWLSDYGT
DVNSYSLQTY IHELGHALGL GHPGNYNGWA NYGTDNLFLN DSWQATVMSY FSQTENTTIK
ADRAFVVTPM IADIVAVQMI YGTPSGLRAG NTTYGDNSNA GGMYDRIASL PGREVAWTIY
DQGGRDILDL RSATAAQVID LRPGSVSNVY GAVGNLSIAQ GTVIEVARGG KGNDVIIGNS
AHNDLSGGGG SDTLRGGAGN DIYRVDGGDR VIELAGNGID RVISSASFQL GAHVENLTLT
GNRAINGTGN DLANVIVGNG AANVLTGRGG ADSFVFSTAL GGGNVDRITD FNVVDDTIRL
DDAIFRALAP GRLAGSAFSA NAAGRAKDAT DRIIYETDTG SLWYDADGTG GAAPIRFAVL
APGLSVTAAD FLVI