Gene Rsph17025_2931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_2931 
Symbol 
ID5084531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp2988635 
End bp2989981 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content70% 
IMG OID640484502 
Productpeptidase M16 domain-containing protein 
Protein accessionYP_001169122 
Protein GI146278963 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00146664 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.111067 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCTGC GCCGCCTGGC CCTTCCGGCC GCCACCTGGC TCATGCTCTC GCTCCCCGCG 
CTGGCCGAGA CGGTGAGCGA TTTCCGCGTG CCGAACGGGC TCGAGGTGGT GGTGATCGAG
GATCACCGCG CCCCTGTCGT GACTCACATG GTCTGGTATC GCGTGGGCGC CGCGGACGAG
CCGCCCGGCC ATTCCGGCAT CGCCCATTTC CTCGAGCATC TGATGTTCAA GGGCACGGAC
GAGCTGGCCG CGGGCGAGTT CTCGGCCACG GTCGAGGCGC AGGGCGGCGA CGACAACGCC
TTCACCTCGT GGGACTACAC GGCCTACTTC CAGCGCGTCG CGGCCGACCG GCTCGATCTG
ATGATGAAGA TGGAAGCCGA CCGGATGCGC GACCTCGAGA TGACGGAAGA GGATGTGCGC
ACCGAGCGGC AGGTGGTGCT CGAAGAGCGC AGCCAGCGCA TCGACAGCGA CCCCGGCTCG
ATCTTTTCCG AGCAGAGCCG CGCGGCGGCC TACCTGAACC ACCCCTACGG CATCCCGATC
ATCGGCTGGC GGCACGAGAT CGAGCAGCTC GGCCGCGAGG ACGCCTTCAG CTTCTACCGG
ACCTATTACG CGCCGAACAA CGCGATCCTC GTGGTTGCGG GCGACGTGGA CCCGGCCGAG
GTGCGGCGCA TGGCCGAAGC GCATTACGGC CCGCTCGAGC CATCGGCGAA CCTGCCCGAG
CGGCTGCGCC CGCAGGAACC GCCGCAATTG TCCGAGCGGC GCCTGACCTT CACCGATCCG
CGCGTGGCGC AGCCCTATGT CTCGCGCAGC TACCTCGCCC CCGCCCGTCA GAGCGGCGCG
CAGGAGAAGG CGGCGGCCCT CACGATCCTG GCCGAACTTC TGGGGGGCAG CCCCACCACA
TCGCTTCTCG CGCGCGAGTT GCAGTTCGGC GAGCGGCCCC GCGCCGTCTG GGCGCAGGCC
TGGTACAATG GCGGGGCGCT CGACTCGGGC AGCTTCGGGC TTGCGGTCGT GCCGGTGCCC
GGGGTGCCGC TCGACGAGGC CGAGGAGGCG ATGGATGCGG TCGTCGCGCG CTTCCTCGAG
GAGGGGCCCG ACCCCGAGGA TTTCGAGCGG ATCAAGATCC AGCTCGGCGC GCAGGACATC
TATTCGCGTG ACAATGTGGA CGGGCTGGCC CGCCGCTACG GCGCCGCTCT GACCACGGGG
CTGACGGTCG AGGACGTGAA GGCCTGGCCC GATGTGCTGC AGGCGGTGAC GCCCGAGGAC
GTGATGGCCG CCGCGCGCGA GGTCTTCGAC CGCCGCCGCG CCGTCACCGG CTGGCTGATG
CAGGCCGACG AGGAGACAAG CCAATGA
 
Protein sequence
MMLRRLALPA ATWLMLSLPA LAETVSDFRV PNGLEVVVIE DHRAPVVTHM VWYRVGAADE 
PPGHSGIAHF LEHLMFKGTD ELAAGEFSAT VEAQGGDDNA FTSWDYTAYF QRVAADRLDL
MMKMEADRMR DLEMTEEDVR TERQVVLEER SQRIDSDPGS IFSEQSRAAA YLNHPYGIPI
IGWRHEIEQL GREDAFSFYR TYYAPNNAIL VVAGDVDPAE VRRMAEAHYG PLEPSANLPE
RLRPQEPPQL SERRLTFTDP RVAQPYVSRS YLAPARQSGA QEKAAALTIL AELLGGSPTT
SLLARELQFG ERPRAVWAQA WYNGGALDSG SFGLAVVPVP GVPLDEAEEA MDAVVARFLE
EGPDPEDFER IKIQLGAQDI YSRDNVDGLA RRYGAALTTG LTVEDVKAWP DVLQAVTPED
VMAAAREVFD RRRAVTGWLM QADEETSQ