Gene Rsph17029_1872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1872 
Symbol 
ID4895193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1980787 
End bp1982382 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content68% 
IMG OID640112466 
Productserralysin 
Protein accessionYP_001043748 
Protein GI126462634 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2931] RTX toxins and related Ca2+-binding proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.043105 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCCTTT CCTCGCTTTT CCCCACTTTC GCCGGGCAAG GGCCCGAAAC CGCACTGACG 
GACCGCGACG GACGCGCCGA GGCCGCTGCC TTCCAGACGA CCGCGGCGAA CAGGCTCGCT
GCCGAACCGG CGGTGGCCTC GATGCGCGAG ATCGCCGATC AGCTCGTCAG CGGCTATTGG
CGGTCGGAGG GCCACTCCAT CCGCCGCTTC GACGTGGAGC CGGGAGACAC GCTGAACGTG
AACCTGTCTG GTCTGACGGC CGATGGCCAG CGTCTCGCGC GGGCCGCGCT GGACACCTGG
GCGGATGTCA CGGGGCTGAA GTTCAATACG GCCCCCCGCT TCGGCGCGAC GGTTCATATC
GTGATCGACG ACAGGGAGAA CGGGGCTCAC TCCACGGCCA CCCTGTCGGG CGGACGGCTG
ATCAAGGCAA ATGTGAATGT CGGCCGCGAC TGGCTCGCGG ACTACGGCAC CGACCTGAAC
AGCTACAGCC TGCAGACCTT CATCCACGAA ATCGGCCATG CGCTGGGTCT CGGCCATGCC
GGCAACTACA ACGGCTCGGC CGATTATGCG AGCGACGCGC TCTTTCTGAA CGATAGCTGG
CAGGCCAGCG TCATGTCCTA TTTCAGTCAG ACCGAGAACA GCTACATCGA CGCCGACAAG
GCGTTCGTCG TGACGCCGAT GATCGCCGAC ATTCTGGCGA TCCAGACGCT CTACGGCACG
CCGGGCGGGC TGCGGGCAGA CAGCACGGTC TATGGCACGG GCTCGACGGC GGGCGGCATG
TATGACCGCG CGGCGACCTT GACCGCGGAA CCCGTGGCCT GGACGATCTT CGACCAGGGC
GGGTGGGACA GGCTCGACCT GCGCGGGGCA GAGGCTCCGC AACTCATCGA CCTGCGGCCG
GGCGCAATTT CGGACATCTA TGGCGCGAAG GGCAATCTCT CGATCGCCCC GGGCACGGTG
ATCGAGGCCG CGCGGGGCGG CAGCGGCAAC GACCGGCTCA TCGGCAACGG CGTGGCCAAC
GTCCTGAACG GCGGCGGGGG CTCGGACATC ATGCGCGGCG GGGCGGGCAA CGACATCTAC
CACAGCGACG GCCTCGACCG GATCGTGGAG CTTCCGGGGC AGGGGATCGA CCGGGTGATC
TCGACGGCGA GCTGCACCCT TGCGGCCAAT GTGGAGAACC TGACGCTGGC GGGCGGACGG
GCCATCACCG GCACGGGCAA CGGGCTTGCG AACGTCATCA CGGGCAATGC GGCGGCCAAC
CGCCTGACCG GCGGGGCCGG TGCCGACAGC TTCGTCTTCG CGACCGCGCT CGGCCGCGGC
AACGTCGACA GGATCACGGA TTTCTCGGTG GCCGACGACA CGATCCGCCT CGACGACGCC
ATCTTCCGGG CGCTGCCGCG GGGCAGTCTG GCCGACGCGG CATTCGCGGC CAATGCGGCG
GGGCGGGCGC TCGACGCGCT CGACCGCATC CTCTACGAGA CCGACACGGG CGCGCTCTGG
TATGACCGCG ACGGGACGGG TGCCGCGGCC GCCGTGCGCT TCGCTTGGGT GACGCCAGAG
CTTGCGCTGA CCGCCGCCGA TTTCCTCGTG ATCTGA
 
Protein sequence
MFLSSLFPTF AGQGPETALT DRDGRAEAAA FQTTAANRLA AEPAVASMRE IADQLVSGYW 
RSEGHSIRRF DVEPGDTLNV NLSGLTADGQ RLARAALDTW ADVTGLKFNT APRFGATVHI
VIDDRENGAH STATLSGGRL IKANVNVGRD WLADYGTDLN SYSLQTFIHE IGHALGLGHA
GNYNGSADYA SDALFLNDSW QASVMSYFSQ TENSYIDADK AFVVTPMIAD ILAIQTLYGT
PGGLRADSTV YGTGSTAGGM YDRAATLTAE PVAWTIFDQG GWDRLDLRGA EAPQLIDLRP
GAISDIYGAK GNLSIAPGTV IEAARGGSGN DRLIGNGVAN VLNGGGGSDI MRGGAGNDIY
HSDGLDRIVE LPGQGIDRVI STASCTLAAN VENLTLAGGR AITGTGNGLA NVITGNAAAN
RLTGGAGADS FVFATALGRG NVDRITDFSV ADDTIRLDDA IFRALPRGSL ADAAFAANAA
GRALDALDRI LYETDTGALW YDRDGTGAAA AVRFAWVTPE LALTAADFLV I