Gene RSP_3539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_3539 
Symbol 
ID3721953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007494 
Strand
Start bp622498 
End bp624168 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content66% 
IMG OID640073203 
Producthemolysin-type calcium-binding region, RTX 
Protein accessionYP_355041 
Protein GI77465538 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.223263 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCGGGC TGAACCTCGA AGGCACCGAC GGCGCCGATC TGCTCATCGG CTCGCGGGGC 
GCGGACGTGA TCTCCGGCCG GATGTTCAAC GACACGCTGA TGGGCGGGGC GGGCAACGAC
AGCCTCTACG GCAACGGTGA CGACGACCGC CTCTACGTTA ACGAGGGCAA CGACAGCCTC
TATGGCGAGG AGGGCAACGA CTGGCTGCAT GGCGGTCAGG GCGACGACCT GGTCGTGGGC
GGCGACGGCA ACGACACGCT CGCGGGCGGT CTGGGCAACG ACACCCTGCA GGGTGGCGCG
GGCAACGACA CGGCCAGCTA CGAAACGGCC ACCGAGGGCG TTACCGTCAG CCTCGCGCTG
CAGGGCGAAG GCCAGTTCGT GAACGCGCAG GAAGGCAACG ACCCGCTGAC CTCGATCGAG
AACCTGACGG GCAGCAATCA CGACGACACG CTGATCGGGG ACGAGGGCGA CAACGTGCTC
TCGGGTCTCG CGGGCAACGA CGTGCTGGTG GGCGGCGCGG GCAATGACAC GCTGCTCGGC
GGTGCCGGCA ACGACATCGC CGACTACGCC GCGGCGACGG GCGGGGTGAC GGTCAATCTG
GCGCGTGATG GGCAGGCGCA GATCATCGGC GCCGATCAGG GCACCGATGT CCTGAGCTCG
ATCGAGGGTG TCATCGGCAG CGCCTTCAAC GACATCCTGT CGGGCAGCGC GGTCGCCAAC
CTCATCTTCG GTGGGGACGG TGCCGACCTG GCCACCGGTG GCGCGGGCAA CGACACCATC
CTCGGCGGCG CCGGATCGGA CAGCCTCTAT GGCAACCTTG GGGATGACCT CCTCTTTGGT
GACGTGGGCA ACGACTGGAT CCACGGCGGC CAGGGCAACG ACACCGTCCT CGGCGGTTTC
GGCGACGATA CGCTGGCCGG CGGCGTCGGT GACGATGTGG TGGATGGCGG CGATGGGATC
GACACCGTCG AGTTCCAGAC CGCAACCGCC GGTGTCACCG TGGATCTCTC GCTGCAGGGT
CAGGCGCAGC GCATCAGTGC CGAGGAAGGC ACGGATACGC TGTTCTCGAT CGAGAACATC
CTCGGCAGCC GGTATGACGA CCGCCTGCTG GGCGATGCGG GCTCCAACTT GATCGACGGC
AGTGCCGGCA ACGACACTGC CATGGGTCAG GCGGGCGAGG ACCTCATCTT CGGCGGGGAC
GGCAACGACA GCCTCTATGG CAACCAGGAC AACGACACTC TGGTCGGCGG CAACGGCAAC
GACTGGTTGC ACGGCGGTCA GGGCAACGAT CTCCTGGTGG GCGATGCCGG CAGCGACACC
CTCAACGGCG GCGTGGGCGA CGATGTGCTG GTCGGGGGTC AGGGCTTCGA CCTTCTGACG
GGCGGCACCG GGGCGGACAC TTTCGTCTTC GGCAGCCTCG ACAGCGCGGA TGCGGATCGG
ATCACCGATT TCGAGCAGGG CGTCGACCAG ATCGTGATCG CCGACCAGCT GATGTGGGCG
CTGGAGAATG CCGAGCTGAA CCTCGCCGAT CAGATCGTCT GGAATGCCGA GACCGGCATG
CTCTCCATCG ATCTCGACGC CGGGGAGGCG ACCCGTCTGG TGGATCTTGC TCAGATCGAT
CATGATGGAA CGCTGAACAT CACGATCGAC GACTTCCAGT TCCTGCGCTG A
 
Protein sequence
MVGLNLEGTD GADLLIGSRG ADVISGRMFN DTLMGGAGND SLYGNGDDDR LYVNEGNDSL 
YGEEGNDWLH GGQGDDLVVG GDGNDTLAGG LGNDTLQGGA GNDTASYETA TEGVTVSLAL
QGEGQFVNAQ EGNDPLTSIE NLTGSNHDDT LIGDEGDNVL SGLAGNDVLV GGAGNDTLLG
GAGNDIADYA AATGGVTVNL ARDGQAQIIG ADQGTDVLSS IEGVIGSAFN DILSGSAVAN
LIFGGDGADL ATGGAGNDTI LGGAGSDSLY GNLGDDLLFG DVGNDWIHGG QGNDTVLGGF
GDDTLAGGVG DDVVDGGDGI DTVEFQTATA GVTVDLSLQG QAQRISAEEG TDTLFSIENI
LGSRYDDRLL GDAGSNLIDG SAGNDTAMGQ AGEDLIFGGD GNDSLYGNQD NDTLVGGNGN
DWLHGGQGND LLVGDAGSDT LNGGVGDDVL VGGQGFDLLT GGTGADTFVF GSLDSADADR
ITDFEQGVDQ IVIADQLMWA LENAELNLAD QIVWNAETGM LSIDLDAGEA TRLVDLAQID
HDGTLNITID DFQFLR