Gene RSP_3855 details


Gene Information

Locus tag: RSP_3855
Symbol:
ID: 4796518
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides 2.4.1
Kingdom: Bacteria
Replicon accession: NC_009007
Strand:
Start bp: 25733
End bp: 28822
Gene Length: 3090 bp
Protein Length: 1029 aa
Translation table: 11
GC content: 70%
IMG OID: 640102969
Product: hemolysin-type calcium-binding protein
Protein accession: YP_001033818
Protein GI: 125654624
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2931] RTX toxins and related Ca2+-binding proteins
TIGRFAM ID:
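
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(25733, 28822)  # Start bp and End bp from the record
    print(length)                          # 3090
    print(protein_length_aa(length))       # 1029
```

Both results match the Gene Length and Protein Length values listed in the record.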


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.460711
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCATTC TTCCGACCTT CAGCGTCACC AGCACCCCCG TCGTCGAGGG CAATTACCTC 
ACCTACACCA TCCGGCTCTC CGAGCCCGCC CCCGATGCGG TGAGCGTGGA TTACCTGTTC
CGCTCCGGCA CCGCGCTGCT CGACGAGGAT TTCTATTCCA CCGCCCCCTC GGGCACGCTC
GTCTTCGCCC CCGGCGAGAC GGTGCTGACC CTCCAGATCC GGTCCTACAA CGACAGGCTC
GACGAGATCG ACGAGAGCTT CTTTCTCGAA CTGAGCGATC CGCAGGGCGC GCAGTTCGGC
GCCAACATCT CGACCCTCGT CACCGCGGGC TGGGTGATCG ACGACGATGG CGTGGGGGTG
AACCGGGCGC TGGCCGTCTC GAACCCCGTG GTGACCGAGG CCGCGGGCGG GCAGGCGCTC
TTCACCCTCA CGCTGTCGGA GGCCTTCACC ACCGACCGCA GCTTCACCTA CACGACCTAC
GATGGGTCGG CCCGGGCGGG CGCGGATTAC GTGGCCCGCA CCGGCACAGT CACCTTCCTC
GCCGGCCAGA CCGAGGCCAC GGTCGCGGTC AATCTGATCA ACGATGGCGC CGTCGAGGCG
GGCGAGACCT TCGGCCTCGC GGTCACCGGC GCCCATGGCG TGCCCGCCGC GACCGGCACG
GCCGAGATCC TGAACGACGA CGGGCCGGTG CCGGTGATCT CGGTCGAGGG CGACCGGGTG
GTCGAGGGCA GCTACCTCAC CTATACGATC CGGCTCTCCG AGCCCGCGAC CGACGCGGTG
AGCGTGGATT ACCTGTTCCG CTCGGGCACC GCGCTGCTCG ACGAGGATTT CTATTCCACC
GCCCCCTCGG GCACGCTCAC CTTCGCCCCG GGCGAGACGG TGCAGACCCT CCGCATCCGC
TCCTACAACG ACAGTCTCGA CGAAATCGAC GAGAGCTTCT TTCTCGAGCT GAGCGAACCG
CAGGGCGCGC GGTTCGGCGC CAACATCTCG ACCCTCGTCA CGACGGGCTG GGTGATCGAC
AATGACGGCG TGGGGGTGAA CCGGGCACTG GCCGTCTCGA ACCCGGTGGT GACCGAGGCC
GCAGGCGGGC AGGCGCTCTT CACCCTCACG CTGTCGGAGG CCTTCACCAC CGACCGCAGC
TTCACCTACA CGACCCATGA CGGCTCGGCC CGGGCAGGCG CGGATTATGT TGCCCGCACC
GGCACCGTCA CCTTCCTCGC CGGCCAGACC GAGGCCACGG TCGCGGTCAA TCTGATCAAC
GATGGCGCGG TCGAGGCGGG CGAGACCTTC GGTCTGGCCG TCACCGGCGC CCATGGCGTG
CCCGCCGCCA CCGGCACGGC CGAGATCCTG AACGACGACG GGCCGGTGCC AGTGATCTCG
GTCGAAGGCG ACCGGGTGGT CGAGGGCAGC TACCTCACCT ATACGATCCG GCTCTCCGAG
CCCGCGACCG ACGCGGTGAG CGTGGATTAC CAGTTCCGCT CCGGCACCGC GCTGCTCGAC
GAGGATTTCT ATTCCACGGC CCCCTCGGGC ACCCTCACCT TCGCCCCCGG CGAGACGGTG
CAGACCCTCC GCATCCGTTC CTACAACGAC AGTCTCGACG AGATCGACGA GAGCTTCTTT
CTCGATCTGA GCGATCCCCG CGGCGCGCAG TTCGGCGGCG GCAACCGCGC CCTGTCCACC
GGCGGCTGGG TGCTCGACAA TGACGGGGTG GGGCTGAACC GCTCGGTCTC GGTCGGCCAT
GCCACGCTGC AGGAAGGGCC CGGCGGCCGG GTCGCGGTCT TCGTGGTCGA GCTTTCGGCC
GCCTCGGCCG AGCGGATCGC CATCGGCTTC CAGACCCTCG CCGGGACGGC CCGCCCGGGC
AGCGACTTCG CCGCCCGGTC GGGCGAGGTG GTCTTCCTGC CCGGCCAGAC CCGGGCCGAA
ATTCTCATCC CCATCCTCGA CGATCTCGTG CTCGAGAATA CCGAGAGCTT CAGCCTACGC
CTCGTGCCGC CCTTCCCCAG CGCCATCTCC TCGGCCACGC CGGTGCCGGT GGGGGTGGCC
ACCCTCCTCG ACGGCACGCT CCGCGGCTCG GGCGGCAACG ACCGGCTGAT CGGCACGGCC
AATGCCGAGC GGATCGAGGG GTTCGGCGGC AACGACCGGA TCGAGGGCCG GGGCGGCAAT
GACCTTCTGT CGGGCGGCGC GGGCAACGAC CTGCTCGACG GCGGCGGCGG ACGCGACCGG
ATGGTGGGGG GCACGGGCAA CGACCGCTAT ATCGTCAACC ATGCCGGGGA CAGCACGATC
GAGCTGGCCG GGGGCGGTAT CGACACGGTG CAGAGCAGCC TCTCCTGGAC CCTGGCCGCC
AATGTCGAGC GGCTGGTGCT GACCGGCGGG GCTGCCCTGT CGGGCACCGG CAACGGGCTT
GCGAACCTGC TCACCGGCAA TGGCGGGGCG AACCGGCTGC AGGGGCTTGC CGGCAACGAC
ACGCTGAACG GCGGCGGCGG GCGCGACGTC ATGATCGGCG GGACGGGCAA CGACACCTAC
ATCACCGATG GCGGCGACAC GATCGTGGAG CGCGCGGGCC AGGGGGTGGA TGTGGTGCGC
GCCTCGGTCA GCTACACGCT GGGCGCCCAG CTCGAGACGC TGGTGCTGAC GGGCACGGCG
AACCTTTCGG GGACCGGCAA CGGGCTTTCG AACCTGCTGA TCGGCAATGG CGGCGCGAAC
CGGCTGAGCG GCGGCGCCGG CAACGACACG CTGAGCGGCG GCGGCGGGGC GGATCTGCTG
ATCGGCGGCG CCGGGCGCGA CAGCTTTGTC TTCAACACCC GCCCCGGGCC CGGCGCGATC
GACCGGATCG CGGATTTCAA CGTGGCCGAC GACGTGATCC ATCTGGAAAA TGCGGTGTTC
CGCGGCCTTC CGGCGGGGGC GCTGCGCGGG GCGGCCTTTG CCTCCAACCT CTCGGGACAG
GCCACCGATG CGGCCGACCG GATCCTCTAT GAGCGCGACA CCGGGGCGCT CTGGTTCGAT
GCCGACGGCA CCGGAGGGGG CGCGCGGGTG CAGATCGCGA CCCTCTCGGC GGGGCTCGGC
CTGACGGCCG CGGATTTCTT CGTGATCTGA
 
Protein sequence
MAILPTFSVT STPVVEGNYL TYTIRLSEPA PDAVSVDYLF RSGTALLDED FYSTAPSGTL 
VFAPGETVLT LQIRSYNDRL DEIDESFFLE LSDPQGAQFG ANISTLVTAG WVIDDDGVGV
NRALAVSNPV VTEAAGGQAL FTLTLSEAFT TDRSFTYTTY DGSARAGADY VARTGTVTFL
AGQTEATVAV NLINDGAVEA GETFGLAVTG AHGVPAATGT AEILNDDGPV PVISVEGDRV
VEGSYLTYTI RLSEPATDAV SVDYLFRSGT ALLDEDFYST APSGTLTFAP GETVQTLRIR
SYNDSLDEID ESFFLELSEP QGARFGANIS TLVTTGWVID NDGVGVNRAL AVSNPVVTEA
AGGQALFTLT LSEAFTTDRS FTYTTHDGSA RAGADYVART GTVTFLAGQT EATVAVNLIN
DGAVEAGETF GLAVTGAHGV PAATGTAEIL NDDGPVPVIS VEGDRVVEGS YLTYTIRLSE
PATDAVSVDY QFRSGTALLD EDFYSTAPSG TLTFAPGETV QTLRIRSYND SLDEIDESFF
LDLSDPRGAQ FGGGNRALST GGWVLDNDGV GLNRSVSVGH ATLQEGPGGR VAVFVVELSA
ASAERIAIGF QTLAGTARPG SDFAARSGEV VFLPGQTRAE ILIPILDDLV LENTESFSLR
LVPPFPSAIS SATPVPVGVA TLLDGTLRGS GGNDRLIGTA NAERIEGFGG NDRIEGRGGN
DLLSGGAGND LLDGGGGRDR MVGGTGNDRY IVNHAGDSTI ELAGGGIDTV QSSLSWTLAA
NVERLVLTGG AALSGTGNGL ANLLTGNGGA NRLQGLAGND TLNGGGGRDV MIGGTGNDTY
ITDGGDTIVE RAGQGVDVVR ASVSYTLGAQ LETLVLTGTA NLSGTGNGLS NLLIGNGGAN
RLSGGAGNDT LSGGGGADLL IGGAGRDSFV FNTRPGPGAI DRIADFNVAD DVIHLENAVF
RGLPAGALRG AAFASNLSGQ ATDAADRILY ERDTGALWFD ADGTGGGARV QIATLSAGLG
LTAADFFVI
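
The sequences above are laid out in blocks of ten characters, so they need the whitespace removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. All function names are hypothetical. The codon table is built from the standard codon assignments, which translation table 11 shares for elongation and stop codons; the alternative start codons that table 11 allows are not handled, which should not matter here because the gene begins with ATG.

```python
# Standard codon assignments, ordered by first, second, third base in "TCAG".
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in an already cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate a cleaned, in-frame DNA string; stop codons become '*'."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


if __name__ == "__main__":
    # First line of the gene sequence, copied from the record above.
    first_line = ("ATGGCCATTC TTCCGACCTT CAGCGTCACC "
                  "AGCACCCCCG TCGTCGAGGG CAATTACCTC")
    dna = clean_sequence(first_line)
    print(translate(dna))  # MAILPTFSVTSTPVVEGNYL
    print(round(gc_content(dna), 2))
```

The translated fragment matches the first twenty residues of the protein sequence. Applying the same functions to the full gene sequence should give the complete protein followed by `*` for the terminal TGA codon.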