Gene Rsph17029_4185 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_4185 
Symbol 
ID4894964 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009040 
Strand
Start bp119508 
End bp122597 
Gene Length3090 bp 
Protein Length1029 aa 
Translation table11 
GC content70% 
IMG OID640110576 
ProductNa-Ca exchanger/integrin-beta4 
Protein accessionYP_001041888 
Protein GI126464912 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2931] RTX toxins and related Ca2+-binding proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones53 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones87 
Fosmid unclonability p-value0.610735 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCATTC TTCCGACCTT CAGCGTCACC AGCACCCCCG TCGTCGAGGG CAATTACCTC 
ACCTACACCA TCCGCCTCTC CGAGCCCGCC CCCGATGCGG TGAGCGTGGA TTACCTGTTC
CGCTCCGGCA CCGCGCTGCT CGACGAGGAT TTCTATTCCA CCGCCCCCTC GGGCACGCTC
GTCTTCGCCC CCGGCGAGAC GGTGCTGACG CTCCAGATCC GGTCCTACAA CGACAGGCTC
GACGAGACCG ACGAGAGCTT CTTTCTCGAA CTGAGCGATC CGCAGGGCGC GCAGTTCGGC
GCCAACATCT CGACCCTCGT CACCGCGGGC TGGGTGATCG ACGACGATGG CGTGGGGGTG
AACCGGGCGC TGGCCGTCTC GAACCCCGTG GTGACCGAGG CCGCGGGCGG GCAGGCGCTC
TTCACCCTCA CGCTGTCGGA GGCCTTCACC ACCGACCGCA GCTTCACCTA CACGACCCAT
GACGGCTCGG CCCGGGCGGG CGCGGATTAT GTCGCCCGCA CCGGCACCGT CACCTTCCTC
GCCGGCCAGA CCGAGGCCAC GGTCGCGGTC AATCTGATCA ACGATGGCGC GGTCGAGGCG
GGCGAGACCT TCGGCCTCGC CGTCACCGGC GCCCATGGCG TGCCCGCCGC CACCGGCACG
GCCGAGATCC TGAACGACGA CGGGCCGGTG CCGGTGATCT CGGTCGAGGG CGACCGGGTG
GTCGAGGGCA GCTACCTCAC CTATACGATC CGGCTCTCCG AGCCCGCGAC CGACGCGGTG
AGCGTGGATT ACCAGTTCCG CTCGGGCACC GCGCTGCTCG ACGAGGATTT CTATTCCACC
GCCCCCTCGG GCACGCTCGT CTTCGCCCCC GGCGAGACGG TGCAGACCCT CCGCATCCGC
TCCTACAACG ACAGTCTCGA CGAGATCGAC GAGAGCTTCT TTCTCGAGCT GAGCGAACCG
CAGGGCGCGC GGTTCGGCGC CAACATCTCG ACCCTCATCA CGACGGGCTG GGTGATCGAC
AATGACGGCG TGGGGGTGAA CCGGGCGCTG GCCGTCTCGA ACCCCGTGGT GACCGAGGCC
GCGGGCGGGC AGGCGCTCTT CACCCTCACG CTGTCGGAGG CCTTCACCAC CGACCGCAGC
TTCACCTACA CGACCCATGA CGGGTCGGCC CGGGCGGGCG CGGATTATGT CGCCCGCACC
GGCACCGTCA CCTTCCTCGC CGGCCAGACC GAGGCCACGG TCGCGGTCAA TCTGATCAAC
GATGGCGCCG TCGAGGCGGG CGAGACCTTC GGCCTCGCGG TCACCGGCGC CCATGGTGTG
CCCGCCGCCA CCGGCACGGC CGAGATCCTG AATGACGACG GGGCGGTGCC GGTGATCTCG
GTCGAAGGCG ACCGGGTGGT CGAGGGCAGC TACCTCACCT ATACGATCCG GCTCTCGGAA
CCCGCGACCG ACGCGGTGAG CGTGGATTAC CAGTTCCGCT CGGGCACCGC GCTGCTCGAC
GAGGATTTCT ATTCCACCGC CCCCTCGGGC ACGCTCACCT TCGCGCCCGG CGAGACGGTG
CAGACCCTCC GCATCCGCTC CTACAACGAC AGTCTCGACG AGATCGACGA GAGCTTCTTT
CTCGATCTGA GCGATCCCCG CGGCGCGCAG TTCGGCGGCG GCAACCGCGC GCTGTCCACC
GGCGGCTGGG TGCTCGACAA TGACGGGGTG GGGCTGAACC GCTCGGTCTC GGTCGGCCAT
GCCACGCTGC AGGAAGGGCC CGGCGGCCGG GTCGCGGTCT TCGTGGTCGA GCTTTCGGCC
GCCTCGGCCG AGCGGATCGC CATCGGCTTC CAGACCCTCG CCGGGACGGC CCGCCCGGGC
AGCGACTTCG CCGCCCGGTC GGGCGAGGTG GTCTTCCTGC CCGGCCAGAC CCGGGCCGAA
ATTCTCATCC CCATCCTCGA CGATCTCGTG CTCGAGAATA CCGAGAGCTT CAGCCTGCGC
CTCGTGCCGC CCTTCCCCAG CGCCATCTCC TCGGCCACGC CGGTGCCGGT GGGGGTGGCC
ACCCTCCTCG ACGGCACGCT CCGCGGCTCG GGCGGCAACG ACCGGCTGAT CGGCACGGCC
AATGCCGAGC GGATCGAGGG GTTCGGCGGC AACGACCGGA TCGAGGGCCG GGGCGGCAAT
GACCTTCTGT CGGGCGGCGC GGGCAACGAC CTGCTCGACG GCGGCGGCGG ACGCGACCGG
ATGGTGGGGG GCACGGGCAA CGACCGCTAT ATCGTCAACC ATGCCGGGGA CAGCACGATC
GAGCTGGCCG GGGGTGGCAT CGACACGGTG CAGAGCAGCC TCTCCTGGAC TCTGGCCGCC
AATGTCGAGC GGCTGGTGCT GACCGGCGGG GCTGCCCTGT CGGGCACCGG CAACGGGCTT
GCGAACCTGC TCACCGGCAA TGGCGGCGCG AACCGGCTGC AGGGGCTTGC CGGCAACGAC
ACGCTGAACG GCGGCGGCGG GCGCGACGTC ATGATCGGCG GGACGGGCAA CGACACCTAC
ATCACCGATG GCGGCGACAC GATCGTGGAG CGCGCGGGCC AGGGGGTGGA TGTGGTGCGC
GCCTCGGTCA GCTACACGCT GGGCGCCCAG CTCGAGACGC TGGTGCTGAC GGGCACGGCG
AACCTTTCGG GGACCGGCAA TGGGCTTTCG AACCTGCTGA TCGGCAATGG CGGCGCGAAC
CGGCTGAGCG GCGGCGCCGG CAACGACACG CTGAGCGGCG GCGGCGGGGC GGATCTGCTG
ATCGGCGGCG CCGGGCGCGA CAGCTTTGTC TTCAACACCC GCCCCGGGCC CGGCGCGATC
GACCGGATCG CGGATTTCAA CGTGGCCGAC GACGTGATCC ATCTGGAAAA TGCGGTGTTC
CGCGGCCTTC CGGCGGGGGC GCTGCGCGGG GCGGCCTTTG CCTCCAACCT CTCGGGACAG
GCCACCGATG CGGCCGACCG GATCCTCTAT GAGCGCGACA CCGGGGCGCT CTGGTTCGAT
GCCGACGGCA CCGGAGGGGG CGCGCGGGTG CAGATCGCGA CCCTCTCGGC GGGGCTCGGC
CTGACGGCCG CGGATTTCTT CGTGATCTGA
 
Protein sequence
MAILPTFSVT STPVVEGNYL TYTIRLSEPA PDAVSVDYLF RSGTALLDED FYSTAPSGTL 
VFAPGETVLT LQIRSYNDRL DETDESFFLE LSDPQGAQFG ANISTLVTAG WVIDDDGVGV
NRALAVSNPV VTEAAGGQAL FTLTLSEAFT TDRSFTYTTH DGSARAGADY VARTGTVTFL
AGQTEATVAV NLINDGAVEA GETFGLAVTG AHGVPAATGT AEILNDDGPV PVISVEGDRV
VEGSYLTYTI RLSEPATDAV SVDYQFRSGT ALLDEDFYST APSGTLVFAP GETVQTLRIR
SYNDSLDEID ESFFLELSEP QGARFGANIS TLITTGWVID NDGVGVNRAL AVSNPVVTEA
AGGQALFTLT LSEAFTTDRS FTYTTHDGSA RAGADYVART GTVTFLAGQT EATVAVNLIN
DGAVEAGETF GLAVTGAHGV PAATGTAEIL NDDGAVPVIS VEGDRVVEGS YLTYTIRLSE
PATDAVSVDY QFRSGTALLD EDFYSTAPSG TLTFAPGETV QTLRIRSYND SLDEIDESFF
LDLSDPRGAQ FGGGNRALST GGWVLDNDGV GLNRSVSVGH ATLQEGPGGR VAVFVVELSA
ASAERIAIGF QTLAGTARPG SDFAARSGEV VFLPGQTRAE ILIPILDDLV LENTESFSLR
LVPPFPSAIS SATPVPVGVA TLLDGTLRGS GGNDRLIGTA NAERIEGFGG NDRIEGRGGN
DLLSGGAGND LLDGGGGRDR MVGGTGNDRY IVNHAGDSTI ELAGGGIDTV QSSLSWTLAA
NVERLVLTGG AALSGTGNGL ANLLTGNGGA NRLQGLAGND TLNGGGGRDV MIGGTGNDTY
ITDGGDTIVE RAGQGVDVVR ASVSYTLGAQ LETLVLTGTA NLSGTGNGLS NLLIGNGGAN
RLSGGAGNDT LSGGGGADLL IGGAGRDSFV FNTRPGPGAI DRIADFNVAD DVIHLENAVF
RGLPAGALRG AAFASNLSGQ ATDAADRILY ERDTGALWFD ADGTGGGARV QIATLSAGLG
LTAADFFVI