Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3678 |
Symbol | |
ID | 4898484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 778009 |
End bp | 782178 |
Gene Length | 4170 bp |
Protein Length | 1389 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640114286 |
Product | cadherin |
Protein accession | YP_001045540 |
Protein GI | 126464427 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.210841 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGACA ATGCGATCGT CGTCGAGAAC GCCCGGACCG ACGGGGTGAT GCCGAAGTCC TACTGGGATG TCGAGCATTC GACCTTCATC GAAGGGTTCG CCACCGACAT CAGCGTCAAT GCCGGCCAGC GGATCGACTT CAAGATCAAC GTCAACGACG AGGCGGGAAG CGATTACAAG GTCGAGGTCT TCCGGCTGGG CTATTACGGC GGAGACGGCG CGCGGAAGGT GGCGGAATGG GTCAATACCG ACGCCAAGGT GCAGGATCAG GCGACCTACA ATCCGGCGCT GGGGCTCGTG GATGCGGGCA ACTGGTCGGT GACGGACAGC TGGCTCACGC CGACCGACGC GGTGTCAGGG GTCTATATCG CGCGCGTCCA GAGGCTCGAT GCCGCGGGCA ACCCGATCGA GGGCGAGGTC AACCAGATCC CCTTCATCCT GCGCGAGGAT GACCGGGCTT CGGACATCGT GCTCCAGACC TCGGACACGA CCTGGCAGGC CTACAACGGC TGGGGCGGCA ACAACGGCCA GATCGGCGCC AACCTCTATG GCGACGTCAG CGGCACGATC GACCACCCGG ACATCCCCGG CGCCAGTTCC TACGAGCCGG ACCGGGCCTA TGCGGTCAGC TACAACCGCC CGCTCATCAC GCGCGGGATC GACGGCGAGC AGGGCGGCGT CGCGGCCGGG GCGCAGGACT ATCTGTTCGG TGCCGATTAT GCGGCGATCT ACTGGCTGGA GAAGAACGGC TACGACGTCT CCTACATGTC GGGCGTGGAT ACCGACCGGC TCGGCGCCGA CTATCTGAAG AAATATCAGG CCTTCATCTC GGTCGGGCAC GACGAATACT GGTCTGGCGG CCAGAGGTCG AATGTCGAGG AGGCCCGCGA CGACGGCGTC AACCTGCTCT TCTGGGGCGG CAACGACGTC TACTGGAAGA CGCGCTACGA GGCGAGCGCG GTCGACGGCG TCGATTACCG CACCCTCGTC TGCTACAAGG AGACGACGGC GGTCGCCGAC CCGAACGCGG GACCGGAAGA CTACTACAAC CTCGATCCGA CCGACATCTG GACCGGCACC TGGCGCGACA CCCGCTTCCA CGGCAACCCT CTGGCGGGGG GCGGCCTGCC CGACGATTTC CTCTCGGGCC AGTGCCCGAC CTGCAACTGC GCCGAGAACT CGCTCACCGG CCAGCTGTTC GGGCCCGACG GCACCGGCGA ATTCGGCGGC GCGCTGGATG TGACCGGCGA ATATGCGCAG CTCCGGGTCT GGCGCGACAC CACCATCGCG AACGGCGGCG CGCTCGACAT CGCTCCGGGT CTGCTCGGCT ACGAGTGGAA CACCTCGCCC GACGACTACA ACCGGCCGGC CGGCCTCGTG CATCTGTCCG AGACGACCAT TCCCTGGGGC GCGATCCTGA CCGATCAGGG CAACACCACC GCGCCGGGCA TCGCCACGCA CAATCTGTCT CTCTACCGCG CCGAGAGCGG CGCGCTCGTC TTCGGGGCCG GCACGGTGTT CTGGACCTGG GCGCTCAGCA ACCTGCACGA CGGCGAGCCC TACAATGCGC AGATCGAGAA CCGCGACCTG CAGCAGTTCG TGATGAACAT GTTCGCCGAC ATGGGCATCC AGCCCGGTGT GGCCGATGCG GTGCTGGCCT CGCAGGGGCT GCTGCGCGCG CTGGCCTCGG CGGACCATGC GCCCGCCACG GCGCTCATCA ACGACCTGCC CGACACGCTG CCCGCGCTCG AGACCGTCAC GATCACCGGC ACCGCCACCG ACGACGACGG CAATCCCTTG ACCGAGGACG GACGGGTGGC GCTGGTCGAG GTGTCCCTCG ACGGCGGCAC GACCTGGACG GCGGCACAGG GCACGACGAA CTGGAGCTTC AGCTGGACGC CGACCCGTCA GGGGATCTTC GACATCCGGG TGCGAGCCTT CGACGACTCC CTGAACCTGC CGGTGGCCGT GACGCTCGAC CGCGAGACGG TCGAGATCAC CGCACCCGTG GTGCCCCCCG AGGTGAGCCT CTTCGACCCG TTCGTGACCT TCAACGGCGA GTCGCACAAT GACAACACGG CGCTCGAACT CGGGACGCGC TTCACGGCGC TGCAGACGGG CTCCGTCACC GAACTGCGCT ACTATCGCGC GGCCTCGGAT GCGACCGACA CGGATACGCG GACGGGGCAT CTGTGGCGCG CGGACGGCAC GCTGCTGGCG ACCGTGACCT TCGTCTCGAC GCCGGGTCAG ACGGGCTGGC AGACGGCGGA GCTGTCGTCG CCGATCACGC TGGCGGCGGG GCAGAGCTAT GTCGTCTCCT ACACGACGCA GGACAATTAC GTCGGCACGA ACGGCTTCTT CTCGACCGGC TACGTCGATC CCTACGGCAT CCTGAGCGCC GGGGCGGGCA CCGGCGGCGT GTTCGCGGTG GGCTCGAACA TCTTCCCCCA ATCGAGCTAT CAGGGGACGA GCTACTGGGT GGACATCACC TTTGCCCCGC AGGCGGTGGG AACCGGCCTG CCGGTCTTCG CCGGGCCCTC GAGTCTCTCG GTGGCCGAGA ACGACCTTTA TGTGGGCACG GTCGCCGCCA CCGACCCGCA GGGCCAGCCG ATCAGCTACG CCATCACCGG CGGGGCGGAT GCCGGTCTCT TCAGCATCGA CGCGAAGACG GGCGCGCTGC ATTTCCGCTT CCCCGCCAAC CATGAGCTGC CCACCGACGC GGACGGCAAC AACATCTTCC AGTTCCGCGT GACGGCCACG GATCTGACCG GCGAGGCCGC CACGCGCGAC TTTTCGGTGA CCCTCACCGA TGTGGTCGAC GAGTCCCTCC ATCCGTCGCT TCTCTTCGGC CCGGCGGATG CTCCCGCCGC CTCGATCACC GACGATCCCA CCGACTATGA GCTCGGGATG CGGTTCCAGG CCGCGAGCAG CGGCGAGATC ACCAGCCTGC GCTACTTCCG GGGCGCGGCC GATGCGGGCG ACGTGGACAC GCGCACCCTG CATCTGTGGA GCGCGACGGG CGTGCTGCTC GCCTCGGTCG AGGTGACCTC GGGCGCGGGC CAGTCCGGGT GGCAGACCGG CACGCTCTCC GCGCCGATCC AGATCGAGGC GGGCCAGACC TATGTCGTCT CCTACGGCAC GGTGCAGAAC TATGTCGCGA CGCAGAATTA CTTCACCTCC GACCATGTCG GCGCCGATGG CGCCCTGACC GGCCTCGGCG GCACCGGCAA CGGGGTCTAT CATGCCGGCG GCACCGGGAT CTTCCCGACC TCGAGCTATC TGAGCACGAA CTACTGGGTC GATGTGGTCT TCGAGCCGGG CAGCAACGGC ACGACCAACA GCGCGCCCAC CTTCACCAGC GCCGCCACCG TCCCGGCCGC CGAGAACCAG ACCGTCGCGA TCGATCTTTC GGCGGTGGAT GCCGATGGCG ACAGCTTCGT CTTCGCCATC GCCGGGGGGG CGGATGCCTC GAAATTCGCG ATCGATCCGA ACACGGGCCT CCTCACCTTC CTGACCGCCC CCGACTACGA GGCGCCGACG GATGCCGATG GCAACAATGC CTATCAGGTG ACGGTCTCGG TATCCGACGG GATGAGCCCG CCCACGACGC AGGCTCTGAC CGTGAACGTG ACCGACGTGG TCGAGCCGGG CGGCACGGCC TGGAACCTGT TCGGGGCGGC CGCGGGGCCC GACCAGATCG TGACCTCCGA CAGCGAAGAC TACGAACTCG GAGCGAAGTT CGTCTCGAAC GGCGCGGGCC TCGTCACCAG CCTTCGCTAC TTCCGGGGCG CGGCCGATGC GCAGGACACC GATGTGCGCA CGCTCAACCT CTGGAGCGCG ACCGGCACCA AGCTCGCCAC GGTCACGGTG ACATCGGCCC CCGGCCAGAC CGGCTGGCAG ACGGCAGAGC TCGACACGCC CTTCCAGATC CAGGCGGGCC AGACCTATGT GATCTCTTAT GGCACGACGC AGAACTACGC CTACAGCGGC GGCTTCTTCG ACACCGACTG GGTCGGGCCC GGCGGCCATC TGACCGCGCC CGGCGGCGCC GTGGCGAACG GCGTCTTCCA CGCGGGCAGC ACCGGGCTCT TCCCGGATCA GAGCTTCAAC GATGCGAACT ACTGGGTGGA TCTGACGGTC ACGCCGCTCG ACCCCGGGCT GCTCGTCTGA
|
Protein sequence | MTDNAIVVEN ARTDGVMPKS YWDVEHSTFI EGFATDISVN AGQRIDFKIN VNDEAGSDYK VEVFRLGYYG GDGARKVAEW VNTDAKVQDQ ATYNPALGLV DAGNWSVTDS WLTPTDAVSG VYIARVQRLD AAGNPIEGEV NQIPFILRED DRASDIVLQT SDTTWQAYNG WGGNNGQIGA NLYGDVSGTI DHPDIPGASS YEPDRAYAVS YNRPLITRGI DGEQGGVAAG AQDYLFGADY AAIYWLEKNG YDVSYMSGVD TDRLGADYLK KYQAFISVGH DEYWSGGQRS NVEEARDDGV NLLFWGGNDV YWKTRYEASA VDGVDYRTLV CYKETTAVAD PNAGPEDYYN LDPTDIWTGT WRDTRFHGNP LAGGGLPDDF LSGQCPTCNC AENSLTGQLF GPDGTGEFGG ALDVTGEYAQ LRVWRDTTIA NGGALDIAPG LLGYEWNTSP DDYNRPAGLV HLSETTIPWG AILTDQGNTT APGIATHNLS LYRAESGALV FGAGTVFWTW ALSNLHDGEP YNAQIENRDL QQFVMNMFAD MGIQPGVADA VLASQGLLRA LASADHAPAT ALINDLPDTL PALETVTITG TATDDDGNPL TEDGRVALVE VSLDGGTTWT AAQGTTNWSF SWTPTRQGIF DIRVRAFDDS LNLPVAVTLD RETVEITAPV VPPEVSLFDP FVTFNGESHN DNTALELGTR FTALQTGSVT ELRYYRAASD ATDTDTRTGH LWRADGTLLA TVTFVSTPGQ TGWQTAELSS PITLAAGQSY VVSYTTQDNY VGTNGFFSTG YVDPYGILSA GAGTGGVFAV GSNIFPQSSY QGTSYWVDIT FAPQAVGTGL PVFAGPSSLS VAENDLYVGT VAATDPQGQP ISYAITGGAD AGLFSIDAKT GALHFRFPAN HELPTDADGN NIFQFRVTAT DLTGEAATRD FSVTLTDVVD ESLHPSLLFG PADAPAASIT DDPTDYELGM RFQAASSGEI TSLRYFRGAA DAGDVDTRTL HLWSATGVLL ASVEVTSGAG QSGWQTGTLS APIQIEAGQT YVVSYGTVQN YVATQNYFTS DHVGADGALT GLGGTGNGVY HAGGTGIFPT SSYLSTNYWV DVVFEPGSNG TTNSAPTFTS AATVPAAENQ TVAIDLSAVD ADGDSFVFAI AGGADASKFA IDPNTGLLTF LTAPDYEAPT DADGNNAYQV TVSVSDGMSP PTTQALTVNV TDVVEPGGTA WNLFGAAAGP DQIVTSDSED YELGAKFVSN GAGLVTSLRY FRGAADAQDT DVRTLNLWSA TGTKLATVTV TSAPGQTGWQ TAELDTPFQI QAGQTYVISY GTTQNYAYSG GFFDTDWVGP GGHLTAPGGA VANGVFHAGS TGLFPDQSFN DANYWVDLTV TPLDPGLLV
|
| |