Gene Information |
Locus tag | Rsph17029_3898 |
Symbol | |
ID | 4899146 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 1032046 |
End bp | 1034661 |
Gene Length | 2616 bp |
Protein Length | 871 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640114502 |
Product | hypothetical protein |
Protein accession | YP_001045749 |
Protein GI | 126464636 |
COG category | [S] Function unknown |
COG ID | [COG0392] Predicted integral membrane protein; [COG2898] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
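The coordinates and lengths in the Gene Information fields are mutually consistent, and the arithmetic can be checked directly. The sketch below assumes 1-based, inclusive coordinates and a CDS ending in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in the record.
length = gene_length_bp(1032046, 1034661)
print(length, protein_length_aa(length))  # prints: 2616 871
```

Both values agree with the listed Gene Length (2616 bp) and Protein Length (871 aa).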
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.129662 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.427904 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTGGC GCAGCGAGAC GCCCGAAGAC GCGACGCCTC CGGACGAGGC CCTCCCGCCC CGGTCGCGCC CCGTGCGCGA CTGGCTCAGC CGCAACCGCA CCGTGCTGCT CGCGCTGGTG ACATGGGTCG TGTTCGCGGC AGTCGCCTAC ACGACCTACC GGATCACGGG CGACATCCGC TACAAGGACA TCCTCCATGC GCTCGAGGCG ACGACCTGGA CCGACATCCT GATCGCGGGC TTCTTCACCG TGGTGAGCTT CGTCCTGCTC GCGGGCTACG ATGTCAACGC GCTGCGCCAT CTGGGCAAGC AGGCCAATCT GGTGCAGGTG GGCATGATCG CCTTCAGCGC CTATGCGATC GGCAACACCG TGGGCTTCGG CCCGCTGTCG GCCGGGGCGG TGCGCTACCG CGGCTACAGC CGGCTCGGCC TCTCGGGCGA GCAGATCGCC GGCGTGATCG CCTTCGTGAC CCTCTCCTTC GGCCTCGGTC TGACGGTGAC CACCGCGCTC GCGGCGCTGG TGGCGGCCGA TCAGGTGGCG GGCTTTGCCG GTCTCACGCC GCAGATGCTG CGGCTTCTGT CGGCGGCCCT CCTGCTCGCG CTGACGGTGG CGGCCGTGAT CCTCTGGCGC AGCGACGGGT GGCTGCGGCC GCACATGCCG CGCCCGGGCA TCGCACTGGG CCAGCTTGCC ATCACCGCGG CCGATCTCAT GGTCTGCGCC ACTGTCCTCT GGGTGCTCCT GCCGCAGGAT CTGGAGGTCA GCTGGATCTC CTTCGTCATC ATCTATGCCA TCGCCATCGG CCTCGGCGTG CTGAGCCACG TCCCGGCGGG TCTGGGCGTC CTCGAGGCGG TGATCCTGAC CACGCTCGGC GGCTCGACGG GGACGGATGC GCTGCTGGGG TCGCTCGTCC TCTACCGGGT CATCTACCAT GTCGTGCCCC TCCTGATCGC GGTGGTCGTG GTCGCCTGGA CCGAGGCGCT CGAGGCGTTC CATTCGCCCC GCCTCGAATG GGCGCAGCGG ATGGGCACGC TGCTCGCGCC CTCGCTCCTC GGGTCGCTCG CGGTGATCTG CGGCGTCATG CTGATCTTCT CGAGCGTGAT CCCCGCGCGC GAGGCGAACC TCGTCTGGCT CGCGGGCTAC GTCCCCGCCC TGCTCATCGA AGGGGCGCAT TTCCTGTCGA GCCTGATCGG CCTCGTGCTG TTCGTCGCGG CCCGGGGCCT CACGCAACGG CTGGACGGCG CCTACTGGCT GACGCTAGGT GCGGCGAGCG CGGCCTTCCT CTTCACCTTC GTCAAGGCGC TGGCGCCTTA CGAGGCGGTG ATGCTGGCCG CCCTGATCGG CTTTCTCCTC CTGAGCCGGC CGCTCTTCGA CCGGCCCGCC TCGCTCTTCT CGCAGACGCT GACGCCGCCC TGGATCGCGG GGATCGCCAC CGTGGCCATT TCGGCCATCA CGATCCTGCT CTTCGTTCAG AAGGACGTGG CATACAGCCA CGACCTATGG TGGCAGTTCG AGATCTCGGC CGAGGCGCCG CGCGGGCTGC GCGCCCTTCT GGGCGTAGTG GTGCTGTCGG CGCTGATCGC AATCCGCAGC CTGCTGCAGC CCTCCCGCCC CGAGCCCGGG ATGCCCGACG AGGCCGAGCT GCAGAAGGCG CTCGCCATCG TCGAGCGTCA GGACATGGGC GAGGCCAATC TCGTGCGGAT GCGCGACAAG AGCCTGATCT TCTCGGACGC GGGCGACGCC TTCCTCATGT ATGCGGTGCA AGGCCAGTCC TGGATCTCGC TCTTCGGGCC CATCGGCGCC 
CCGCGCGCGC AGGCCGAACT GATCTGGCGC TTCATCGAGA CCGCGCGCGC CAAGGGCGGC CGGCCGGTCT TCTATCAGGT GCCGCCCTCG CTTCTGCCGC TCTGCGCCGA CGCGGGCCTG CGCGGGCTGA AGCTGGGCGA GCGGGCGGTG GTCGATCTCG AGGCGATGGA TCTGCAATCG AGCCAGTGGG CCGAGCAGCG GCAGGCCCTG CGCAAGGGCG AGCGGATGGG GCTGGCCTTC GAGCTGCTGG AGCCCGCCGA CCTCGGCCCG ATCCTCGACG AGCTCCAGCA GGTCTCGGAC GCATGGCTCG CGCATCACGA CACCCGCGAG AAGGGCTTCG CCCTCGGCCG GTTCGAGCGC GACTATGTGG CCGAGCAGCC GGTGGCGGTG CTGCGCGCCG AGGGACGCAT CGTGGCCTTC GCCACGGTCA TGCAGACGGG GACGAAGGCC GAGGCCACGC TCGATCTGAT GCGCTTTGCC CGCAGCGCGC CGCCGGGCTC GATGGATGTG CTGCTGTGCA ACCTGCTGGT CGAGATGAAG CGGCAGGGCT TCCGCAGCTT CAACCTCGGG ATGGCGCCGC TGTCGGGCAT CACCGCGCAT CAGGCCGCGC CGTTCTGGAA CCATCTCGGC CAGTCCGTCT TCGAACATGG CGAGCGGTTC TACAATTTCC GCGGCCTTCG GTCCTTCAAG GCCAAATACC GTCCCGACTG GCAGTCGCGC TACCTCGTGA CGCCGGGCGG GGTCTCGCCT CTGGCGGCGC TGGTCGACGT CACGCTGCTG ATCGGCGGCG GCCTCCGGGG CGTGATGCGG AAGTGA
|
Protein sequence | MSWRSETPED ATPPDEALPP RSRPVRDWLS RNRTVLLALV TWVVFAAVAY TTYRITGDIR YKDILHALEA TTWTDILIAG FFTVVSFVLL AGYDVNALRH LGKQANLVQV GMIAFSAYAI GNTVGFGPLS AGAVRYRGYS RLGLSGEQIA GVIAFVTLSF GLGLTVTTAL AALVAADQVA GFAGLTPQML RLLSAALLLA LTVAAVILWR SDGWLRPHMP RPGIALGQLA ITAADLMVCA TVLWVLLPQD LEVSWISFVI IYAIAIGLGV LSHVPAGLGV LEAVILTTLG GSTGTDALLG SLVLYRVIYH VVPLLIAVVV VAWTEALEAF HSPRLEWAQR MGTLLAPSLL GSLAVICGVM LIFSSVIPAR EANLVWLAGY VPALLIEGAH FLSSLIGLVL FVAARGLTQR LDGAYWLTLG AASAAFLFTF VKALAPYEAV MLAALIGFLL LSRPLFDRPA SLFSQTLTPP WIAGIATVAI SAITILLFVQ KDVAYSHDLW WQFEISAEAP RGLRALLGVV VLSALIAIRS LLQPSRPEPG MPDEAELQKA LAIVERQDMG EANLVRMRDK SLIFSDAGDA FLMYAVQGQS WISLFGPIGA PRAQAELIWR FIETARAKGG RPVFYQVPPS LLPLCADAGL RGLKLGERAV VDLEAMDLQS SQWAEQRQAL RKGERMGLAF ELLEPADLGP ILDELQQVSD AWLAHHDTRE KGFALGRFER DYVAEQPVAV LRAEGRIVAF ATVMQTGTKA EATLDLMRFA RSAPPGSMDV LLCNLLVEMK RQGFRSFNLG MAPLSGITAH QAAPFWNHLG QSVFEHGERF YNFRGLRSFK AKYRPDWQSR YLVTPGGVSP LAALVDVTLL IGGGLRGVMR K
|
| |
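The sequences in this record are printed in space-separated blocks of ten characters. Below is a minimal sketch of how the gene sequence might be cleaned up, checked for GC content, and translated. It assumes the standard bacterial codon assignments (translation table 11 shares its amino-acid assignments with the standard code and differs only in permitted start codons) and that the gene sequence is already given in coding orientation, which appears to hold here since it begins with ATG and ends with TGA despite the minus strand. The helper names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G), amino acids as in tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence in this record.
print(translate("ATGTCCTGGC GCAGCGAGAC G"))  # prints: MSWRSET
```

The translated prefix matches the start of the listed protein sequence (MSWRSETPED ...). Passing the full gene sequence to `gc_content` gives a value to compare against the listed GC content of 69%.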