Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3160 |
Symbol | |
ID | 4898924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | + |
Start bp | 184269 |
End bp | 185306 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640113762 |
Product | extracellular solute-binding protein |
Protein accession | YP_001045032 |
Protein GI | 126463919 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0687] Spermidine/putrescine-binding periplasmic protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00659585 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGATGGATC GGGTTTGCAG GCTGTCGCTC GCGGCGCTGC TCCTGGGCTC GGGCGCCGCC TATGCGGACG GCTCGACGCT GACGGTCGCC TATTACGGCG GCAACTGGGG CGAGGCCTTC GACGCCTGCG TGGCGCAGCC CTTCACCGAG AAGACGGGCA TCCGGGTGGT GGCCGAGATC GGCAACTCGA CCACCACGCT TGCCAAGCTG CAGCAGCAGT CGGGCGATGC GGTGATCGAC GTGGCCTATA TGGACGGCGG GATCAGCGAG CTCGCCCAAG AGGCGGGCGT GCTGGCCCCC ATCGATCTCG CCGCGATCCC GAACGCGGCC AGCTACCTGC CGCAGGCCGT CTACGAGGCG GGCGACGAGG TCTTCGCGGT CAGCGCGGGC TATTACTCGC TCGGCCTCAT CTACAACACG TCGGAAGTGA CCGAGACGCC GGACAGCTGG CTTTCGCTCT GGGACGAGAG GTATGCGGGC GCCGTGGCCC TGCCCTCGCC CAGCAACTCC TCGGGCGTGC CCTTCGTCCT GTTCCTCGCC CGCTCGGTGC TGGGCGATAC CTCGGAATCC CTCGACCCGA CCTTCGCCAA GCTGAAGGAG CTGGACACGG GGCTTCTCTT CGACAGTTCG GGCGCGGCCT CGAACGCCTT CCAGAGCAGC GAGGTCATCA TCGGCGCCCA TTTCAACGTC GGCGCCTGGG ACCTGACCGA CGGCGGCCTG CCCATCGGCT TCAGCGTGCC GAAGGAGGGC GTCTGGGCCA CCGATGCGCG GATGCATGTG GTGAAGGGCA CGAAGAACCC CGAGGGCGCG GCGCAGTATC TCGACATGGC GGCCTCGCCC GAGGCGGCGG CCTGCCTGGC CGAACGGCTC TATCTCGGCC CGCCCGTCAC CGGCGTCACG CTCGCGCCCG ATGTCGAGCG CAAGCTGCCC TGGGGCGAGG GCGGATCGGT CGAGAAGCTG CATCTGTCGG ACTGGACCGA GGTGAATGCG CGCCGCGCCG CCATCGTCGA ACGCTGGAAC CGCGAGATCG CGAACTGA
|
Protein sequence | MMDRVCRLSL AALLLGSGAA YADGSTLTVA YYGGNWGEAF DACVAQPFTE KTGIRVVAEI GNSTTTLAKL QQQSGDAVID VAYMDGGISE LAQEAGVLAP IDLAAIPNAA SYLPQAVYEA GDEVFAVSAG YYSLGLIYNT SEVTETPDSW LSLWDERYAG AVALPSPSNS SGVPFVLFLA RSVLGDTSES LDPTFAKLKE LDTGLLFDSS GAASNAFQSS EVIIGAHFNV GAWDLTDGGL PIGFSVPKEG VWATDARMHV VKGTKNPEGA AQYLDMAASP EAAACLAERL YLGPPVTGVT LAPDVERKLP WGEGGSVEKL HLSDWTEVNA RRAAIVERWN REIAN
|
| |