Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_4073 |
Symbol | |
ID | 5086246 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009430 |
Strand | + |
Start bp | 124835 |
End bp | 126340 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640485636 |
Product | hypothetical protein |
Protein accession | YP_001170230 |
Protein GI | 146280073 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1596] Periplasmic protein involved in polysaccharide export |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000220985 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.0992314 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCCTG AAGGTTCTCG CCGCCTTTCC ACCAGCCGTC TTCTGCGCGT GGCAGCCCTC GTCGCCGGAG GTTTCGTCCT TGTCGTGGGC GCGACCCGGT TCGCCGACGA AGGGTTCCTC CAGCGCCTGC GGGCGGATTA TCCGCTCGTC TTCGGCCGGG TCGAGACGCT CCTCGACACG CGGCGGGCGG CCGGGCCCGT CGAGCCCGCG ATCCGCCCCG AGGAGGTGGT GGCGCTGACG GTCGAGCGGC CGGGCCTGCC CGCCAACTGC CTCGAGCCCG GCCCGGCCCG GGCCTCGCTC GCCGTGCCGG GCGACAGGCT TCGGATCCGC TTCTTCGAGA AGGGCGGCTT CTCGACCGGC GAGGCCGGCG GAGCGGGCGC GGCGCAGACC CTCGTCTTCG AGCGCCTCGA CCTGTCGGGC ACCTACGAGG TGGCCTCGGA CGGCAGCCTT GCCCTGCCGC TTCTGGGGCG GCTGCCCGTG CAGGGGCGCG AGCTGGTCTG CATCGAGGCG GCGCTGGCCG ACGGTTACAT CGGGATCCTG AACGCGCCTC TCGATGCCAC GGTCAGTTTC GAGAGCCGCC CCCCGGTGGT GCTGCGCGGC CCGGTGCGCG CGCCCGGCAC CTATGGCTGG ACCGAGGGGC TGACGGTGGC CCGGCTGATC GCCTCGGCCG GGTCGGCGGC GATGGGCGGC TATGACAGCC TCGGTCGCCG GGTCGAGCTG GAGGCGCGGG TGCGCGAGCT GCGCGACCGG ATGCTGGGCG TCGCGCTGGA ACGGGCCCGG ACCGAGGCCG CGATCGAGCG GCAGCGGGCG CTGAAGCTGC CGGCCTCGGA GCTTGACTAC ATGGGGGTGG AGCTTGGCCT GCGGCGGATC GAGGGCGAGA CGCAGGCCCT CGTGGCCGAA CTCGACGCCT TCGAGGCGAT CGAGAGCCGC TGGCAGACCG AGGTGGCCGA CCTCGGGCGC CGCCTCGCCG AGATGCGGCG CCACCACCAG ATCGCGCAGG AGCAGCTCGA GGTGCTGCGC CAGCGGCGCG AGGAACTGTC GGATCTCAGC GGGCGCGGCG TGACGACCGC GGCGCGGCTC GACGCGGCCA CGCTGAACCT GATGGGCAGC GAGCGCGCCA TGCTCGAGAC CTTCGACGCC CTGCTCGCGC TGGAATCGCA GCTGAACATC GCCCGACTGT CGCTGGAGCA GGCCCGCACC GACCGGAGCC GGCGCCTCGC GGCTGAACTG CGCGAAGAGG CCGAGGAGGA GAACCTGCTC AAGGGCCAGC TGCGCGCGGT GCAGGCCGAG ATCGCCCGGG TCGATCTGGG CGACGGGCTG GTGGAGGGCT TCGTGCCCGT CGTCGAGATC GAGCGGCCGG GCCCCGAGGG CGTCCGCCGG ATCCAGGTCG CGCCCGAGGA CGAGGTCTTT CCGGCCGATC TGGTCACGAT CTCGATCCCC GGCCGCGATC TCGTGATGCC GGTCCGCTCG TCGGAGGACG ACGGCCGGTC CAGCCTGCTG CGGTAG
|
Protein sequence | MMPEGSRRLS TSRLLRVAAL VAGGFVLVVG ATRFADEGFL QRLRADYPLV FGRVETLLDT RRAAGPVEPA IRPEEVVALT VERPGLPANC LEPGPARASL AVPGDRLRIR FFEKGGFSTG EAGGAGAAQT LVFERLDLSG TYEVASDGSL ALPLLGRLPV QGRELVCIEA ALADGYIGIL NAPLDATVSF ESRPPVVLRG PVRAPGTYGW TEGLTVARLI ASAGSAAMGG YDSLGRRVEL EARVRELRDR MLGVALERAR TEAAIERQRA LKLPASELDY MGVELGLRRI EGETQALVAE LDAFEAIESR WQTEVADLGR RLAEMRRHHQ IAQEQLEVLR QRREELSDLS GRGVTTAARL DAATLNLMGS ERAMLETFDA LLALESQLNI ARLSLEQART DRSRRLAAEL REEAEEENLL KGQLRAVQAE IARVDLGDGL VEGFVPVVEI ERPGPEGVRR IQVAPEDEVF PADLVTISIP GRDLVMPVRS SEDDGRSSLL R
|
| |