Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_3963 |
Symbol | |
ID | 5086139 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009429 |
Strand | + |
Start bp | 868804 |
End bp | 870330 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640485522 |
Product | hypothetical protein |
Protein accession | YP_001170122 |
Protein GI | 146279964 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.81037 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.136891 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGCAT CCGGAATATG CTGGGTCTTG GCCATAGCTG GGTCTCCTCC GTGGTGTGGT TCGCAAAACC ACCATAGAGA CCTGTGCCCC GGTTATGGCC GCCCGCTGCG CGATCCGGGG GCTTCGCGCG CGGCCATAAC CTTTCGCTGC GCATCATTCC CAACATGCTA CGGGCTGGAA TACACCTTCC GCCTGCGTCC GGGCGTCAAG TTCCACAGCA CCGACTTCTT CACGCCCACG CGCGACCTGA ACGCCGATGA CGTGATCTTC TCGTTCCTGC GCCAGGGCGA TGAGGAAAAC CCGTGGCACC AGTATGTGAC CGGGATCACC TACGAATATT ACAGCGGCAT GGAAATGCCG ACCGTGATCA AGGAGATCCA GAAGGTCGAT GACCTGACGG TCAAGTTCGT CCTGACCCGT CCCGAGGCGC CCTTCCTCGC GAACCTCGCG ATGGACTTCG CCTCGATCCT GTCGAAGGAA TATGCCGACA AGCTCGAGGC CGAGAACCGC AAGGAAGACC TCAACAACGC CCCCGTCGGC ACCGGCCCGT TCAAGTTCGT GGCCTACCAG AAGGACGCGG TGATCCGCTA CCAGGCCCAT GACGACTACT GGGCCGGCCG CGAGAAGATC GACGACCTGA TCTTCGCCAT CACCCCCGAC CCGGCGGTGC GCATGCAGAA GCTGCAGGCC GGCGAATGCC ACATCATGCC CTATCCGGCG CCGGCCGACA TCGAGGCGCT GAAGGCCGAT CCGAACCTGC AGGTGATGGA ACAGCCGGGC CTGAACGTGG CCTATCTCGC CTACAACACC ACCATCGCGC CCTTCGACAA CCCCGATGTG CGGCGCGCGC TCAACATGGC GCTGAACAAG GAGGCTGTCC TTGACGCCGT GTTCCAGGGC ACGGGGCAGG TGGCCAAGAA CCCGATCCCG CCGACCATGT GGGGCTACAA CGACGCGGTC GAGGAGAATC CCTACGATCC CGAGGCCGCC AAGGCGCTTC TGGCCGAGGC CGGGGTCTCG GATCTCTCGA TGGAGATCTG GGCCATGCCG GTGCAGCGGC CCTACATGCC GAACGCGCGC CGCACGGCCG AGCTGATGCA GGAAGACCTG GCCAAGATCG GCGTCAACGT CGAGATCGTC TCGTACGAGT GGGGCGAGTA TCTGAAGAAA TCGACCGACC CGGCCCGCAA GGGCGCGGTG ATCCTTGGCT GGACGGGCGA CAACGGCGAC CCGGACAACT TCCTCGGCGT GCTTCTGGGC TGCGCGGCCA CCGGCGACGG CGGCGCGAAC CGGGCCCAGT GGTGCAACAA GGAGTTCGAC GACCTGATCC AGAAGGCGAA GGTCACGGCG GATCAGGACG AGCGCGCCAA ACTCTACGAA GAGGCGCAAC TCGTCTTCAA GCGCGAGAAC CCCTGGGCCA CCATCGCCCA TTCGACGGTC TTCATGCCGA TGTCGAAGAA GGTCTCGGGC TATGTGATGA ACCCGCTGGG CAAACACGGC TTCTCGGGCG TCGATATCGA AGAGTGA
|
Protein sequence | MRASGICWVL AIAGSPPWCG SQNHHRDLCP GYGRPLRDPG ASRAAITFRC ASFPTCYGLE YTFRLRPGVK FHSTDFFTPT RDLNADDVIF SFLRQGDEEN PWHQYVTGIT YEYYSGMEMP TVIKEIQKVD DLTVKFVLTR PEAPFLANLA MDFASILSKE YADKLEAENR KEDLNNAPVG TGPFKFVAYQ KDAVIRYQAH DDYWAGREKI DDLIFAITPD PAVRMQKLQA GECHIMPYPA PADIEALKAD PNLQVMEQPG LNVAYLAYNT TIAPFDNPDV RRALNMALNK EAVLDAVFQG TGQVAKNPIP PTMWGYNDAV EENPYDPEAA KALLAEAGVS DLSMEIWAMP VQRPYMPNAR RTAELMQEDL AKIGVNVEIV SYEWGEYLKK STDPARKGAV ILGWTGDNGD PDNFLGVLLG CAATGDGGAN RAQWCNKEFD DLIQKAKVTA DQDERAKLYE EAQLVFKREN PWATIAHSTV FMPMSKKVSG YVMNPLGKHG FSGVDIEE
|
| |