Gene Information |
Locus tag | Rsph17025_3837 |
Symbol | |
ID | 5085384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009429 |
Strand | + |
Start bp | 734628 |
End bp | 735965 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640485395 |
Product | hypothetical protein |
Protein accession | YP_001169997 |
Protein GI | 146279839 |
COG category | |
COG ID | |
TIGRFAM ID | |
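
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the terminal stop codon, neither of which the record states explicitly. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(734628, 735965)
print(length)                  # 1338
print(protein_length(length))  # 445
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields.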
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0111576 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.104463 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCGTC GGACGGCGGC ATCCTGCAGG CCGCGGCGGC ATTCGTTGGA ACATAACCGG CGCAGGTTCA TTGGTGCAGT GCAGCATCTC TTGATGCAGT GCAGCATCCG CGTCGCGACG AGCCGCGTGG ATTTGAATGG GACCGCTGCG ACCCCAACCC CAATACATGC TTGTCGTGCC CGGCCGCCCC GCGGAATACC CGCAGGTCGC CAGCCGCGCC CGCAGAGAGG ACTGACCATG ACCACCATCC GCCAACTGAT CGAGTCGAGC CCCAAGCGCG CCAACGAACT GTTCGCCCGC CTCGTCGACA CGTCCGAGAC CGCGATCAAG ACGCGCGACC GGCTGTTTTC GCAGCTGAAG GACGAGCTGG AGCTTCAGGT CAGGCTCGAG GAGCAGCATC TGCTTCCGGT GCTGAAGAAG CACAAGGACA CGAAGGGGCT TGTCGCCGAC GCGCTGAACG ACAACCGCCA GACCCGCGAG CTTCTGGCCG AGCTGGAGCG CACGCCCAAG GAAAGCGAGG CCTTCGGCAC CAAGGTGGCC GAGCTGCGCA AGGTCTTCCA GCAGCATGTC CGCGATGACA AGAAGGACTT CCTGCCCGTC GTGGTGAAGG CGCTCTCGGA CGAGGAAGCC AGCGCCGTCG TCGGGAAGAT CGAGGACGGG AAGGCGAAGC TCGAGGCCGA GCAGCGCGCC GAGGCCGACG AGCGCCGCGC CGACACCCGC CGTCAGGCCG ACGAGGCCGA GCGCGAGAAG GCCGGCCAGG CCAGGGCCGA GCAGGACGAG GCCGCCCGGG CGAAGGCCGA GAAGGCCGAG ACCGAGCGCA GCCGCAAGGA AGCGGCCGCA GCCGAGGCCA CGCCGCCGGC CGCCGCCCGC AAGGCCGATG ACGCCCGCAA AGCCGATGAC GGGAAAGGAA ACCGCAAGGA AGCCCCGCGC CAGTCCGAGA GCCGGGACAA GGCCGAGGCG GAACGCCCCG CGGCAGAGCT GGTCGATGCC GGCGAAAAGG TTGCCCGCAT GGTGATGACC GCCGGCGCCG AGGGCACGCG AGATCTGGGC AAGGCGCTCC GCGTGGCCAG CGGGGCCGAC CAGCCGCAGG AGGCCGCGGC CAAGGCCGAG TCCCTTGTGC CCGGCCCCTC GATGATGGAC CTGCTGGGCG AACAGGCGCG GCACGCGATG CAGGTGACGG TGGCGGTCAG CACGGCCCGC TCGATGAACG ACATCGCCCG CGCGCAGGGC GACTTCCTGA CCGGCAGCTT CCAGCGGATG ACCCAGCTCA ACGCGCGCTA TCTCTCGCTC TTGCGCGACG GGATGGGGTT CGGGGCGTTC CTGAACCCGC GTCACTGA
|
Protein sequence | MIRRTAASCR PRRHSLEHNR RRFIGAVQHL LMQCSIRVAT SRVDLNGTAA TPTPIHACRA RPPRGIPAGR QPRPQRGLTM TTIRQLIESS PKRANELFAR LVDTSETAIK TRDRLFSQLK DELELQVRLE EQHLLPVLKK HKDTKGLVAD ALNDNRQTRE LLAELERTPK ESEAFGTKVA ELRKVFQQHV RDDKKDFLPV VVKALSDEEA SAVVGKIEDG KAKLEAEQRA EADERRADTR RQADEAEREK AGQARAEQDE AARAKAEKAE TERSRKEAAA AEATPPAAAR KADDARKADD GKGNRKEAPR QSESRDKAEA ERPAAELVDA GEKVARMVMT AGAEGTRDLG KALRVASGAD QPQEAAAKAE SLVPGPSMMD LLGEQARHAM QVTVAVSTAR SMNDIARAQG DFLTGSFQRM TQLNARYLSL LRDGMGFGAF LNPRH
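
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the database's own method: the `translate` helper is hypothetical, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ only in which codons may act as start codons). Whitespace is stripped so that the 10-base blocks can be pasted directly from the record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence 5'->3', stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the spaces between blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 bases of the gene sequence in the record.
print(translate("ATGATCCGTC GGACGGCGGC ATCCTGCAGG"))  # MIRRTAASCR
print(translate("CTGAACCCGC GTCACTGA"))               # LNPRH
```

The two fragments reproduce the first ten and last five residues of the protein sequence; passing the full gene sequence to `translate` applies the same check to the whole record.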