Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_0838 |
Symbol | |
ID | 5084944 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | - |
Start bp | 855280 |
End bp | 856971 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640482396 |
Product | hypothetical protein |
Protein accession | YP_001167047 |
Protein GI | 146276888 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.296759 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.604194 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCAGT TCAACATTTC CGTCCCGAGG GAGTCCGACG TCACCGCTGG TCCAGCGGGC CGATCGCAAG GATGCGCCGC CCGTTTCGCG CGGTCCGAGG AGGGTTCGAT CCTCGTCTTC GGCCTCATGC TGTTCATCCT CATGCTGATG CTGGGCGGCT TTGCGGTCGA TGTCATGTCC TTCGAGGCCA AGCGCACCGA CCTTCAGCAG GCCGTGGACC GCTGCGCCCT GACAGCGGCG GCGCTGGCGC AGACCCGCGA CCCCGAGGAG GTGGTCGAGG ACTGCATGCT CAAGGCGGGC AAGGCGGACT ATGTCACCCT CATCGACCAC GACGAGGGGC TGAACTACCG CGAGGTGGTG GTCACCGCCC AGCAGCCGAC CAAACCCCTG TTCGCGCACA TGCTGGGCAT CGACAGCCTC ACCGCCCCCG CCGCCACCAA GGCCGAGCAG AAGGTGACCA ACGTCGAGAT CGTCATGGTT CTCGACGTGT CCGGCTCGAT GGTCCGCGAC TCTTACAGCA GACCCACGGA CAAGCTGAAG AACCTGAAGG CCGCGGCCAA GGAATTCGTG GACACGATGT TGGCGAAGGA TCTGAACCAC CGGATCTCGA TCGCCATCGT GCCCTACAAC GGTCAGGTGA ACCTCGGCAA ATCCCTGCGG CAGAAGTTCA ATATCTACGA CAACAACGGC GTGACCTACA TGGATTGCGT GGACATGCCG GCGTCAGTCT ATGCCAGCAC CGGGCTTTCC CGGACGCTGA AGATGCCGAT GACGGCGAAT GCCGATACCT TTTCGGCGGC CTTCAGCAAC GCGCGGACAG GTGGCACGCC GCCCGATACC TATTCGGGTC CGAAGACATC CGAGGCACAG CCCACCCCCG GCAACCGCTG GTGCCTGCCT ACGGCTGCCA ACGTGGTCCG CCTCCCGACG GGCAGCATTT CGAGCCTTCA GGCCAGCATC GACGGGCTCG AGGGCAACGG CGCCACCTCG ATCAACGCGG GAATGAAGGT CGGGCTGTCG CTTCTGGACC CATCGGCCCG CCCCATGTTT TCGGAATTCG TCGGCTCGGG TGAAATCCAG TCCTATTTCC ATGGCCGCCC CTTCGACTAC ACCGACGAAG AGGTGATGAA GGTCATGATC GTGATGACCG ATGGCGAGCA TTTCGAGGAA GAGCGGGTCA ATGACGGCTA CCGCGTCGGC GACTCCCCGA TCTACCGCAG CAGCGACGGC GAATATCTCT CGCTCAAGCT GAGCAATGGC AAATTCTACT GGCCCTACGA CAACACGACG ACCAACAGCG CGGCAAAGGG CAAATCACCC ACACAACTGA CCTGGCAGCA GGTCTGGGCC AGCTACAGGA CCTCCTACAT CGCCTGGCAG CTTTATTCCC GGCGCCCCGG CTTCAGATCC AGCGACCGGC TGGCCGCCTA TACGGCACAG ATGAATGCGT TCCGCACCCT GACGCCCATC AGCACGATGG ACGCGCAGTT GCAGGCGCTC TGCAACCTCG CCAAGTCCAA CAATGTCACC ATCTTCGGCA TCGCGTTCGA GGCGCCAGCG AATGGCAAGA CGCAGATCCA GAACTGCTCG ACCTCCAGGA GCAGCCACTA TTTCGATGCG TCCGGCCTCG AGATCCAGAC CGCCTTCCGG GCCATCGCCA GCCAGATCAG CTACCTGAGG CTCAGCCAAT GA
|
Protein sequence | MAQFNISVPR ESDVTAGPAG RSQGCAARFA RSEEGSILVF GLMLFILMLM LGGFAVDVMS FEAKRTDLQQ AVDRCALTAA ALAQTRDPEE VVEDCMLKAG KADYVTLIDH DEGLNYREVV VTAQQPTKPL FAHMLGIDSL TAPAATKAEQ KVTNVEIVMV LDVSGSMVRD SYSRPTDKLK NLKAAAKEFV DTMLAKDLNH RISIAIVPYN GQVNLGKSLR QKFNIYDNNG VTYMDCVDMP ASVYASTGLS RTLKMPMTAN ADTFSAAFSN ARTGGTPPDT YSGPKTSEAQ PTPGNRWCLP TAANVVRLPT GSISSLQASI DGLEGNGATS INAGMKVGLS LLDPSARPMF SEFVGSGEIQ SYFHGRPFDY TDEEVMKVMI VMTDGEHFEE ERVNDGYRVG DSPIYRSSDG EYLSLKLSNG KFYWPYDNTT TNSAAKGKSP TQLTWQQVWA SYRTSYIAWQ LYSRRPGFRS SDRLAAYTAQ MNAFRTLTPI STMDAQLQAL CNLAKSNNVT IFGIAFEAPA NGKTQIQNCS TSRSSHYFDA SGLEIQTAFR AIASQISYLR LSQ
|
| |