Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_4175 |
Symbol | |
ID | 5086347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009430 |
Strand | + |
Start bp | 218694 |
End bp | 219782 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640485737 |
Product | hypothetical protein |
Protein accession | YP_001170331 |
Protein GI | 146280174 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGCA GGCACTTCAT GGCGTCTCTG CTCGGCACCC TCGCGGCAGG ACGGGCTTTC GGCGGCTTCG CACAGGAGGG AGCCGCCTTC GTCCTGCCCG CCGATCATCC GCTGCAGCCG GCCTGGCAAG CCTGGAAAAG CCTCTGCCTG CAGGAGGACG GCCGGATCGT CGATGCGCCC CAAAGCGGGG CCAGCCATTC CGAAGGCCAG GGTTACGGTC TGCGGCTGGC CGTGGCCTTC GGGGACGAGG AGGCGTTCCG CAGGATCTTC GAATGGAGCG AAAGCCATCT GGCCCTGCGC GAGGACGGGC TGCTGAGCTG GCGCTACCTG CCCGAAGCCG GCGATCCGGT TCCCGACCGC AACAATGCCT CCGACGGCGA CCTCTTCTAC GGCTGGGCGC TGATCCAGGG CGCGATGGCC TGGAACGAAC CCCTCTTCGC CGACCGGGCC CGGATGATCG GCACGGCGCT CGCGCGCAGC TGCCTGGCCG ATCATCCCGA CGGCTCGGGT CGCCGGTATC TTCTGCCGGC CGCCTATGGG TTCACCAGCG ACAGGGCCAT CACGCTCAAC CTGTCCTACT GCATGCCCCG CGCGATGCGC GAACTGGGCG CCTTCGCCGA AGCCCCCGCC CTCGTCACGG CGGCGCAGGA CGGTCTCGAC CTGATGAACG GGATCGCCGG ATCAGGCCTT CTGCCCGACT GGCTGCAGCT GACGCCCGAG GGGCGCACTC CCGCGCCCGG CCTGCCCGAC CAGAGCGGCT ACGAGGCGCT GCGCATTCCG CTCTACCTCT GCTGGTCAGG CATGACCGAC ACGCCCGCCC AGCAGCGCTT CCGCGAGATG CACCGCCAGG CCGCGGCCCG GGATCGGGGC ACCGCCACGG TCTTCGATCC GGAGACCGGC CGGATCCGGG AGAGCAGCGA CGAGGTCGGC TACCGGGCTC TCGTGGCTCT CAACGACTGT GTCCTGTCAC GGACCGCAGG ATCCGCCATG CCGGATTTCG ACGCCGCCCA GGTCTATTTC CCCGCCACGC TCCATCTGAT GGCGCTGGTC GCCCAAGCCG AGTTCTTCCC CAAATGCCTG CCCGTCTGA
|
Protein sequence | MKRRHFMASL LGTLAAGRAF GGFAQEGAAF VLPADHPLQP AWQAWKSLCL QEDGRIVDAP QSGASHSEGQ GYGLRLAVAF GDEEAFRRIF EWSESHLALR EDGLLSWRYL PEAGDPVPDR NNASDGDLFY GWALIQGAMA WNEPLFADRA RMIGTALARS CLADHPDGSG RRYLLPAAYG FTSDRAITLN LSYCMPRAMR ELGAFAEAPA LVTAAQDGLD LMNGIAGSGL LPDWLQLTPE GRTPAPGLPD QSGYEALRIP LYLCWSGMTD TPAQQRFREM HRQAAARDRG TATVFDPETG RIRESSDEVG YRALVALNDC VLSRTAGSAM PDFDAAQVYF PATLHLMALV AQAEFFPKCL PV
|
| |