Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3063 |
Symbol | |
ID | 4898991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 75998 |
End bp | 76918 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640113665 |
Product | helix-turn-helix domain-containing protein |
Protein accession | YP_001044935 |
Protein GI | 126463822 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.13528 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTTTC ACAGCGACTT CATCGGACAG ACGGACGGGA TCGATCGCAC CGCGCCGGTG CGGTGGCGCC GGATGGAGGG CGCGACCAGC GTCTTCTGGC AGGCCCGCGG CGTGCGGGGC GGACGGGGCT ACTATCTCTC GCAGCATCCG CGGATCATGG TCTTTCTCGA CGAGGTCTCC TCGATCCGGC TGTCGAACGA CTCCGGCGGC CGGGCGCATC TGGGCCGGGC GCTGCCGCGC GTGCTCTATG TGCCCGCCGG GCTGCCGATG TGGACCCATT TCACCTGCGA CCACAGCTTT GCCCATCTCG ACCTGCATCT GCACCGCGAC CGGCTGTTGA AGATGCTGAC CCCGGCCCTC GGCCGGTCCG CGGCGCTCGA CGTCCTCGCC CGTCCGGTCG AGAGCGAGGC GACGCGGTCC CTTCTGACAC TGGCGGAGCT TCTGGTGGAG GAGACGGTCG CGCCTGCGCA TCATCGGATC CATGCCGAGA CGCTGGCCGC GGGCCTTGTC ACCGGGCTTC TGGATCTCGC GTCGGGAGAG GAGGCCATGA GCCGCACCCG GCTGACGCAG GGCCAGATCC GCAAGCTCGA GCGCGCCTTC CGGGCGGGGG GCGGGCGCGG CCAGTCGGTG GCCGAGATGG CGCAGGTGGT GGGCCTGTCC GAGAGCTGGT TCACCCGGCT CTTCCGCGAC ACGACCGGGG TCACGCCGCT GCAATGGCAG CTGCGCCAGC GGGTCGAGCT GGCGCAGACC CTTCTCGAGG AGAACCTCAG CGTGGCCGAG ATCGCCGACC GCTTGGGCTT CAGCGATCAG GCGCATCTGA CGCGCGTCTT CCGTCAGGTG ACGGGTCAGC CGCCTGGCGC CTGGCGCCGC GCCCGACTCT GCGCGGCGGC GCCCGAAAGC GGGGCGGCGC GCCGGGCCTG A
|
Protein sequence | MSFHSDFIGQ TDGIDRTAPV RWRRMEGATS VFWQARGVRG GRGYYLSQHP RIMVFLDEVS SIRLSNDSGG RAHLGRALPR VLYVPAGLPM WTHFTCDHSF AHLDLHLHRD RLLKMLTPAL GRSAALDVLA RPVESEATRS LLTLAELLVE ETVAPAHHRI HAETLAAGLV TGLLDLASGE EAMSRTRLTQ GQIRKLERAF RAGGGRGQSV AEMAQVVGLS ESWFTRLFRD TTGVTPLQWQ LRQRVELAQT LLEENLSVAE IADRLGFSDQ AHLTRVFRQV TGQPPGAWRR ARLCAAAPES GAARRA
|
| |