Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_3187 |
Symbol | |
ID | 5085672 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009429 |
Strand | - |
Start bp | 50650 |
End bp | 51747 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640484759 |
Product | positive regulator of sigma E, RseC/MucC |
Protein accession | YP_001169376 |
Protein GI | 146279218 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.600776 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.981913 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTCC CGCGCCAAGC CCTGATTGCC GCCGCCGTGG CACTCTGCGC GGCGGGCGGG GCCATCGGCG CCACCCTCAT GCGCGGCCCG GCCGAGGTGC GCGAGACCTT CTATCTCTTC GGGACGCTGG TCGAGATCGA GGCCCGCGGC GCCCCCGAAA GGGTGGCCCG CGAGGCCATC GCCCGCGTTG GCCAGCGGCT GGAGGCCGCC CACAAGGACT GGCACGCCTG GCGCCCGGGC GAACTCGAGA GCCTCAACGC CGCCTTCGCC GCAGGCGAAA GCCGCAAGGT CGAGGTCGGG CTCGCGGCCC TTCTGCGGGA GGGACGGGCG CTGTCCTGCG CGAGCGGCGG CCTCTTCGAT CCGGCGATCG GCGGGCTGAT CGAGGCCTGG GGCTTTCATG CGGATGTTCC GCCCGACGGC GCGCCACCGC CCGCTGCGGT CCTCGCGGCC CTGGTGGCGC GTCATCCGCG CATGACCGAT CTGACGATCA CGGGCGACGA GGTGCGCTCG GACAACCCGG CCGTCCAGCT CGACCTCGGG GCCTTCGCGA AGGGGGCGGC GCTCGACCTC GCCGCCGAGG ATCTGCGCGC CGCGGGGGTG CAGGATGCGG TGCTGAATGC GGGCGGCGGC GTGAAGGTCA TCGGGCGGCA TGGCGCGCGG CCCTGGAGGG TCGCGATCCG CGACCCGTTC CAGTGGGGCG TGGTGGCGGC GGTGTCGCTG CGCCCCGGCG AGGCGCTGCA CACGTCCGGC AACTACGAAC GCTATTTCGA CGAAGGCGGC GTCCGCTTCT CGCACATCAT CGACCCGCGA ACCGGGCGGC CGATGCAAGG GATCGTTTCG GTTTCGGTTC TGGACCCGAG TGGCGCGCGC GCGGATGCGG CGGCCACCGC CCTCGCCGTG GCCGGCCCGG TCGAGTGGCC GCAGGTCGCC GCCGCCATGG GGGTGAGGGC GGTGCTGATG ATCACCGACG ACGGCTCCAT CCTCGCAACG CAGGCGATGC ACGAGCGGCT CGAGCCGGTG CCGGGGGGCT TCCCCGCCCC CGTTCGGATC GTCGCTCTGC CAGAGGCCGT TCCCCCGCCC GCCTGCTCCA ACGGGTGA
|
Protein sequence | MTVPRQALIA AAVALCAAGG AIGATLMRGP AEVRETFYLF GTLVEIEARG APERVAREAI ARVGQRLEAA HKDWHAWRPG ELESLNAAFA AGESRKVEVG LAALLREGRA LSCASGGLFD PAIGGLIEAW GFHADVPPDG APPPAAVLAA LVARHPRMTD LTITGDEVRS DNPAVQLDLG AFAKGAALDL AAEDLRAAGV QDAVLNAGGG VKVIGRHGAR PWRVAIRDPF QWGVVAAVSL RPGEALHTSG NYERYFDEGG VRFSHIIDPR TGRPMQGIVS VSVLDPSGAR ADAAATALAV AGPVEWPQVA AAMGVRAVLM ITDDGSILAT QAMHERLEPV PGGFPAPVRI VALPEAVPPP ACSNG
|
| |