Gene Rsph17025_3187 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_3187 
Symbol 
ID5085672 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009429 
Strand
Start bp50650 
End bp51747 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content74% 
IMG OID640484759 
Productpositive regulator of sigma E, RseC/MucC 
Protein accessionYP_001169376 
Protein GI146279218 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.600776 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.981913 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTCC CGCGCCAAGC CCTGATTGCC GCCGCCGTGG CACTCTGCGC GGCGGGCGGG 
GCCATCGGCG CCACCCTCAT GCGCGGCCCG GCCGAGGTGC GCGAGACCTT CTATCTCTTC
GGGACGCTGG TCGAGATCGA GGCCCGCGGC GCCCCCGAAA GGGTGGCCCG CGAGGCCATC
GCCCGCGTTG GCCAGCGGCT GGAGGCCGCC CACAAGGACT GGCACGCCTG GCGCCCGGGC
GAACTCGAGA GCCTCAACGC CGCCTTCGCC GCAGGCGAAA GCCGCAAGGT CGAGGTCGGG
CTCGCGGCCC TTCTGCGGGA GGGACGGGCG CTGTCCTGCG CGAGCGGCGG CCTCTTCGAT
CCGGCGATCG GCGGGCTGAT CGAGGCCTGG GGCTTTCATG CGGATGTTCC GCCCGACGGC
GCGCCACCGC CCGCTGCGGT CCTCGCGGCC CTGGTGGCGC GTCATCCGCG CATGACCGAT
CTGACGATCA CGGGCGACGA GGTGCGCTCG GACAACCCGG CCGTCCAGCT CGACCTCGGG
GCCTTCGCGA AGGGGGCGGC GCTCGACCTC GCCGCCGAGG ATCTGCGCGC CGCGGGGGTG
CAGGATGCGG TGCTGAATGC GGGCGGCGGC GTGAAGGTCA TCGGGCGGCA TGGCGCGCGG
CCCTGGAGGG TCGCGATCCG CGACCCGTTC CAGTGGGGCG TGGTGGCGGC GGTGTCGCTG
CGCCCCGGCG AGGCGCTGCA CACGTCCGGC AACTACGAAC GCTATTTCGA CGAAGGCGGC
GTCCGCTTCT CGCACATCAT CGACCCGCGA ACCGGGCGGC CGATGCAAGG GATCGTTTCG
GTTTCGGTTC TGGACCCGAG TGGCGCGCGC GCGGATGCGG CGGCCACCGC CCTCGCCGTG
GCCGGCCCGG TCGAGTGGCC GCAGGTCGCC GCCGCCATGG GGGTGAGGGC GGTGCTGATG
ATCACCGACG ACGGCTCCAT CCTCGCAACG CAGGCGATGC ACGAGCGGCT CGAGCCGGTG
CCGGGGGGCT TCCCCGCCCC CGTTCGGATC GTCGCTCTGC CAGAGGCCGT TCCCCCGCCC
GCCTGCTCCA ACGGGTGA
 
Protein sequence
MTVPRQALIA AAVALCAAGG AIGATLMRGP AEVRETFYLF GTLVEIEARG APERVAREAI 
ARVGQRLEAA HKDWHAWRPG ELESLNAAFA AGESRKVEVG LAALLREGRA LSCASGGLFD
PAIGGLIEAW GFHADVPPDG APPPAAVLAA LVARHPRMTD LTITGDEVRS DNPAVQLDLG
AFAKGAALDL AAEDLRAAGV QDAVLNAGGG VKVIGRHGAR PWRVAIRDPF QWGVVAAVSL
RPGEALHTSG NYERYFDEGG VRFSHIIDPR TGRPMQGIVS VSVLDPSGAR ADAAATALAV
AGPVEWPQVA AAMGVRAVLM ITDDGSILAT QAMHERLEPV PGGFPAPVRI VALPEAVPPP
ACSNG