Gene Rsph17025_4318 details


Gene Information

Locus tag: Rsph17025_4318
Symbol:
ID: 5086494
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009431
Strand:
Start bp: 74090
End bp: 75220
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 61%
IMG OID: 640485874
Product: hypothetical protein
Protein accession: YP_001170468
Protein GI: 146280312
COG category: [K] Transcription
COG ID: [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen
TIGRFAM ID:
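
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 74090, End bp 75220.
length = gene_length_bp(74090, 75220)
print(length, protein_length_aa(length))  # 1131 376
```

Under these assumptions the computed values agree with the listed Gene Length (1131 bp) and Protein Length (376 aa).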


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.598317
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.713921
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCTTT CCCCTAACCA GACACTCGAG CGAAACCGAA GACTGATAGA CGATCTGCGT 
GCCCGACCAG CAGAGACGCC TTGGCTGGAG TTCAAGGAGA ACAACGCAGA TGCCTCGCTG
ATTGGCAAGC TGGTCTCCGC GCTGTCGAAT GCTGCCCGTC TGGCTGACCA GCATTTTGGG
TATCTTGTAT GGGGTGTCCG CGATGGCAAC CACGAGGCTG TCGGCACCAG CTTCGACTTC
GCAACGAAGC GCGAACAAGG CCAGCCATTC GAGCTCTGGC TGGCCAACCG ACTTCAGCCC
AGCCTCGATT TCCGGTTCGA GGAGGTCGAC TATCGGGGCG TACGGCTCGT CCTGCTGACG
ATACCGGCCG CGGGCACGGC ACCGATCGAG TTTGACCGTA TCAGCTACGT GCGGATCGGC
AGCGCCACAC CGAGGCTCTC AGATCATCCG GAGCGGCTTC GTGCGCTCTG GGCCAGGCTT
CAACCCTACG CGTGGGAAGC CGGACTCACT GCGCAGTTCG TGACCGGCGA CGACGTCCTC
GCGCGGCTCG ACTATGCGAA CTATTTCGAC CTGACTGGTC AGCGATTGCC GGACAATCGG
CACGGCATCT TTGACAGGCT TCAAGCGGAT CGGCTGATTC AGCAGGACGT TGGAGGGCAC
TGGAACATCA CCAACCTTGG CGCGATCCTC TTTGCCAAGC GCCTCGCAGA CTTCGGCCCC
TCGCTCGAAC GAAAAGGCGT CAGGTTCGTC GCCTATGGTG GTCCCGGCCG CGCCTACGCC
GTAACGCACC GCCAGGATGG ACAGCGTGGA TATGCCGCCG GATTTCAGGG ACTTATCGAT
TTCATCGACG GACTGCTTCC GCGCAACGAG CACATCGGCT CGGCCTTCCG GGAAGAACGT
CCACTCTATC CTGCCATTGC CATTCGAGAG CTGGTTGCGA ATGCGCTGAT CCATCAGGAC
ATGACGATCA CGGGTGCCGG GCCCCTGATC GAGCTTTTCA GCGACCGCAT GGAGATCACG
AACCCCGGTG CGCCGCTAGG GTCAGGACCC ATTGATCTTG CTTCTGCGGC AACGATGAGC
AACCTTTTCT GGCTGACCGA CGCACAGATG GCGCGGCTTC AAGGGGCCTG A
 
Protein sequence
MTLSPNQTLE RNRRLIDDLR ARPAETPWLE FKENNADASL IGKLVSALSN AARLADQHFG 
YLVWGVRDGN HEAVGTSFDF ATKREQGQPF ELWLANRLQP SLDFRFEEVD YRGVRLVLLT
IPAAGTAPIE FDRISYVRIG SATPRLSDHP ERLRALWARL QPYAWEAGLT AQFVTGDDVL
ARLDYANYFD LTGQRLPDNR HGIFDRLQAD RLIQQDVGGH WNITNLGAIL FAKRLADFGP
SLERKGVRFV AYGGPGRAYA VTHRQDGQRG YAAGFQGLID FIDGLLPRNE HIGSAFREER
PLYPAIAIRE LVANALIHQD MTITGAGPLI ELFSDRMEIT NPGAPLGSGP IDLASAATMS
NLFWLTDAQM ARLQGA
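
The protein sequence can be reproduced from the gene sequence, and the GC content can be recomputed from it. The sketch below is an illustration only. It assumes the sequences are pasted as displayed (blocks of ten letters separated by spaces) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code, differing only in the permitted start codons. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First displayed line of the gene sequence.
first_line = "ATGACCCTTT CCCCTAACCA GACACTCGAG CGAAACCGAA GACTGATAGA CGATCTGCGT"
print(translate(first_line))  # MTLSPNQTLERNRRLIDDLR
print(round(100 * gc_fraction(first_line)))  # GC percentage of this line only
```

Passing the full gene sequence to `translate` and `gc_fraction` instead of the first line allows a comparison with the listed protein sequence and GC content.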