Gene Rsph17029_2108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2108 
Symbol 
ID4897393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2233785 
End bp2234948 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content75% 
IMG OID640112702 
ProductFmu (Sun) domain-containing protein 
Protein accessionYP_001043983 
Protein GI126462869 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCCG CCGCCCGCCT CTCCGCCGCC ATCGGCATCC TCGACCGGAT CCTCGGCGGC 
ACCCCGGCCG AGCAGGCCCT GACGAACTGG GGCCGGGCGA GCCGGTTCGC GGGCTCGGGC
GATCGGGCCG CGGTGCGCGA CCTCGTGTTC GACGCACTGC GCTGCCGGCG CTCCTTCGCG
CGGGCGGGCG GAGCCGAGAC AGGGCGGGGG CTCGTGCTGG GCGGGCTGCG GATGGCCGGG
CAGGAGGTGG CGGAGCTCTT CACCGGCGAG GGCCATGCGC CCGCACCGCC CGCGGGATCC
GAGCTCGAGC CGCAGGCGCC CGCCGAGCTC GACGCGCTCG ATTGTCCCGA CTGGCTCGCG
CCGGCGCTGC GCGACAGTCT CGGCGCGGAC TTCGCCCCGG TGATGGAGGC GCTCCGCCAT
CGCGCGCCCG TCTTCCTGCG CGTCAATCTC GCGCGGACCG ACCGGGCGGC CGCGGCGGCC
GAACTCGCGG CCGAGGGCAT TGCCACGCAG CCGCATCCTC TGGCGGAGAC GGCGCTCGAA
GTGACGGAAA ACCCGCGCAG ACTGCAGGCT TCCGCCGCCT ATCGCGAGGG GCGGGTCGAG
CTGCAGGATG CCGCTTCTCA GGCCATCGTC GCGGCGCTGC CGCTCGCCTC GGGCGACCGT
GTGCTCGACT ACTGCGCCGG CGGCGGCGGC AAGAGCCTCG CCATGGCCGC CCGCGCGCCC
ATCGATCTCT CGGCCCACGA TGCCGACCCG CGCCGCATGC GCGACCTGCC CGAGCGGGCC
GCCCGCGCCG GTGCCGAGGT GCGGCGGCTC GCACCGGGAG ACCCCGCGCG CCGCGGCCCC
TTCGATCTCG TGCTGGCGGA CGTGCCCTGC TCGGGCTCGG GCAGCTGGCG CCGTGCGCCG
GAGGGCAAGT GGAGCCTCAC GCCGGAGCGT CTGGCCGAAC TCCGGGCGAT CCAGTCGACG
ATCCTCGACG AGGTGGCCCC GCTGGTCCGC CCGGGCGGGC ATCTCGCCTA TGCGACCTGC
TCGCTTCTCG CCTCCGAAAA CGCGGGCCAG ACCGACGCTT TCCTCGAGCG GAGCCCCGGC
TGGGCGCGGG TGCACCAGCT GCGACTGACC CCTCTCGACG GCGGCGACGG CTTCTTCCTC
GACCTGTTGC AGAGAAACGG CTGA
 
Protein sequence
MTPAARLSAA IGILDRILGG TPAEQALTNW GRASRFAGSG DRAAVRDLVF DALRCRRSFA 
RAGGAETGRG LVLGGLRMAG QEVAELFTGE GHAPAPPAGS ELEPQAPAEL DALDCPDWLA
PALRDSLGAD FAPVMEALRH RAPVFLRVNL ARTDRAAAAA ELAAEGIATQ PHPLAETALE
VTENPRRLQA SAAYREGRVE LQDAASQAIV AALPLASGDR VLDYCAGGGG KSLAMAARAP
IDLSAHDADP RRMRDLPERA ARAGAEVRRL APGDPARRGP FDLVLADVPC SGSGSWRRAP
EGKWSLTPER LAELRAIQST ILDEVAPLVR PGGHLAYATC SLLASENAGQ TDAFLERSPG
WARVHQLRLT PLDGGDGFFL DLLQRNG