Gene Rsph17029_0049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_0049 
Symbol 
ID4896923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp58771 
End bp59991 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content73% 
IMG OID640110625 
Producthypothetical protein 
Protein accessionYP_001041941 
Protein GI126460827 
COG category[S] Function unknown 
COG ID[COG3214] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.130826 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCCTCC CCAACCCGCT CGCCCGCCGG CTGTTCCTGC ATCTCCACGC CCTTGCCGAG 
CCGCCCACCG GCCCGGCCAA GGGCGAGGCG CTGCTGGCGC TGATCGACCG TCTGGGCTTC
GTCCAGATCG ACAGCATCTC GACCGTCGCC CGCGCCCATC ACATGATCCT CTTCGCGCGC
AGGCAGGCCT ACCGGCCCGA GGCGCTCGAC CGGCTGCTCG CGCAGCGCCA CCTGTTCGAA
CACTGGACCC ACGATGCGGC GGTGATCCCC GCCCGCTTCT TCCCCTTCTG GCACCACCGC
TTCCGCCGCG ACCGGCCGCG GCTGCTGGCC CGCTGGCGCG GCTGGCAGCG CGAGGGGTTC
GAGGAGCAGT TCGATGCGGT CCTCGCGCGG ATCGCCGAAA GCGGGCCGGT CTCGGCCGCC
GAAGTGGGCG AGGAGGAAGA GCGCGGCACA GGCGGCTGGT GGGACTGGCA CCCGTCGAAG
GCCGCCTTGG AATATCTCTG GCGGGTGGGC GAGCTTTCCA TCACGCGCCG CGACTCGTTC
CGCAAAGTCT ACGATCTGAC CTCCCGCGTC ATCCCGTCCG GGTGGCTCGC GATGGATCCG
GGCGACGCCG CCACGATCCA CTGGGCCTGC TCCGAGGCGC TCGACCGGCT GGGCTTCGCC
ACCTCGGGCG AGCTGGCCGC CTTCTGGGCC GCCGCCAGCC CCGCCGAGGC GCAGGCCTGG
TGTCACGATG CGCTCGCGCG CGGCGAGATC GTGGAGGTCC GCGTCGAGGG GGCCGACGGC
AGCCTCCGGC GCAGCTACGC CCGCCCGGAG GTGGCCGCGC TGGCCGAGGC CGCGCCCGAT
CCCTCGCCGC GGCTGCGGAT CCTGTCGCCC TTCGATCCGG TGCTGCGCGA CCGGGCCCGC
GCCGAACGGC TGTTCGGCTT CCGCTACCGG ATCGAAGTGT TCGTGCCCGA GGCCAAGCGC
ACCTACGGCT ATTACGTTTT CCCGATCCTC GAGGGCGACC GGCTGATCGG CCGGATCGAC
ATGCGCGCCC ACCGCGAGAG CGGCAGCCTG CGCGTGCGCG CGCTCTGGCC CGAGCTGGGG
GTGCGGCTCG GCTCGCGGCG GCTCGGGCGG CTCGGGGCCG AGCTCGACCG TCTGGCGCAG
TTCGCGGGCT GCGATCAGGT GAAGTTCGAG CCGGACTGGC TGCGCGAGAC GCTGCCCGAG
GGGAGCGTCG CCGGAGACTA G
 
Protein sequence
MILPNPLARR LFLHLHALAE PPTGPAKGEA LLALIDRLGF VQIDSISTVA RAHHMILFAR 
RQAYRPEALD RLLAQRHLFE HWTHDAAVIP ARFFPFWHHR FRRDRPRLLA RWRGWQREGF
EEQFDAVLAR IAESGPVSAA EVGEEEERGT GGWWDWHPSK AALEYLWRVG ELSITRRDSF
RKVYDLTSRV IPSGWLAMDP GDAATIHWAC SEALDRLGFA TSGELAAFWA AASPAEAQAW
CHDALARGEI VEVRVEGADG SLRRSYARPE VAALAEAAPD PSPRLRILSP FDPVLRDRAR
AERLFGFRYR IEVFVPEAKR TYGYYVFPIL EGDRLIGRID MRAHRESGSL RVRALWPELG
VRLGSRRLGR LGAELDRLAQ FAGCDQVKFE PDWLRETLPE GSVAGD