Gene Rsph17029_1077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1077 
Symbol 
ID4896655 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1112074 
End bp1113225 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content72% 
IMG OID640111664 
Productputative mRNA 3-end processing factor 
Protein accessionYP_001042960 
Protein GI126461846 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1236] Predicted exonuclease of the beta-lactamase fold involved in RNA processing 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACGCG ACCCCCTTCT GACCTTCACC GACCGCGGGA TCTTCTGCCC CGCGGGCGAT 
TTCTACATTG ACCCGTGGCG GCCGGTGGAG CGTGCGCTCA TCACCCACGG CCATTCGGAC
CATGCCCGAT CCGGCCACGG CGCCTATCTG GCGACGGAGG GCTCGGCCCC GGTGATCCGC
TACCGGCTGG GCGACATCCG CCTCAAGACG ATCCGCTACG GCGAGACCCG GCGGATCGGC
GGCGTCACGG TCTCGTTCCA TCCGGCGGGT CATGTGCCGG GCTCGGCGCA GATCCGTGTC
GAGCGGAACG GCGAGGTCTG GGTGGTCTCG GGCGATTACA AGGTGGCCGA GGACGGGCTG
TCGGAGCCTT TCGAGCCGGT CACCTGCCAC AGCTTCATTT CGGAATGTAC CTTCGGCCTG
CCGGTCTTCC GCTGGAAGCC GCAGGCCGAG CTCGCGGCCC AGCTGAACCG CTGGTGGGCG
GCGAATGCCG CCGAGGGGCG CACGTCGATC GTGGGCGCCT ATACGCTCGG CAAGGCGCAG
CGGCTTCTGG TCTCGGCCGA TCTCTCCATC GGCCCGATCC TGACCCATGG TGCGGTCGAG
GCCACCACCG CCGTCCTGCG CGAGCAGGGG CTGGCGCTGC CGCCCACCAC CTATGTGGCG
CCCGGCATCG ACGGCACGTC GCACCCGGGG GCACTGGTGA TCGCGCCGCC CTCGGCGCTG
GGCACCCCCT GGGCCACGCG CTTCGGCCCC TCGGCCGAGG CCTTCGCCTC GGGCTGGATG
GCGCTGCGCG GCGTCCGCCG CCGACGCGGC CTCGCGCAGG GCTTCGTCAT GTCCGACCAT
GCCGACTGGG ACGGGCTCAA TGCCGCGATC CGCGCCACGG GGGCCGAGCG GATCTTCGTC
ACCCACGGCT ATACCGCGAT CTTCCGCCGC TGGCTCGAGG ATCAGGGGTT CGAAGCGGGC
ATCGTCGCCA CGGAATATGA GGGCGAGAGC CTCGATGCGG CCGAAGCCGA GGCGGGTCCG
CTGATCGAGC CCGACGCGGG CGCAGATGCC GTGGCCGAGG AGGACGGGAC GGCAGCCGAT
CCGGCCACGG ACGGGTCGGA GCCCGCCGAG GGCAAGCGCA GGCGCCCGGC AGCGGGGGAC
GCCCGGACAT GA
 
Protein sequence
MARDPLLTFT DRGIFCPAGD FYIDPWRPVE RALITHGHSD HARSGHGAYL ATEGSAPVIR 
YRLGDIRLKT IRYGETRRIG GVTVSFHPAG HVPGSAQIRV ERNGEVWVVS GDYKVAEDGL
SEPFEPVTCH SFISECTFGL PVFRWKPQAE LAAQLNRWWA ANAAEGRTSI VGAYTLGKAQ
RLLVSADLSI GPILTHGAVE ATTAVLREQG LALPPTTYVA PGIDGTSHPG ALVIAPPSAL
GTPWATRFGP SAEAFASGWM ALRGVRRRRG LAQGFVMSDH ADWDGLNAAI RATGAERIFV
THGYTAIFRR WLEDQGFEAG IVATEYEGES LDAAEAEAGP LIEPDAGADA VAEEDGTAAD
PATDGSEPAE GKRRRPAAGD ART