Gene Rsph17029_1007 details


Gene Information

Locus tag: Rsph17029_1007
Symbol:
ID: 4895672
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 1041655
End bp: 1043160
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 70%
IMG OID: 640111593
Product: MORN repeat-containing protein
Protein accession: YP_001042890
Protein GI: 126461776
COG category: [S] Function unknown
COG ID: [COG4642] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
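
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return gene_length_bp // 3 - 1


gene_length = span_length(1041655, 1043160)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # prints: 1506 501
```

Under these assumptions the values agree with the Gene Length (1506 bp) and Protein Length (501 aa) fields.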


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGAAGA CGAAGGAACG GATCGGACGG GCCAGAAGAC ACGGTCTGGC CGCCGTGGCA 
CTTGGAGCGG CACTGGGGGC CCCGGCGGTC CTCGCGCAGG ATCCGGGCGC GGTGACGACC
AAGCAGTATG ACGACGGCTC GGTCTACGAG GGCACGTTCC GCAACGGCCT CCAGCACGGG
ACGGGCACCT ACCGGCTGCC CAACGGCTAC GAATATTCCG GCGACTGGAC CGATGGCGAG
ATCCGCGGAG AGGGCCGCGC CCGCTTTCCC AATGGCTCGG TCTACGAGGG CGCCTTCGTC
GCGGGCAAGC CCGAGGGGCG CGGCAAGATC ACCTTCTCGG ACGGCGGCAC CTACGAGGGC
GACTGGGCCG GCGGCCAGAT GACCGGCGAG GGCGTGGCGC GCTATGCCAA CGGCTCGGTC
TATACCGGCC AGTTCCGCAA TGCCGTCCAC CACGGCAGGG GCGTACTCGA GAACCCCGGC
GGCTACCGCT ACGAGGGCGA CTGGGTCGAG GGCGTCAAGG AAGGCCGGGG CAAGATCACC
TATCCCGACG GCGCGATCTA CGAGGGCGAT CTGGTCAAGG GCCAGCGTCA GGGTCAGGGC
ACGCTCACCA TGCCCGACGG GCTCGTCTAC GTGGGCGCCT GGGACAATGG CCAGATCAAC
GGCACGGGCA GGCTCACCCA GCCCAACGGC GACATCTACG AGGGCCCGCT CAAGGACGGC
CAGCGCGAGG GCCGCGGCAA GGTCACCCAC AAGAACGGCG ACGTCTACGA GGGCGAGTTC
CATGCCGACC GGCGCCACGG GCAGGGCACC TTCCGCGGCA CCGACGGCTA TGTCTACGAA
GGCGCCTGGG TCGAGGGCCG GATCGAGGGC CAGGGCCGCG TCACCTATCC CGACGGCTCG
GTCTATGTGG GCCGGTTCCA CGAGGACCAG CCCGAGGGGC GCGGCAAGAT CACCTATCCC
GACGGATCGA CCTACGAGGG CGACTGGAAG GACGGCGTGA TCGAGGGGCG CGGCACCGCC
ACCTATGCCA ACGGCCTCGT CTACGAAGGC CAGTTCCATG CCGCCAAGAA CCATGGGCAG
GGCGTCATGA CCTATCCCGA CGGCTACCGC TACGAGGGCG ACTGGGTCGA GGGCCAGCGC
CACGGCCGGG GCACGGCCAC CTATGCCGAC GGCACGGTCT ATACCGGACA GTTCGTGCGC
GGCCAGCGCG AGGGAGAGGG CGAGATCGTG ATGGCCGACG GCTTCCGCTA CAAGGGCGGC
TGGAAGGCGG GCGAGATCGA CGGCGAGGGC ATTGCCACCT ACGCCAACGG CGACGTCTAC
GAGGGCACCT TCAAGGCCGG CAAGCGGCAG GGCCAGGGCG TGATGCGCTA TGCCACGGGT
CAGGAGAGCG CGGGCGAATG GAAGGACGGG ATCCTCGCCG AGCCTGCCGC CGAGGCCCCC
GCTGGGCAGG CCCCCGCCCC GGCCGAGGAT GCCACGCCCC CCGCCGAGGC CTCCACCGCC
GACTGA
 
Protein sequence
MRKTKERIGR ARRHGLAAVA LGAALGAPAV LAQDPGAVTT KQYDDGSVYE GTFRNGLQHG 
TGTYRLPNGY EYSGDWTDGE IRGEGRARFP NGSVYEGAFV AGKPEGRGKI TFSDGGTYEG
DWAGGQMTGE GVARYANGSV YTGQFRNAVH HGRGVLENPG GYRYEGDWVE GVKEGRGKIT
YPDGAIYEGD LVKGQRQGQG TLTMPDGLVY VGAWDNGQIN GTGRLTQPNG DIYEGPLKDG
QREGRGKVTH KNGDVYEGEF HADRRHGQGT FRGTDGYVYE GAWVEGRIEG QGRVTYPDGS
VYVGRFHEDQ PEGRGKITYP DGSTYEGDWK DGVIEGRGTA TYANGLVYEG QFHAAKNHGQ
GVMTYPDGYR YEGDWVEGQR HGRGTATYAD GTVYTGQFVR GQREGEGEIV MADGFRYKGG
WKAGEIDGEG IATYANGDVY EGTFKAGKRQ GQGVMRYATG QESAGEWKDG ILAEPAAEAP
AGQAPAPAED ATPPAEASTA D
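
The sequences above are laid out in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be checked against the GC content and protein sequence fields. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table uses the standard NCBI amino-acid string, which translation table 11 shares with table 1 for internal codons; the alternative start codons that table 11 allows are not handled, which is harmless here because the gene starts with ATG.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G, with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Characters other than A, C, G, T raise KeyError in this sketch.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied with its original spacing.
first_line = "ATGCGGAAGA CGAAGGAACG GATCGGACGG GCCAGAAGAC ACGGTCTGGC CGCCGTGGCA"
print(translate(first_line))  # prints: MRKTKERIGRARRHGLAAVA
```

The translated residues match the first two blocks of the protein sequence (MRKTKERIGR ARRHGLAAVA). Passing the full gene sequence to `gc_content` and `translate` would allow the same comparison against the GC content and Protein Length fields.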