Gene Rsph17029_3869 details


Gene Information

Locus tag: Rsph17029_3869
Symbol:
ID: 4898523
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 996359
End bp: 997549
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 74%
IMG OID: 640114473
Product: hypothetical protein
Protein accession: YP_001045720
Protein GI: 126464607
COG category:
COG ID:
TIGRFAM ID:
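
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; both assumptions are consistent with the listed values but are not stated by the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(996359, 997549)
print(length)                  # 1191
print(protein_length(length))  # 396
```

Both results agree with the Gene Length and Protein Length fields of this record.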


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000457394
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCATGC CCCTCGTTCT GGCGCTGCGC GAGCTGCGGC ACGACTGGAT CTCGGCGCTC 
TGCTTCGTGG CGGCGCTGGT GGGCGTGCTG GCGCCCATGC TGATCCTGCT CGCGCTGAAG
ACGGGTGCGC TCGACACGAT GGTCGAGCGG CTGGTCGACG ATCCGGCGAA CCGCGAACTG
CTGGCGGTGG GGGCCGGCGC GTATGACGAG GGCTTCTTCC GCTGGCTGGA GGCGCGGCCC
GAGGCGGGGT TCGTCGTGCC CGCCACGCGC AGCATCAACG CCCTTGCGGA TGCGGTCGTG
GCCTCCGCTC CCCGCCGCGA GATGGTGCGG GAGGTGCCGC TGGTGGTTTC GGCCGCGGGC
GATCCGCTGC TGGCGGGAGA TGTCGGGCCG GGTCGGGTCT GGCTGAGCGC GCCTCTCGCG
CGGTCTCTGG AGGTCGCGCC GGGTGGCGCG CTGACGATGG TGATCGGACG GCGCATCGAC
GGCCTCGAGC AGACGGCGCG GCGACCGCTG AAGGTGGCGG GGATCGTTCC GGCCGAGCGC
TACGGCCGCC CGGCGCTGTT CCTGTCGCTG CCGGACATGC TGGCGATCGA GCGGTTCCGC
GACGATCCGG CCGTCACGCC CGGAAGCTGG CTTCAGGCCG CCGCGCCGCC TGCGGCCTTT
GCCAGCTTCC GCCTCTATGC GCGGACGCTC GCGGATCTCG GGCCGCTCTC GGCGGCGCTG
GAGGGGCGCG GTGTTGCGGT GCGCCCCCGC GCCGAGAATG CGGCGCTGCT GCTGCAGCTG
CGGCGGGGCG CGGATCGGCT GTATCTTGCG GTCGCCGCAT TGGCCGCCGC CGGATTCTGG
GCCGCGATGA GCGCCAATCT CCGCGGCATG GTGGAGCGGC GGCGGCTGGC CTTCAGCCTG
CTGCGGTTGC TGGGCCTGAC GCCCGTCCAG CGCGCGACGG TTCCGCTGAT CCAGAGCCTC
GTGCTGATCG CGGCGGGGCT CGGGCTCTCG CTCGCCCTCG TTCTGCCGGC CGTGGCGCTG
ATCAACGCGA GCTTTCCCTC CGTGGCCGAA GGGGCGGCGC TCGCGCGCCT CAGGCCGGAC
CAGTTGGGGG GGGCGGCTGC GCTTGCCTGC GTGACGGCGC TGACCGCGGC GCTCTGGGCG
ATGGCGGCGG TGCTGCGGAT CCCGAGCGAG GAGGTGTTGC GTCATGGCTA G
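
The GC content field of a record like this can be recomputed from the displayed sequence. The sketch below assumes the sequence is pasted exactly as shown, in space-separated blocks of 10 bases, and that GC content means the fraction of G and C bases over the full gene length. The helper names are hypothetical, and only the first display line is used here as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence shown in blocks of 10."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First display line of the gene sequence; paste all lines to check
# the percentage reported for the whole gene.
first_line = "ATGCCCATGC CCCTCGTTCT GGCGCTGCGC GAGCTGCGGC ACGACTGGAT CTCGGCGCTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{100 * gc_fraction(seq):.0f}% GC")
```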
 
Protein sequence
MPMPLVLALR ELRHDWISAL CFVAALVGVL APMLILLALK TGALDTMVER LVDDPANREL 
LAVGAGAYDE GFFRWLEARP EAGFVVPATR SINALADAVV ASAPRREMVR EVPLVVSAAG
DPLLAGDVGP GRVWLSAPLA RSLEVAPGGA LTMVIGRRID GLEQTARRPL KVAGIVPAER
YGRPALFLSL PDMLAIERFR DDPAVTPGSW LQAAAPPAAF ASFRLYARTL ADLGPLSAAL
EGRGVAVRPR AENAALLLQL RRGADRLYLA VAALAAAGFW AAMSANLRGM VERRRLAFSL
LRLLGLTPVQ RATVPLIQSL VLIAAGLGLS LALVLPAVAL INASFPSVAE GAALARLRPD
QLGGAAALAC VTALTAALWA MAAVLRIPSE EVLRHG
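
The protein sequence can be checked against the gene sequence by translating codon by codon. This is a minimal sketch, not the database's own pipeline. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which start codons it permits, so the sketch uses the standard assignments and does not handle alternative start codons (this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 21 bases and the final display line of the gene sequence above.
print(translate("ATGCCCATGC CCCTCGTTCT G"))
# MPMPLVL
print(translate("ATGGCGGCGG TGCTGCGGAT CCCGAGCGAG GAGGTGTTGC GTCATGGCTA G"))
# MAAVLRIPSEEVLRHG
```

The two outputs match the first seven and the last sixteen residues of the protein sequence listed above.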