Gene Rsph17025_4200 details


Gene Information

Locus tag: Rsph17025_4200
Symbol:
ID: 5086371
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009430
Strand:
Start bp: 241587
End bp: 242831
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 69%
IMG OID: 640485761
Product: hypothetical protein
Protein accession: YP_001170355
Protein GI: 146280198
COG category: [R] General function prediction only
COG ID: [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes
TIGRFAM ID:
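
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 241587
end_bp = 242831
gene_length_bp = 1245
protein_length_aa = 414

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length counts the terminal stop codon,
# which does not contribute an amino acid to the protein.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values.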


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.250516
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGAGG CGCAGGTGGT TCTCGGCGAA GGCAAGATCC CCGTCGGCCG GCTTGTCTTC 
GAGCATGACG GCCGCCGCTC CCATTCCACC TTCCTCTACG ACCGGGCCTG GCTCGAGAAC
CCGCGCGGGT TCGATCTCTC GCCGCAGATG CCGCGGGCGG TCGTGCCCTA CACGGCCGCC
ACAGGCGGAC GCGACAGCCG CAAGCAGGAC GTGATCGCGG GTCCTTTCAG CGACAGCTCT
CCCGACAGCT GGGGCCGCAA GCTGATGCGC CGCGTGCTGG GCGAAGGCGC AACCGAGTTC
GATTTCCTGA TCTGCTGTGA CGACACAGCG CGTCAGGGGG CGCTGCGCTT CCTGGATGAC
AATGGCCGGC TTTTTGGCAC GGGCGGACCG CCCGTGCCAC GGCTGATGGA TCTCGAGGAG
CTGCGCGCCA TAGCGGCGCG CTTCGAGGCC GACCCGGCAG GGGCCGAGGA TGCTGCGCGC
GAGCTCGTGG GCGCCGCTGG CTCGCTGGGC GGGGCACGCC CGAAAGCCAA CCTCAGGGAC
GGCGCCGATC TCTGGATCGC CAAGTTCACC TCGATCAACG ACACCTGGCC GGTCGAGCGG
CTCGAAATCG CAACGCTGAA ACTGGCCCGC GACCTCGGCC TGCGTGCGCC CGACGCGCGA
CTTGCGTTGC CCGCGAGCGA ACGTCCCGTG GCGCTCATCC GACGCTTCGA CCGTCGGGCA
ATCGAAGGCA GGCCCGGCCG TATACCCTAC ATTTCCGCTC GCACGGCGCT CGGACATGTT
GGCGGCGGCA CAGGCAGCTA CACCGACATC GCCGATGCGA TCCGCGCAAT CTCCGTGCGG
CCGGCGGACG ACATGCGCGA GCTGTGGTGC CGAATGCTCT TCGGCATCCT TTGCACCAAT
ACCGACGACC ACCTGAAGAA CCACGGCTTC ATCTATGCCG GCAACAACCT CTGGCGCCTC
TCGCCCCTCT TCGACGTGAA CCCGCAGCCG CGGCGGCACC CGCAGCTCGA GACCGCCATC
AGCCCGATCC ACGGCCATGA GCCGGCCATC GAGGCGGCGA TCGAGGCAGC GCCCTTCTTC
GATCTTGACG AGACCGAGGC CCGCCAGAGG GCGCGGGACA TGGCGGTCGC GCTCGCCGCC
GGCTGGCGCG ACGCACTGCG ACGTGAGGGG ATCACGGGAC CGGCGCTCGC CGCCTGCGCG
CCGGCCTTCG AACATGACCG CCTGGAGGCG GCTCTGGCGC TCTGA
 
Protein sequence
MPEAQVVLGE GKIPVGRLVF EHDGRRSHST FLYDRAWLEN PRGFDLSPQM PRAVVPYTAA 
TGGRDSRKQD VIAGPFSDSS PDSWGRKLMR RVLGEGATEF DFLICCDDTA RQGALRFLDD
NGRLFGTGGP PVPRLMDLEE LRAIAARFEA DPAGAEDAAR ELVGAAGSLG GARPKANLRD
GADLWIAKFT SINDTWPVER LEIATLKLAR DLGLRAPDAR LALPASERPV ALIRRFDRRA
IEGRPGRIPY ISARTALGHV GGGTGSYTDI ADAIRAISVR PADDMRELWC RMLFGILCTN
TDDHLKNHGF IYAGNNLWRL SPLFDVNPQP RRHPQLETAI SPIHGHEPAI EAAIEAAPFF
DLDETEARQR ARDMAVALAA GWRDALRREG ITGPALAACA PAFEHDRLEA ALAL
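
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the helper names `translate` and `gc_fraction` are hypothetical, and only the first 60 bases of the gene sequence are used. The codon table is the standard one, whose amino acid assignments are the same in translation table 11 (table 11 differs in the set of permitted start codons, which this sketch does not model).

```python
# Standard genetic code in NCBI order (bases T, C, A, G at each codon position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from this record (60 bases).
first_line = "ATGCCCGAGG CGCAGGTGGT TCTCGGCGAA GGCAAGATCC CCGTCGGCCG GCTTGTCTTC"

print(translate(first_line))
print(round(gc_fraction(first_line), 2))
```

The translated fragment corresponds to the first 20 residues of the protein sequence listed above (MPEAQVVLGE GKIPVGRLVF). Passing the full gene sequence to the same functions would allow the listed protein sequence and GC content to be checked in the same way.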