Gene Rsph17025_0659 details


Gene Information

Locus tag: Rsph17025_0659
Symbol:
ID: 5083003
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 660884
End bp: 662770
Gene Length: 1887 bp
Protein Length: 628 aa
Translation table: 11
GC content: 71%
IMG OID: 640482216
Product: general secretion pathway protein E
Protein accession: YP_001166870
Protein GI: 146276711
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
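
The start, end, gene length, and protein length fields above are consistent with one another, which a short sketch can confirm. It assumes inclusive, 1-based coordinates and a CDS that ends in a single stop codon not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(660884, 662770)
print(length_bp)                  # 1887
print(protein_length(length_bp))  # 628
```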


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.814938
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.16836
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGCCTT CCAAGGCCGC GCCTTTGCCC GAGGCCCCGG TTCCCGACGC CTTGGGCGTG 
ATGCTCCTGC GGCAGGGCCA CCTTGCGCCG CACCGGATCA TGGGCGCCCT CCGGCGCAGT
TCGGGCCATG CGGCGGGACT TGCCGACGTG CTTCTGGCCG AAGGGGCCAT GGATGAGGAG
GAAATCCTCG CCCTGACCGC CCGCCAGAGC GGCCTGCCGC TGCTCGATCC GGCAACGGGG
GCCGCCGATC CGCGGCTGAT CGACCGCCTC GGCGTTCGGA CCTGCCTGCG CGAGACGCTT
CTGCCGGTGC ATGACGTCGG CGGCGCCGTG CTTATCGCGG CCCCGTCGCC CGAGAGCTTC
CGGCGCCACG GCCCGCTGCT TGGGCAACTC TTCGGGCGCG TGATCCCCGT GCTGGCCACG
CGAACCGCGA TCGAGGGCGC GCTGCATGCG GCGCGCGCCA CAGCCATCGG ACTTGCGGCC
GAAACCCGCG TCACGCCCGG CGAAAGCTGC CGGGGCTGGC GCACCGGGCG GGCGACGCGG
CTTGCCCTCG GCACGGGCCT CGGCCTTGCC GCCGGGCTGA TGCTGGCGCC GGGCCTGGTG
GTGCTGGCGC TTTCCCTCTG GGCGCTGTTC GCCATGACCT GCGGCACCGC CCTGCGGATC
GCCACCGCCA TAGCCACGCT CCGACGCCGC CCGGCGGACC CGCCCTGCCC GCCCCTCCTG
CGGCTGCCCA TCGTCTCGGT GATCGTCGCC CTCTATCAGG AGGAGGATAT CGCCGGCCGC
CTGGTGGCGC GGCTCGGGCG GATCGACTAT CCGCATGACC GGCTCGAGAT CCTGCTTGTC
GTCGAAGAGG CCGATCTGCG CACCCGCAAG GCTCTGGTCG AGGCCCGCCT GCCGCCCTGG
ATGCGGATCG TGATCTCTCC CGCCGGCGCC ATCCGCACCA AGCCGCGTGC GCTCAACGTG
GCGCTCGACC ACTGCCGCGG CTCGATCGTG GGCGTCTATG ACGCCGAGGA TGCGCCCGAC
CCCGACCAGA TCCGCCGCGT GGTCGAGGGG TTCAGCCGCC GCGGCTCGCA GGTGGCCTGC
CTTCAGGGCC AGCTCGACTA TTACAACCCG CGGACCAACT GGCTCTCGCG CTGCTTCACC
ATCGAATATG CCTCCTGGTT CCGCCTGATG CTGCCGGGGC TCGACCGGCT CGGGCTTGCG
GTGCCGCTGG GGGGAACGAC GCTCTTCTTC CGCCGCGAGG CGCTCGAGGA TCTGGGAGCG
TGGGACGCCC ACAACGTCAC CGAGGACGCC GATCTCGGCA TCCGCCTCGC GCGCCATGGC
TACCGGACGG ACCTGATCGA CACGGTGACG GGCGAAGAGG CGAACTGCCG CGCGCTCCCC
TGGATCAAGC AGCGCTCCCG CTGGATCAAG GGCTTCATGA TGACCTGGGC CGTCCACATG
CGCGATCCGG TGCTTCTGTG GCGGCAGCTG GGCCCCTGGC GCTTTGCCGG CTTTCAGGTA
ATGTTCCTCG GCTCGCTGTC GCAGACGCTG CTGGCCCCGG TCCTGTGGTC GTTCTGGCTG
CTGGCCCTCG GCCTGCCGCA TCCGGTGACG CCGCTTCTGT CCACGCCGGC CCTCTGGGCC
ATCGTCGGCC TGCTCCTCGG AGCCGAGGGG ACGAGCATCG CGCTCGGCAT CCTCGCGCTG
CGCCTCACGC GGCACAAGCT CAACCCCCTG TGGGTGCCGA CGATGCATCT CTACAACCCG
CTGGCCACCT TTGCCGCCTA CAAGGCCCTG TGGGAGCTTC TCCGCGCCCC GTTCTACTGG
GACAAGACGC GCCACGGCCT CTTCGACGGC TCTTCGCGGG GGCCGGCGGC CTGGGTGCCG
CGGCTGAGGG GCCAACGCGC GGCCTGA
 
Protein sequence
MPPSKAAPLP EAPVPDALGV MLLRQGHLAP HRIMGALRRS SGHAAGLADV LLAEGAMDEE 
EILALTARQS GLPLLDPATG AADPRLIDRL GVRTCLRETL LPVHDVGGAV LIAAPSPESF
RRHGPLLGQL FGRVIPVLAT RTAIEGALHA ARATAIGLAA ETRVTPGESC RGWRTGRATR
LALGTGLGLA AGLMLAPGLV VLALSLWALF AMTCGTALRI ATAIATLRRR PADPPCPPLL
RLPIVSVIVA LYQEEDIAGR LVARLGRIDY PHDRLEILLV VEEADLRTRK ALVEARLPPW
MRIVISPAGA IRTKPRALNV ALDHCRGSIV GVYDAEDAPD PDQIRRVVEG FSRRGSQVAC
LQGQLDYYNP RTNWLSRCFT IEYASWFRLM LPGLDRLGLA VPLGGTTLFF RREALEDLGA
WDAHNVTEDA DLGIRLARHG YRTDLIDTVT GEEANCRALP WIKQRSRWIK GFMMTWAVHM
RDPVLLWRQL GPWRFAGFQV MFLGSLSQTL LAPVLWSFWL LALGLPHPVT PLLSTPALWA
IVGLLLGAEG TSIALGILAL RLTRHKLNPL WVPTMHLYNP LATFAAYKAL WELLRAPFYW
DKTRHGLFDG SSRGPAAWVP RLRGQRAA
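
The GC content reported in the gene information can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as text in space-separated 10-base blocks like those under "Gene sequence"; `gc_content` is a hypothetical helper. Only the first line of the gene sequence is used here for illustration, so the printed value is not expected to match the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence only (illustrative fragment).
first_line = "ATGCCGCCTT CCAAGGCCGC GCCTTTGCCC GAGGCCCCGG TTCCCGACGC CTTGGGCGTG"
print(round(gc_content(first_line), 1))
```

To check the full gene, the complete sequence text would be passed to `gc_content` in place of the fragment.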