Gene RSP_3838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_3838 
Symbol 
ID4796555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_009007 
Strand
Start bp
End bp1839 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content70% 
IMG OID640102951 
Productcapsule polysaccharide export protein 
Protein accessionYP_001033800 
Protein GI125654606 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3524] Capsule polysaccharide export protein 
TIGRFAM ID[TIGR01010] polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTCTCGT GCCCCTGTCA TCGGGCCCGA GAGAGCGAGA GCCAACCGCA ACGGTTGTGC 
GAGCTGTGCC AGTACCCTGT CCTTGCTGTG TCGCACCGGG GTGCCGTCCA TTCTGCCCCG
TTAGCTTCTC TGGTGTCGCG CTCCGTGCCC TTGACTTTCC GGCTGGTCGC CACTCTGTCA
ACCGTTCTTC CGATTCCGGC CCTGACCTTC GCTCGGCCGA TTCCGGGCAC CGCCCGTTCG
CCGACCCCAT CTGGTCGCCC GGTCCCGCCG GGTGCCCCGC CGCGATTCCG CCGCCGGACG
GCCCCTGCCC TCCCCCCCCG CCCCGTCGCG GCCCCCGAAG TCGCCCCGAA ACCCGGCCCC
GGCGCGGCGC TCGGCCCGAG CGGCCTGCGG AGCGGCGAAC GTCCCGCCGC GGAGAGCCCG
AGCGCCGAAG AGACGCGCGG CCCGGGTCGT GCAGGAGACG CGCGGGCCGC AGACGCGCGG
GCCGGAGAAC TCCGCCCCGG CAGGCCTGCC GATGCGGGCC GGGCGGGCGG CGGCCAAGGC
TCAGGACAGG GCGACGGTTC CGGACCGGCG CGCGGCGCCG GGGGCTCAGG TCCGGCCGGC
AAGGCGGGCG GCAGGGGCCA GCCCGGCGGC GGCGGCAAGG CAGGCGAGGG CAAGACCGGC
GAGGGCAAGG GCCGCATCCT GCCCTCCTCC TTCAAGGTGC CGGCCGCCGC CCCCCGCGCC
GCGGCGCGGC TGCGCCACCA CGGCCTGCTC GCGAGCTTCC TCGGCCTCGT GCTGGCCCCG
ATCCTCGCGT CCGGCCTCTA TCTCTTCGCC ATCGCCGAGG ATCAATATAC CTCGACCGTG
GGCTTCTCGG TCCGCACCGA GGAGATGGGC TCGGCGCTCG ATCTTCTGGG CGGGCTGAGC
AGCTTCGGCC TCACCGGCGG CGGCTCGGCC TCGGATTCCG ACATCCTCTA CCAGTTCATC
CAGAGCCAGG AGCTGGTGCA GCGGATCAAC GAGCGGATCG ACCTGCGCGC AATCTATTCG
AAGCCCGGCT TCGATCCGGT CTTCAGCTTC GACCCCGACG GCGGGATCGA GGATCTGGTG
GATTACTGGA AGGACATGGT GCGGATCAGC TACGACAGCA CCACCGGGCT GATCGAGCTG
CGCGTCCATG CCTTCACGCC CGAGGACGCA CAGGCGGTGG CGCAGGGGAT CCTCGACGAA
TCGAACCGGA TGATCAACGA CCTGTCGGCC ATCGCCCGGG CCGATGCCAC GCGCTATGCG
CGCGAGGAGC TCGACAATGC GGTCGAACGG CTGCGCGTGC AGCGTGTCGC CATGACCGAA
TTCCGCTCGC GCACCCAGAT CGTCGATCCC TCGGCCGACA TCCAGGCCCA GATGGGCCTC
CTGAACACGC TCCAGCAGCA GCTCGCGTCG GCCAGCATCG ATCTCAACCT GCTGCGCCAG
ACCACCCAGC CGAGCGACCC GCGCATCGCC CAGAACGAAC GGCGCATCGG GGTGATCGAG
GAGCTGATCC AGCGCGAACG CGAGAAGTTC GGCCTGGGCG GCGGCACCGG CACCGGGGCC
AGCACCTATT CCACCATGAT CGCCGAGTTC GAGCGGCTGA CCGTCGATCT CGACTTCGCC
GAGAAGGCCT ATATCGCCGC GCTCACGAAC CACGACGCGG CCATCGCCGA GGCGCAGCGG
ATGAGCCGCT ATCTCGCGAC CTATGTCCGG CCCACCCTCG CCCAGCAGTC GCTCTATCCG
CAGCGCGGCC TGCTCACGCT GATGATCGGC GGGTTCGCTC TCATGCTCTG GGCGATCGGG
ATGCTGATCT ATTACAGCGT GCGCGACCGG CGCTGA
 
Protein sequence
MLSCPCHRAR ESESQPQRLC ELCQYPVLAV SHRGAVHSAP LASLVSRSVP LTFRLVATLS 
TVLPIPALTF ARPIPGTARS PTPSGRPVPP GAPPRFRRRT APALPPRPVA APEVAPKPGP
GAALGPSGLR SGERPAAESP SAEETRGPGR AGDARAADAR AGELRPGRPA DAGRAGGGQG
SGQGDGSGPA RGAGGSGPAG KAGGRGQPGG GGKAGEGKTG EGKGRILPSS FKVPAAAPRA
AARLRHHGLL ASFLGLVLAP ILASGLYLFA IAEDQYTSTV GFSVRTEEMG SALDLLGGLS
SFGLTGGGSA SDSDILYQFI QSQELVQRIN ERIDLRAIYS KPGFDPVFSF DPDGGIEDLV
DYWKDMVRIS YDSTTGLIEL RVHAFTPEDA QAVAQGILDE SNRMINDLSA IARADATRYA
REELDNAVER LRVQRVAMTE FRSRTQIVDP SADIQAQMGL LNTLQQQLAS ASIDLNLLRQ
TTQPSDPRIA QNERRIGVIE ELIQREREKF GLGGGTGTGA STYSTMIAEF ERLTVDLDFA
EKAYIAALTN HDAAIAEAQR MSRYLATYVR PTLAQQSLYP QRGLLTLMIG GFALMLWAIG
MLIYYSVRDR R