Gene Rsph17025_4101 details

Gene Information

Locus tag: Rsph17025_4101
Symbol:
ID: 5086274
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009430
Strand:
Start bp: 153746
End bp: 155914
Gene Length: 2169 bp
Protein Length: 722 aa
Translation table: 11
GC content: 69%
IMG OID: 640485664
Product: hypothetical protein
Protein accession: YP_001170258
Protein GI: 146280101
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR01007] capsular exopolysaccharide family
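
The listed gene and protein lengths follow from the coordinates above. The sketch below is only a consistency check, not part of the record. It assumes 1-based inclusive coordinates and an unspliced CDS whose length includes a single stop codon (the gene sequence ends in TGA). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields.
start_bp, end_bp = 153746, 155914
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # prints "2169 722"
```

Under these assumptions the result agrees with the Gene Length (2169 bp) and Protein Length (722 aa) fields.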


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.965422
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.2163
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCGGG TGGCGTCAGA TGAAGACGAG ATCGACCTGG GCCGGATCCT GGCCCAGCTC 
TGGGCGGGAC GCTTCCGGAT CGCGGGCTCC ACAGCGGCAG CGGCCGTCCT GGCGCTCGTT
CATCTCGCCG ACACGCCCCC CACCTTTCGG GCCGAGGCGC TCCTGCAGCT CGAAGAGAAG
GCGGCACAGG CCCTGCCGGC AGCGCTTTCC GACATTGCCG GTTTGGAGCC GCGCATCGCC
GCCGAGATCG AGATCCTGCG CTCGCATTCG GTGCTGGCGG AGGCGGTGGC GGCTCATCGT
CTCGATCTCC AGGCCCTGCC GGTCCAGGCG CCTGTCCTGG GCCATGCGGT GGCCTCGGGG
CGACTGCCGC TTCCCGATTC CGGAATTCTG GGCGCCTATG ATCGCGGTGA TGGACGGATC
CTGCTCGACC TGCTGGAGGT CCCTTCCGAG TGGGTCGATG AATGGATCCG GCTGACGGCG
ACGGGGAACG GGGGCTTCAC GCTTTCCCTG CCGGATGGAC GTCGGCTCGA CGGGCAGGTT
GGCGAGCCGC TCCTGTGGCC GGAAGCGGGG TTCGGCCTGC GGATCGTCGG GCTGGAGGCA
CCCGCAGGGC GACAATTCCG GCTTCGCCGG CAGGACGAGA TCGGGGCCAT CGAGGCATTG
CGCGGGCGGC TTTCGGTCAG CGAGCGCGGG CGGGGGGCCC TGATCCTCGA CGTGACCCTG
ACGGGCCCGG ATCCTGTCGA GGCCCAAGCT GCGCTGGCCG CGGTCACCGA CGCCTATCTG
CGGCAGAACA GGGGCCGGAG CGCGGCAGAG GCGCAGAGCA GTCTGGATTT CATCGAGCAC
CAGCTTCCCG GGGCGCGCGA AGCGGTTGCA AGGGTCGAGG ACCGGCTCGA GGCCTATCGT
CAGGCCCAGC ACACGCTCGC CCCTGACCTT GAAAGCCTGA GCCTTCTGAA CGAGATCCGT
GCGGCCGAGA CGGAGCTGCG CGAGCTGTCG GGGAAAGAGG AGGATCTGGC TCGTCGTTTC
ACACCCCTGC ATCCGGCCTA CCAGAAGCTC CTCGCCGCAC GGGCGCGGGC GGAGGACCGG
CTGGCCGGGT TGCGCCAGGA GGCGGCCGGT CTTCCGGAAA CCCAACGGGG TCTCTTCAAC
CTTTCACGTG AACTGGACGT TGCTCGCCAG GTTCATCTGG ACCTGCTGAA CCGGGCGCAG
GAACTGCGCG TCCTCACGGC GAGCACGCTC GGCAATGCCC GCCTCCTCGA TGCGGCCCGG
GCAAGACCCG CTCCCGTGGC CCCGCGACGG GGACGGGTGC TGGCACTTGC CCTTCTTCTC
GGCGCGCTTG GCGGCGCAGG GTATGTGCTC GGGCGCAACT GGCTGCAGGC GGTCATCCTC
GGCCCCGAGG ATCTCGACCG GCTGGGATTG CCGGTCTATG CCACCGTGCT TCTTGCGCCG
CAGGCGGTGC GTCAGCGAGG AGACCGCCGT CCCTGGCCGA TCCTGGCCCT GACCGATCCC
GATTGCGTCA CCCTGGAGGG GATCCGGTTG CTGCGGGCGG GGCTGCACTT CGGGCGTGAG
GCAGCACGCA GCCGCTCGGT CGGTTTCACC TCGCCATCCT CCGGAGCGGG CAAGTCCTTC
CTCGCAGCCA ACCTTGCGGT GGTTGAGGCA CAGGCGGGTC AGCGGGTCTG CCTCGTCGAC
ACCGACCTGC GGCGGGGGGA TCTCCGGCGC TACTTCGGGA TGTCGAGGGG AACGCCGGGG
CTGTCGGACT ATCTCTCTGG CTCGGCGGCC GTCGACGATC TTCTTCGCCC GGGGCCGGTG
GAGGATCTGA TGGTGCTGAC CGCAGGCCGA CTTCCACCAC ACCCTTCCGA GCTTCTCCTG
CGGCCGGCCT TCGCCCACCT CGTCGCGGAA CTCGACCGCC GCTTTGACCT TGTGATCTTC
GACATGCCCC CGGCGCTCGC CGTCACCGAC GCGGCCGTGA TCGGTCGCAC GATGGGGATC
ATGCTGGCGG TCCTGCGCCA CGCCATCACC GAGCGCGAGG AGGTGGAGGT CATGATCCGC
CAGATGCAGG GCGCCGGCGT GACACTCGGG GGCGCGGTGA TCAACGGCTA TCGGCCCTGC
GGCCGCCGAG GCGCCTACGG CTATCGCTAC GACGACAGCT ACAGCTATCG TTCCGGACAG
GAGGCCTGA
 
Protein sequence
MARVASDEDE IDLGRILAQL WAGRFRIAGS TAAAAVLALV HLADTPPTFR AEALLQLEEK 
AAQALPAALS DIAGLEPRIA AEIEILRSHS VLAEAVAAHR LDLQALPVQA PVLGHAVASG
RLPLPDSGIL GAYDRGDGRI LLDLLEVPSE WVDEWIRLTA TGNGGFTLSL PDGRRLDGQV
GEPLLWPEAG FGLRIVGLEA PAGRQFRLRR QDEIGAIEAL RGRLSVSERG RGALILDVTL
TGPDPVEAQA ALAAVTDAYL RQNRGRSAAE AQSSLDFIEH QLPGAREAVA RVEDRLEAYR
QAQHTLAPDL ESLSLLNEIR AAETELRELS GKEEDLARRF TPLHPAYQKL LAARARAEDR
LAGLRQEAAG LPETQRGLFN LSRELDVARQ VHLDLLNRAQ ELRVLTASTL GNARLLDAAR
ARPAPVAPRR GRVLALALLL GALGGAGYVL GRNWLQAVIL GPEDLDRLGL PVYATVLLAP
QAVRQRGDRR PWPILALTDP DCVTLEGIRL LRAGLHFGRE AARSRSVGFT SPSSGAGKSF
LAANLAVVEA QAGQRVCLVD TDLRRGDLRR YFGMSRGTPG LSDYLSGSAA VDDLLRPGPV
EDLMVLTAGR LPPHPSELLL RPAFAHLVAE LDRRFDLVIF DMPPALAVTD AAVIGRTMGI
MLAVLRHAIT EREEVEVMIR QMQGAGVTLG GAVINGYRPC GRRGAYGYRY DDSYSYRSGQ
EA
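
The sequences above are printed in space-separated blocks of 10, so they need whitespace removed before any downstream use. The following is a minimal sketch of that clean-up together with a GC-fraction calculation. The page does not say how its GC content figure was computed, so the usual definition, (G + C) / total bases, is an assumption here. The helper names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence, copied as displayed on the page.
first_row = "ATGGCTCGGG TGGCGTCAGA TGAAGACGAG ATCGACCTGG GCCGGATCCT GGCCCAGCTC"
seq = clean_sequence(first_row)
print(len(seq), gc_fraction(seq))
```

Passing the full gene sequence through `clean_sequence` should give a string whose length can be compared against the Gene Length field, and `gc_fraction` of that string can be compared against the GC content field.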