Gene Rsph17025_1712 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_1712 
SymbolmdoG 
ID5083951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp1759235 
End bp1760857 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content67% 
IMG OID640483270 
Productglucan biosynthesis protein G 
Protein accessionYP_001167910 
Protein GI146277751 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3131] Periplasmic glucans biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.15106 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGCAC ATTCCGCTCC CTCTGCCCGC CTGAACCGCC GGCTGCTGCT TTCGGCGGCA 
AGTTCGACCC TTGCCCTTGC CGCCACGGGG CTCGTCGGGC TTCCCCTGCG GGCCCAGGAG
GCGCCTGCGG AAGAGCCGCC GGCCTCCGTG CCGAGTGCTG CGCCGCAGCA GTTCAGCTTC
GACTGGCTGA CGGAAGAGAT GCGGGTCGCC GCCACCCAGC CGCATGTCGA GCCGGCGAAC
CTGACCGGCT TCCTCGGCGA TCTGCACTAT GACGACTACC GCCTGATCAA CTTCCGCAAC
GACCGCTCCC GCTGGGCGGA CACGGACAGC ATGTTCCGCA TCCAGGCCTT CCATCTGGGC
TGGCTGTTCG GCGCGCCCGT GCGGCTCTTT GACGTGACGG ATGGCTTCAT CCACGAGGTC
CAGTTCTCGA CCGACGACTT CGAATACCGC AACGAGCTTG CGACGCGGGT GGCGGCCCAT
GTGGATCTGC CCGGCGTGGC GGGCTTCCGG CTGAATTTCC CGCTCAACCG GCCCGACATC
CACGACGAAC TGGTGGCCTT CCTCGGGGCG AGCTACTTCC GCGCCCTCGG CCGCGGCAAC
GCCTACGGGA TCTCGGCGCG CGGACTGGCC ATCAACACCG CGACCTCCTC GCCCGAGGAG
TTTCCGCGCT TCTCGCGCTT CTACCTCGAG CGTCCGCAGG CCGGGGGCCT CTCGACCGTT
CTCTATGCCG CGATGGAGAG CCCGAGCGTC ACGGGCGCCT ATCGCTTCGT CATCACGCCG
GGCATCGAGA CGGTGATGGA TGTCACCGCG CGGCTCTTCT TCCGCAACGC CGTCACCCAG
CTTGGCGTGG CGCCGCTGAC CTCGATGTTC CTCTATGGCG AGAAGAACCG CGCCAGCTAC
GACGACTTCC GTCCGAACGT GCACGACAGC GACGGCCTGG CCATCCGCCG GCGCGAGGGC
GACCTGCTGT GGCGCCCGCT CAACAATCCG CCCCGGCTGG CGAGCAGCTA TTTCGGCGAA
GAGAACCCGC AGTCCTTCGG CCTTCACCAG CGCAAGCGGG CGTTCGAGGA TTACCAGGAC
GCCGAGGCGC ATTACGAACT GCGTCCGTCG GTCGATGTCG AGCCGGTCGG CGACTGGGGC
AAGGGTGTCG TCCGTCTGGT CGAGATCCCG ACGCGCTACG AGACGAACGA CAACATCGTG
GCCTTCTGGG TGCCCGAGGG GAAGATCGCC GCCGGAGATG CGCGCGAATT CGCCTACCGG
CTCCGGTGGG GCGCCCTGCC CATCGAGACA CCGGCGGACC TGGCCCATGT GTGGGAGACG
CGGGCGGGTC ACGGCGGCGT CTCAGGGGTG GAAAATACCG CGGGAACGCG CAAGTTCGTC
GTTGATTTCA AGGGCGGACT TCTGGGCAAC CTGCCGCGCA ACGCCAAGGT GGAAGCCATC
ACGTCGGTGC AGCATGGCGA AATCGTCACG CAGACCCTTG AACGACTCGA CGGGATGGAC
ATATGGCGTC TCGTCCTGGA CGTGGCCGCA GCCGAGGGGG CCACGGTGGA ACTGGCCAGT
CACATCGCCG GTTATGGACG CAAACTCTCG GAAACTTGGC TCTACCAGTG GAACAAAGCC
TGA
 
Protein sequence
MHAHSAPSAR LNRRLLLSAA SSTLALAATG LVGLPLRAQE APAEEPPASV PSAAPQQFSF 
DWLTEEMRVA ATQPHVEPAN LTGFLGDLHY DDYRLINFRN DRSRWADTDS MFRIQAFHLG
WLFGAPVRLF DVTDGFIHEV QFSTDDFEYR NELATRVAAH VDLPGVAGFR LNFPLNRPDI
HDELVAFLGA SYFRALGRGN AYGISARGLA INTATSSPEE FPRFSRFYLE RPQAGGLSTV
LYAAMESPSV TGAYRFVITP GIETVMDVTA RLFFRNAVTQ LGVAPLTSMF LYGEKNRASY
DDFRPNVHDS DGLAIRRREG DLLWRPLNNP PRLASSYFGE ENPQSFGLHQ RKRAFEDYQD
AEAHYELRPS VDVEPVGDWG KGVVRLVEIP TRYETNDNIV AFWVPEGKIA AGDAREFAYR
LRWGALPIET PADLAHVWET RAGHGGVSGV ENTAGTRKFV VDFKGGLLGN LPRNAKVEAI
TSVQHGEIVT QTLERLDGMD IWRLVLDVAA AEGATVELAS HIAGYGRKLS ETWLYQWNKA