Gene Rsph17029_3700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3700 
Symbol 
ID4898996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp811358 
End bp812566 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content71% 
IMG OID640114308 
Productglycosyl transferase, group 1 
Protein accessionYP_001045562 
Protein GI126464449 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.820196 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGACC TTCTGTCCCC GACCGCCGCC CGTGCGGCGC TCTCCGAGCC GCCGCGGGCC 
GAGCGTCCCG CGCCGTCCGC GGCCCGTCCG CTTGCGCAGG CGCCCCTGCG GATCCGCACC
GGGCACAGCT ACCATGTCTA TTTCACCATC GCGCTCGGCC AGCGGAACGC GCGCTTCGTC
GAGCGGGCGA TGACCCCGCT CAACCGCTAC TTCGCCTTCG CGGATTCCTT TGCCCTGACG
CCCGAGCCCG GCTTCGATGC GATCCACGCC TGGAACGCGG TGCCGCTGCT GACCCGGCGG
CCCTTCATCC TCACCTTCGA GGATTACATG CCCCGAACGC CGGACGACCG GCGCATTCCC
TGGGTCGAGC GGGCGCTGAC GCGGATCCTG CTCGGCGACC GGTGCCGCGG GCTTGTCGCC
ACCTCGGATT ATGCGCTGCG GCAGTTCCGC TGGCAGCACC GCGCGAACCC GCGCCTGCCT
GAGCTGCTGG CCAAGACCGA GCGCCTCTAT CCGGTGACGC CGCCCCGCCG CGACCGGCCG
AAGCCGCACT CCGACCGGCT GCGGCTGCTG TTCGTCGGGC GCGACTTCAT GCGCAAGGGC
GGTCCCGCGC TGATGGAGGC GCATGCGAGG CTGCGGGCGC AGGGCGTGCC TGTCGAGACC
ACGGTCGTCT CGGCGCTGCA GTGGTCGCCG CGCGACTATA TCGGTCCGCC GGATGCGGCC
TATGTCGCCG AGTGCCATGC CCGTCTGGAC CAGGAGGGGG TGATCTGGCA CCGGTCCCTG
CCGAGTGCCG AGGTCCACCG GCTGATGGAT GCGGCGGACT ATCTGATCTT CCCGACCTTC
CACGACACGT TCGGCTTCGT GACCCTCGAG GCCTTCGCCG GTGCCACGCC GGTCATCGCC
AGCGACACCT GCGTCCTGCC CGAGCTGATC GTGCCGGGCG AGAACGGCTT TCTCCTGCCG
TTCGAGAACG ACGGGATCGG CAAATGGGCC TGGCTCTACC GGCAGGCCGA GGCGGGCTAT
CTCGAGGCCT ACCGTGCGCA GGCCGGGCGT CTGGCGGAAG GGCTGGTCGA GACCTTGGGC
CGGGCCTGGG ACGGGCGCCG TGATTATGAG CGGCTCAGCG CGGGCGCGCT GGCGGCGGCG
CAGACGCGGT TCCACCCGGA CACGGCGCGG CGGCGGCTCG AAATCCTCTA CGAGCGGTTC
CGGGCGTGA
 
Protein sequence
MSDLLSPTAA RAALSEPPRA ERPAPSAARP LAQAPLRIRT GHSYHVYFTI ALGQRNARFV 
ERAMTPLNRY FAFADSFALT PEPGFDAIHA WNAVPLLTRR PFILTFEDYM PRTPDDRRIP
WVERALTRIL LGDRCRGLVA TSDYALRQFR WQHRANPRLP ELLAKTERLY PVTPPRRDRP
KPHSDRLRLL FVGRDFMRKG GPALMEAHAR LRAQGVPVET TVVSALQWSP RDYIGPPDAA
YVAECHARLD QEGVIWHRSL PSAEVHRLMD AADYLIFPTF HDTFGFVTLE AFAGATPVIA
SDTCVLPELI VPGENGFLLP FENDGIGKWA WLYRQAEAGY LEAYRAQAGR LAEGLVETLG
RAWDGRRDYE RLSAGALAAA QTRFHPDTAR RRLEILYERF RA