Gene RSP_3853 details

Gene Information

Locus tag: RSP_3853
Symbol:
ID: 4796547
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides 2.4.1
Kingdom: Bacteria
Replicon accession: NC_009007
Strand:
Start bp: 18938
End bp: 20710
Gene Length: 1773 bp
Protein Length: 590 aa
Translation table: 11
GC content: 75%
IMG OID: 640102967
Product: O-linked acetylglucosamine transferase
Protein accession: YP_001033816
Protein GI: 125654622
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family
TIGRFAM ID:
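
The length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(18938, 20710)
print(length)                     # 1773
print(protein_length_aa(length))  # 590
```

Both values agree with the Gene Length and Protein Length fields of this record.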


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGTGC AGCCCGCCCC GATCCTGCCC ATCGGCTCCG TCTCTCCCCC GCTCACGGCC 
GAGCAGCTGG TGGGTCTGGC CGAGGCCGCC CCCGCCGCGG CGATCGAGAT CTACCGCCGC
TGGCTCGCGC TCCATCCCGA GCGGCCCGAC GCCTGGATCG CCTGGTTCAA TCTCGCGGTG
CTGCTCGAGG CTGCGGGCGA GCCGCAGGGG GCGCTCGGCG CGGCCGCCAC CGCGCTCCGC
CAGAAGCCGG ATCTGTGGCA GGCGGCCCTC GCGGCCGGTC AGGCGGCCGA GGCGCAGGGC
GACCGGACGC AGGCGCTGGC CTTCCTGCGC CAGGTGCTGC CCCCGGCCGA GGGGCGGCGC
CAGCTCCACC GCCAGCTCGG CCGGATGCTC GAGGCCGAGG GCCGGCTCGC GGAGGCCGCC
GAGGAGCTGC GGGCCTCGCT TCTCCTCGAT CCCCGCCAGC CCGAGGTGGT GCAGCATCTC
GTCCATGCCC GCCAGAAGAT GGCGGCCTGG CCCCCGGCCC GGCTCGCCGT CCCCGGCCTG
ACCGAGGCCG AGGCCGAGCT GCAGTGCGGC CCGCTCGCCA CCCTCGCGCT GCATGACGAT
CCCCTTCGGC AGGGCGAGGT GGCCGCGGCC TGGATCGCCC GGCATGTGCC CGATCCGGGC
ATTCGGCTCG CCCCGGCCGG GGGCTACCGC CACGACCGGC TGCGGCTCGG CTATCTCTCG
TCGGACTTCT GCCGCCACGC CATGAGCTTC CTCATCGCCG AACTGCTCGA GCGCCACGAC
CGCAGCCGGT TCGAGGTGGT GGGCTACTGC GCCTCGCCCG AGGACGGCAG CCCCGAGCGC
GCGCGGGTGC TCGCCGCCCT CGACCGGCAT GTGCCGATCG GCCCCCTCTC CGACGAGGCC
GCGGCCCGGC GCATCCGCGC CGACGAGATC GACCTGCTGA TCGATCTCAA CGGGCTGACC
CGCGGCGCCC GGCCGGGCAT CCTGCGCTGG AAGCCCGCCC CGGTGCAGGC GACCTATCTG
GGCTATATCG GGCCGGTCCC GCTGCCCGAG CTCGACTGGC TGATCTGCGA CCGAGTGACC
GTGCCCGAGG CCGAGGCCGC CCATTACCGC CCGGCCCCGC TCCGGCTCGA GGGCTGCTAT
CAGGCCAACG ACGGGCAACG GCCCCTGCTG CCCGCCGTCG ACCGCCCGGG CGAGGGCCTG
CCGGAGGCCG CCTTCGTCTT CGCCTGCGCC TCGCATTTCT ACAAAATCAC CGAGCCCCTC
TTCGCCGCCT GGTGCCGGAT CGTCGCGGCC GTGCCGGGGT CGGTCCTGTG GCTCGTCGCG
GATACGCCCG AGGGGCAGGC GGCGCTGGCC GGCCGCTGGC AGGCGGCGGG CCTAGACCCC
CACCGGCTGA TCTTTGCCCC CCGCGTCGAT CCCGCCCGCT ACCGGGCGCG GCTGGCGCTG
GCCGACCTCT TTCTCGACAC GATGCCCTAC AATGCCGGGA CCATCGCCTC GGACGCGCTC
CGGATGGGGC TGCCCCTGCT CACGCTCGCG GGGCGGACCT TCTCGGGCCG GATGGCGGCG
AGCCTCCTCA CGGCGGTGGG GCTGGAAGAT TGCATCGCCC CCGACCTCGA GGCCTATGTC
GCCCGCGCTG TGGCGATCGC CACCGACCCG GCGGCGGCCC CCGCCCTGAC GGGGCCCGCC
CTCGCCGAGC GCTGGAGCCT CACCTTGGGC GACTGCCGCG ATTTCACCCG CCGTTTCGAG
GCGGCCCTGC TCTCGGTCGC CCGCCGCGCC TGA
 
Protein sequence
MTVQPAPILP IGSVSPPLTA EQLVGLAEAA PAAAIEIYRR WLALHPERPD AWIAWFNLAV 
LLEAAGEPQG ALGAAATALR QKPDLWQAAL AAGQAAEAQG DRTQALAFLR QVLPPAEGRR
QLHRQLGRML EAEGRLAEAA EELRASLLLD PRQPEVVQHL VHARQKMAAW PPARLAVPGL
TEAEAELQCG PLATLALHDD PLRQGEVAAA WIARHVPDPG IRLAPAGGYR HDRLRLGYLS
SDFCRHAMSF LIAELLERHD RSRFEVVGYC ASPEDGSPER ARVLAALDRH VPIGPLSDEA
AARRIRADEI DLLIDLNGLT RGARPGILRW KPAPVQATYL GYIGPVPLPE LDWLICDRVT
VPEAEAAHYR PAPLRLEGCY QANDGQRPLL PAVDRPGEGL PEAAFVFACA SHFYKITEPL
FAAWCRIVAA VPGSVLWLVA DTPEGQAALA GRWQAAGLDP HRLIFAPRVD PARYRARLAL
ADLFLDTMPY NAGTIASDAL RMGLPLLTLA GRTFSGRMAA SLLTAVGLED CIAPDLEAYV
ARAVAIATDP AAAPALTGPA LAERWSLTLG DCRDFTRRFE AALLSVARRA
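
The GC content field can in principle be recomputed from the gene sequence listed above. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases and that the spaces and line breaks in the listing are only layout; the function name gc_content is hypothetical, and how the database rounds the reported value is not stated in this record.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace used for layout."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence, used here only as a short demo.
print(gc_content("ATGACGGTGC"))  # 60.0
```

Passing the full gene sequence block to the same function gives the figure for the whole gene.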