Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3683 |
Symbol | |
ID | 4898510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | + |
Start bp | 788999 |
End bp | 790003 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640114291 |
Product | glycosyl transferase family protein |
Protein accession | YP_001045545 |
Protein GI | 126464432 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0415394 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.880044 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATCTGC TCTTCTGCGC CGACCGGCCG TTCTTCCGGC ATGCCGCGGT GGCGGCCGTG TCGGCCGCAA GCGCGACCCG AGGGCCCCTG CAGGTGCATC TTCTGACCTG CGACAGCTGT CCCGAGGAGG AGGCGCGCTT CCGGGCCGCG CTTGCGCCCT TCGCCCATGT CGGGATCTCG GTCCACCGGG TGCCGGCGAC CCGGCTCGAG GGGCTCTTCG TCGACCGGCA CCTGAGCGCT GCGGCCTATC TGCGCTTCCT CGCCCCGGAG GTCCTGCCCG AGGCGGTGCA GCGCGTCCTC TATCTCGATT GCGATCTGAT CGTGCTCGAC GATGTGGCCA AGATCCTGAG CATCGATCTC CAAGGCAGGG CCGTGGCGGC GGCGCCCGAT CTCGGATGGA AGGATGCCGC GCAGGCGGCG CGGTTCCGCA CCCTCGGCAT CCCGCTCGAC CGGCCTTATG TGAATTCCGG CGTTCTGCTG ATGGACCTCG GCCGCTGGCG CCGGGACGGG CTGTCGCAGA AGCTGTTCGA CTATGTCGCG CGCCACGGCT CGCTGCTGCT GCGCCACGAT CAGGACGCGC TCAACGCGGT GCTGGCCGAC GATATCCATC TGCTCGACCG GCGGTGGAAC CTGCAGGTGC TGCTGCTGAG CCCCTGGGCG AAGCGGGCCT TGCCGGAGGA TCGGCAGGCC ACGGTCGCGG CCCGGCGCGA TCCGGCGATC CTGCATTTCT CGACGGCCGA GAAACCGTGG AACTTCCGCG TCTGGACGCG GCGGAGAGAG CTTTATTTCC GGTTCCGTGC GCGCACGCCC TGGAGCCGCG CCGTGCCCGA GGGGCTGAGT GCCGCGCAGG CCTGGGAATA TGATCTGGCG CGCAGGCTGC TGCGCCTCGG CCTCGACCTC TATCTTCTGC GGGGCGCGGC CCTGCGCCTG CGCAGGATCC TTCTGGCACG CATGGAGCGC GGGCGGACCT GGCCGGGCGC CGCACGGAGG CCGATGAACC GATGA
|
Protein sequence | MHLLFCADRP FFRHAAVAAV SAASATRGPL QVHLLTCDSC PEEEARFRAA LAPFAHVGIS VHRVPATRLE GLFVDRHLSA AAYLRFLAPE VLPEAVQRVL YLDCDLIVLD DVAKILSIDL QGRAVAAAPD LGWKDAAQAA RFRTLGIPLD RPYVNSGVLL MDLGRWRRDG LSQKLFDYVA RHGSLLLRHD QDALNAVLAD DIHLLDRRWN LQVLLLSPWA KRALPEDRQA TVAARRDPAI LHFSTAEKPW NFRVWTRRRE LYFRFRARTP WSRAVPEGLS AAQAWEYDLA RRLLRLGLDL YLLRGAALRL RRILLARMER GRTWPGAARR PMNR
|
| |