Gene Information |
Locus tag | Rsph17029_0522 |
Symbol | |
ID | 4897692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 547233 |
End bp | 549143 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640111106 |
Product | general secretion pathway protein E |
Protein accession | YP_001042410 |
Protein GI | 126461296 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
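The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are inclusive 1-based positions and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 547233
END_BP = 549143
GENE_LENGTH_BP = 1911
PROTEIN_LENGTH_AA = 636


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions, 549143 - 547233 + 1 gives 1911 bp, and 1911 / 3 - 1 gives 636 aa, which agrees with the record.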
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.450126 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCAGCAC TCCCCGATTC CCTGCCCCAG ATCCGGCCGG GCCGCGCGCC TGCAGCGCCG CGCCCCCTGC CCTCGCCCGC GGAGGCCGCG GAGCCGCTGG GCGTGATGCT GCTGCGCGAA GGGCATCTAG CGCCGCACCG GATCATGGCG GCCCTCAGTC ACGGCGGGCG GCCATCCGCG CCGCTGGCCG ATGTGCTTCT CGCCGAAGGC GCCCTGTCCG AGGACGAGAT CCTTGCCATG ATGGCGCGGC GGAGCGGGCT GCCGGTGCTC GACCCCGCGG CCGAGCGGCC CGATCCCCGG CTCATCGACC GGCTGGGGGT GCGCGACTGT CTGCGCGAGG GCCTTCTGCC CCTCCGCGAC ACGGGCAGCG CCGTCCTGCT GGCGGCGGCC GCCCCCGAGA GCTTTCGCCG CCACCGGCCG CGGCTCGAGG AGCTGTTCGG CACCGTGATC CCCGCCCTGG CCACCCGCTC GTCCATCGAG GCCGCGCTGC AGGAGGTGCG CGCGGACGCC ATCGGAACCG CGGCCGAACT TCGGGTCGCA CCGGAGGAAA GCTGCCGCGA CTGGCGCACC GGGCAGATGA CGCGGTTTGC GGCGCTGGCG GGCCTCGCGC TCGCCGCGGG CCTCGCCTTG GCACCGGGCC TCGTGCTGCT TGCCCTGACC GCCTGGGCGC TTCTGGCGCT GGCCTGCGGC ACAGCGCTGC GGCTGGCGAC CGCGGTGGCG AGCCTGCGCC GCCCTCCGCC CGAGCCCGAA AGCCCGCCGC TCCTGCATCT GCCGATGGTC TCGATCATCG TGGCACTCTA CCGCGAGGAG GATATCGCGG GCCGTCTCGT GGCGCGCCTC GGCCGCCTCG ACTATCCCCA CGACCGGCTC GAGATCCTGC TCGTGGTGGA AGAGGCCGAC CGACGGACGC GGCGGGCGCT GCTCGAGGCG CGCCTGCCGC CCTGGATGCG GATCGTGGTC TCGCCCAAAG GCGCGATCCG CACCAAGCCG CGGGCGCTCA ACGTGGCGCT CGACCATTGC CGGGGCTCCA TCGTGGGCGT CTACGACGCC GAGGATGCGC CCGAGCCCGA CCAGATCCGC CGCGTGGTCG AGGGCTTCAG CCGGCGCGGC TCTCACGTCG CCTGCCTGCA GGGACGGCTC GACTATTACA ACCCGCGCAC CAACTGGCTG TCGCGCTGCT TCACCATCGA ATATGCGGCC TGGTTCCGGC TGATGCTGCC GGGGCTCGAC CGGCTGGGGC TCGTGGTCCC GCTCGGAGGC ACCACCCTCT TCTTCCGCCG CGCGGCGCTC GAGGAGCTGG GCGCCTGGGA CGCGCATAAC GTGACCGAGG ATGCGGATCT CGGCATCCGC CTCGCGCGGC ACGGCTACCG CACCGACCTC ATCGACACGG TGACGGCCGA GGAAGCCAAC TGCCGCGCCA TCCCCTGGAT CAAGCAGAGA TCGCGCTGGA TCAAGGGCTT CATGATGACA TGGGCCGTCC ATATGCGCGC GCCGCGGCTG CTCTGGCGGC AGCTCGGCCC CTGGCGCTTC GCAGGCTTCC AGGTGATGTT CCTCGGCTCG ATCTCGCAGA CCCTGCTCGC GCCGGTGCTC TGGTCCTTCT GGCTGCTGGC GCTCGGCCTG CCGCATCCGG TGGCGCCGCT CGTGCCCGAG CCGCTGCTCT GGTCGATGAT CGGCCTTCTC ATCGGATCGG AGGGCACCGC CATTGCCATG GGCATCCTCG CCCTGCGGCA GACCCGGCAC CGGCTGAACC CGCTCTGGGT GCCGACCCTG CATCTCTACA ACCCGCTCGC CACCTTCGCG 
GCCTACAAGG CGCTGTGGGA GCTCCTGCGC GCGCCCTTCT ACTGGGACAA GACCTGCCAC GGGGTCTTCG ACGCCCAGAC CCGCGGCCGC CCTCTCCTGC AGCCCGCCTG A
|
Protein sequence | MPALPDSLPQ IRPGRAPAAP RPLPSPAEAA EPLGVMLLRE GHLAPHRIMA ALSHGGRPSA PLADVLLAEG ALSEDEILAM MARRSGLPVL DPAAERPDPR LIDRLGVRDC LREGLLPLRD TGSAVLLAAA APESFRRHRP RLEELFGTVI PALATRSSIE AALQEVRADA IGTAAELRVA PEESCRDWRT GQMTRFAALA GLALAAGLAL APGLVLLALT AWALLALACG TALRLATAVA SLRRPPPEPE SPPLLHLPMV SIIVALYREE DIAGRLVARL GRLDYPHDRL EILLVVEEAD RRTRRALLEA RLPPWMRIVV SPKGAIRTKP RALNVALDHC RGSIVGVYDA EDAPEPDQIR RVVEGFSRRG SHVACLQGRL DYYNPRTNWL SRCFTIEYAA WFRLMLPGLD RLGLVVPLGG TTLFFRRAAL EELGAWDAHN VTEDADLGIR LARHGYRTDL IDTVTAEEAN CRAIPWIKQR SRWIKGFMMT WAVHMRAPRL LWRQLGPWRF AGFQVMFLGS ISQTLLAPVL WSFWLLALGL PHPVAPLVPE PLLWSMIGLL IGSEGTAIAM GILALRQTRH RLNPLWVPTL HLYNPLATFA AYKALWELLR APFYWDKTCH GVFDAQTRGR PLLQPA
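The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), in which TTG is one of the permitted initiation codons and an initiation codon is read as methionine. The following is a minimal sketch of that translation, not the pipeline used to produce this record. The helper `translate` and its `is_start` parameter are hypothetical names, and only the first and last blocks of the gene sequence are used as input.

```python
# Codon table built in the conventional TCAG order. Translation table 11 uses
# the same amino-acid assignments as the standard code and differs only in
# its set of permitted initiation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons of translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(cds: str, is_start: bool = True) -> str:
    """Translate a DNA fragment read in frame from its first base.

    Whitespace between blocks is ignored. If is_start is True and the first
    codon is a table 11 initiation codon, it is emitted as M. Translation
    stops at the first stop codon.
    """
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3
    protein = []
    for n in range(0, usable, 3):
        codon = seq[n:n + 3]
        if n == 0 and is_start and codon in START_CODONS_TABLE_11:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks and last three blocks of the gene sequence in this record.
gene_head = "TTGCCAGCAC TCCCCGATTC CCTGCCCCAG"
gene_tail = "CCTCTCCTGC AGCCCGCCTG A"

print(translate(gene_head))                  # MPALPDSLPQ
print(translate(gene_tail, is_start=False))  # PLLQPA
```

Both outputs match the first ten and last six residues of the protein sequence above, and the final codon TGA is a stop codon that is not represented in the 636 aa protein.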