Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_1627 |
Symbol | |
ID | 5539103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 2100952 |
End bp | 2102250 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640893764 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001431737 |
Protein GI | 156741608 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | [TIGR03087] sugar transferase, PEP-CTERM/EpsH1 system associated |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.154338 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.324472 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATTC TCCTGCTGAC CCAGATCGTT CCCTACCCGC CCGACAGCGG TCCGAAAGTC AAGACGTACC ACCTGCTGCG ACATCTGGCA GCGCGCTACA GGGTCACACT CGTCTGCTTC ACGCGCAATG CCCAGGAAGA AGCCGATGCC AACATCCTGC GCAACCTGGT TGCCGAAGTT CACACCGTAC CGCTTAGGCG ATCCACTGTT CGGAATGCGC TGACTCTGGC GGGCAGCCTG GTGCGCGGAC GCTCATGGAT CATCGAACGG GATGATAGCG CCACTATGCA TCGGTTGCTG ACACGCCTGG TGCGCGAAGC GGAGATCGCC GGACGACCGT ATGACCTGGT GCATGCGGAT CAACTCAACA TGGCGCAGTT TGCCGAACCG TTGCCGTTGC CGCGCCTGCT CGACGAACAC AACGCCGTGT GGACAGTCTT TCGCCGGGTA GCGGCACAGG AACGGGGTCT CAAGCGGCTC CTCTGGGAAC GTGAGTGGCG TCAGTTGCGC ACCTACGAGG GACGGGTCTG CCGCGAGTTC GAGGCGGTCA CAGCCGTCAG TCACGAGGAT CGTCAGGCAT TAATCGACGC GATGGGGATA GAGCGCCATA TTCCGGTCAT TCCGATTGCC GTCGATGCCG AACGGGAACA GCCAATTGCG CGTCAGCCGG ATGCGCGTGG CATCCTCAGC CTGGCGACGA TGATGTGGCC CCCGAATGTC GATGGCGTGC TCTGGTTTGC CCGCAGTATC TACCCGCTGA TCAAACAACA GGTTGAAGGA GTGCGCTTCT TTATCGTCGG GCAGCGCCCC GTACCAGAAG TGCGCGCACT GCCAGAACAA GATCCGACGA TTGAGGTGAC CGGCTACGTG CCAGACCCCA CGCCATACAT CGCCGCCTCT GCCTGCCTGA TCGTTCCCTT GCGCAGCGGC GGCGGCATGC GCGTGAAAAT CCTCGAAGCG CTGGCGCGCG GCATTCCGGT CGTTTCGACC ACAATTGGCT ACGAAGGGAT CGACCTGGTT CCCGGCGAAC ATCTGCTGGT CGGCGACACA CCAGAGGCGT TCGCCGATGC AGTCGTTCGC CTGCTGCGCG ACCCGGACTT CGGCGCGCAA CTGGCGGCAT CCGGGCGTCG GCGCCTGCTC GAACGCTACG ACTGGCGCGC TGTGTGCCCG GCAATGGATC GGGTGTATGA GCGGATGAAG AACCAAGAAC CAAGAACCAA GAACCAAGAA CCAAGAACCA AGAACCAAGA ACCGGAAACT GAGAACTGCA AGCCTATCCA AAGCGCCGAA GAGGGCTGA
|
Protein sequence | MNILLLTQIV PYPPDSGPKV KTYHLLRHLA ARYRVTLVCF TRNAQEEADA NILRNLVAEV HTVPLRRSTV RNALTLAGSL VRGRSWIIER DDSATMHRLL TRLVREAEIA GRPYDLVHAD QLNMAQFAEP LPLPRLLDEH NAVWTVFRRV AAQERGLKRL LWEREWRQLR TYEGRVCREF EAVTAVSHED RQALIDAMGI ERHIPVIPIA VDAEREQPIA RQPDARGILS LATMMWPPNV DGVLWFARSI YPLIKQQVEG VRFFIVGQRP VPEVRALPEQ DPTIEVTGYV PDPTPYIAAS ACLIVPLRSG GGMRVKILEA LARGIPVVST TIGYEGIDLV PGEHLLVGDT PEAFADAVVR LLRDPDFGAQ LAASGRRRLL ERYDWRAVCP AMDRVYERMK NQEPRTKNQE PRTKNQEPET ENCKPIQSAE EG
|
| |