Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_4088 |
Symbol | |
ID | 5211071 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 5125245 |
End bp | 5128343 |
Gene Length | 3099 bp |
Protein Length | 1032 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640597676 |
Product | glycosyl transferase family protein |
Protein accession | YP_001278382 |
Protein GI | 148658177 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.127971 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGAGC GCGCTCATCC TGACGATCCA GCCTTTCGTG GAACGATCTA CGAAGCGTGT CTTCGTGAGC GTTATGCCTT TTGCTTGCCG TTTGTCCATG GTCGAGATGT GCTTGATGTT CCTTGCGGGA CAGGATGGGG CAGTTCACTG CTGTCCGGCT ATGTTTCTCT CACCGGTCTT GATATTGATC ACGACGCTAT TGACTACGCG AGAAAGCATT ATGCCGGTAT TCGCTTTATT GAAGGTTCGA TGTGCCAGTT GCCTTTTGAG GATGCCAGTT TTGATACGGT TGTGTGCCTT GAGGGGTTAG AGCATGTCTA TCTGAGCGAT GCACAGCGCT TCTTGCATGA AGCGCACCGG GTTCTACGCG AGCATGGCAT CCTGGTAGTA ACGGCGCCAT TGCTCAACAA TGGCAGGCAT TCATCGAACC CTTACCATCT GTACGAATTT GTCGCTGCTG AGCTGAAATC GTTGCTTTCG CGTTATTTCA TCCCCATCCA CTGGGAGATT ATCCAGGGAG GCGATGGACC GGAAGCGCGT TTTGTCGGTC AGCGTAAGGC GGTTATTAAT GCTGGAGCAT TCAGGATGGC TTCATCGTTG ATGTTCGATC GCGTCTACGA CTGGGTGCTA TCGCTCACCG GAGAGCGCGG AGTGCGTTTT ACCGCCGGTG GCGAAGAAAG TATTATTGCG ACCAGTTGCG CAGTGCTGAT ACTTGAAGGA ATAGATCGCT TGCAGGATGT GTCGTCCTCC CTGCGCAACC AGTGGATCGC CTATCTCCAG AGTTGTCAGC GCCCCACCGA TGGTTTATTC GTTGATCCGT TGCTGGAACG TTTCCCGATC GAGAGCGCAA TCCACGATCA GGCGTATGTG CTCGATCAGA CAACCTATCT TGCTTTGCAG GCGCTTGATG CGCTGGGTAG CGCCCCGATT CATCCGGTAT CGGTTCAAGA GCGCTGGCCC AATCCACAGG CATTTATCGC CTGGATGGAA CGTCTGGACT GGTCGAATGC CTGGTTGCAG AGCAATCGGG TGATGTTTGC GCTGGCGTTC CTGGTGCATG CCGTCGAACA GTGCAATCAA CGGGAAGCAG CGTCAATCTA TCACGCCGCA CTCGACTGGC TTGATGCAAC ACAGGACCGC GAAACTGGCG TGTGGGGAAC ACGTCATGGT GCGTCATTGC TCAACGGGAT GGCAGCCGCT TACCATTTTC TGCCATTTTA TGAGTATGTG TGTCGTCCGA TTCAGTGTAT CAATCAACTG ATTGATACAA CACTTGCGTT GCAGCAATCG GATGGCTTGT TTGGTTCAGG ACTGGGGGGT GGAGCATGTG AAGACCTCGA TGCCATCGTA GTGCTCGCGG TTGCGGCACG TTACAGCCGC TACCGGGCTG AGGAGGTGAA GCGAGCAGCC ATACGCGCTT TCTGGGCGTT GTGGAATGCG CAAAACGAAG ATGGAGGATT CGGGTACGCA ATTCGTAGCG ATGATCAGGT CTATCGCTTC AGTAGCTGGG GCGCAGCAGA GAGCCGTGTG TGTTCGAGCG ATGTCTGGTC GTCCTTGGCG CGGCTCGTTG CCCTGGGTAC GATCCGGCAC TGGTTTCCCG ATGATACGCC TTCTTTGCCT TTGTGGCGTT TTCGGCGTTG GCCGGCTTTG GGATATCATC GATCAACCGA TAGACTGGAC GATAATGAGC GGGCACGCTT GAAGATATGG ATGCGTCCGC TCCCTGCACC CGAACGCCAT ACCGGGACTG AACCAGGGGT AAGTGTTATC ATTCCCTGCT ACAACCTGGG GCGCTATCTC TACGAGGCGC TGGCTTCGGC TTTGCAACAA ACATTGCAAC CGCTGGAAGT TATCGTTGTC GATGATGGCT CGACCGACGA CTACACCCGT CTGGTTCTTG ACACGATCGA CCATCCGCAG GTGCGCGTCA TCCGCCAGGA GAACTGCGGG CTGCCCGCTG CGCGCAATGC GGGTATTCGC ATGGCTCGCA GTCCATTCAT TTGCTGTCTT GACGCCGATG ATCGCTTGCT TCCCACGTAT TTTGAGCGAG TATTGCCATT ACTGGAGTCT GATCCGCAGG TGGGATTTGT CACCGGTCAC TACCGTGAAT TCGATGGACG ATCAGGAGTG GTGGCGCCTT CAACCTGCGC GTTACCCGAT ATGCTGGTCG TCAATCGAGC GATAGTCACG TCACTGTTCC GTCGGGAAGC ATGGGAGCGT GCTGGCGGGT ACTGCGAAGA ATTGAGCGGA ATGCACGACT GGGATCTCTG GATTGGCATC CTCGAAGCAG GATACCGTGC AGAGGTCGTG CCTGAGATAC TGTTCGAGTA TCGGGTACGT CCCGGCTCTA TGTATGCCAC GACCAGCCAG CCAGAGAACT ATGCCCGCCT TGTCGGGCAA ATAGTTGAGC GTCATGCGTC CCTCTACCAG CATTGGTGGC GTGATGTGGT CGTTCTGTAC GCTCGTGAAC ACGCCAGTCT TGCTGCATAT GCTGAAGGAC AGGGACGGTT GTCGTCGCAG ATCAGGAGCA ACAGTGCGCA GCAGGCGGCG AACTGGAAGC GTGCGGCGGA AGAGCGCGCG GCGTGGATGG CGGAACTGGA AGCGGCGCGC GATTACCACG CGCAGCAGGC GGCGAACTGG AAGCGTGCGG CGGAAGAGCG CGCGGCGTGG ATGGCGGAAC TGGAAGCGGC GCGCGATTAC CACGCGCAGC AGGCGGCGAA CTGGAAGCGC CTGGCGGAAG AGCGCGCGGC GTGGATGGCG GAACTGGAAG CGGCGCGCGA TTACCACGCG CAGCAGGCGG CGAACTGGAA GCATGTGGCG GAAGAGCGCG CGGCGTGGAT GGCGGAACTG GAAGCGGCGC GCGATTACCA CGCGCAGCAG GCGGTGAACT GGAAGCGTGC GGCGGAAGAG CGCGCGGCGT GGATGGCGGA ACTGGAAGCG GCGCGCGACT ACCACGCGCA GCAGGCGGCG AACTGGAAGC ATGTGGCGGA AGAGCGTGGG GCGTGGATAG CGGAATTGGA GCGCGGCTAC ATCCGTGTGC CGCGCAGTCG TGCCGTATGG CGCCGGTTGA TGCAACGCAA ACGAGTAAGA TCCACATGA
|
Protein sequence | MAERAHPDDP AFRGTIYEAC LRERYAFCLP FVHGRDVLDV PCGTGWGSSL LSGYVSLTGL DIDHDAIDYA RKHYAGIRFI EGSMCQLPFE DASFDTVVCL EGLEHVYLSD AQRFLHEAHR VLREHGILVV TAPLLNNGRH SSNPYHLYEF VAAELKSLLS RYFIPIHWEI IQGGDGPEAR FVGQRKAVIN AGAFRMASSL MFDRVYDWVL SLTGERGVRF TAGGEESIIA TSCAVLILEG IDRLQDVSSS LRNQWIAYLQ SCQRPTDGLF VDPLLERFPI ESAIHDQAYV LDQTTYLALQ ALDALGSAPI HPVSVQERWP NPQAFIAWME RLDWSNAWLQ SNRVMFALAF LVHAVEQCNQ REAASIYHAA LDWLDATQDR ETGVWGTRHG ASLLNGMAAA YHFLPFYEYV CRPIQCINQL IDTTLALQQS DGLFGSGLGG GACEDLDAIV VLAVAARYSR YRAEEVKRAA IRAFWALWNA QNEDGGFGYA IRSDDQVYRF SSWGAAESRV CSSDVWSSLA RLVALGTIRH WFPDDTPSLP LWRFRRWPAL GYHRSTDRLD DNERARLKIW MRPLPAPERH TGTEPGVSVI IPCYNLGRYL YEALASALQQ TLQPLEVIVV DDGSTDDYTR LVLDTIDHPQ VRVIRQENCG LPAARNAGIR MARSPFICCL DADDRLLPTY FERVLPLLES DPQVGFVTGH YREFDGRSGV VAPSTCALPD MLVVNRAIVT SLFRREAWER AGGYCEELSG MHDWDLWIGI LEAGYRAEVV PEILFEYRVR PGSMYATTSQ PENYARLVGQ IVERHASLYQ HWWRDVVVLY AREHASLAAY AEGQGRLSSQ IRSNSAQQAA NWKRAAEERA AWMAELEAAR DYHAQQAANW KRAAEERAAW MAELEAARDY HAQQAANWKR LAEERAAWMA ELEAARDYHA QQAANWKHVA EERAAWMAEL EAARDYHAQQ AVNWKRAAEE RAAWMAELEA ARDYHAQQAA NWKHVAEERG AWIAELERGY IRVPRSRAVW RRLMQRKRVR ST
|
| |