Gene Information |
Locus tag | RoseRS_4518 |
Symbol | |
ID | 5211503 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 5665210 |
End bp | 5666250 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640598096 |
Product | glycosyl transferase family protein |
Protein accession | YP_001278799 |
Protein GI | 148658594 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
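
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(5665210, 5666250)
print(length_bp)                  # 1041
print(protein_length(length_bp))  # 346
```

Both results agree with the listed Gene Length (1041 bp) and Protein Length (346 aa). The strand field does not affect this arithmetic, since the length is the same on either strand.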
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000506598 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTGA TTGTCCAGAT CCCCGTGCTT AACGAAGCCG AGTCGATCGC GCGCGTCCTC GCCGACATCC CGCGCGATAT TCCCGGTGTT GATAGCGTCG AGGTGCTGAT CATCGACGAT GGTTGCACCG ATGATACCAT CGCTGTCGCA CTGGCGCACG GCGCCGATCA CGTGGTGCGG CACACCAGTC GCAAAGGGCT GGCGACCGCG TATCAGACCG GCATCGATAC GGCGCTCCGG CTCGGCGCCG ATATTATTGT CAACACCGAT GGCGACAATC AGTATCCGGG GTATGAGATT CCGCGCCTGG TCGCCCCGAT CCTCGCCGGT CAGGCGGATA TGGTGATTGG GGATCGCCAG ATCGAGAATA ATGCGCACTT TCCACCGCTG AAAAAGACCC TTCAGGTTAT CGGGAGCGGC GTGGTGCGCT GGGCGTCTGG CACCAATGTT CCCGATACGG TCAGCGGCTT CCGCGCACTG TCACGCGAAG CGGCGCTGCG TACATTTGTG ACCAGCGATT TCTCGTACAC GGTAGAGAAC CTGATCCAGG CGGGCAAGCG TCGTCTGACG ATCCAGACGG TGCCGATTGC AACCAATCCG GTGCGTCGTG CGTCGCGGCT GCACACCGGC AACTGGAACT TCATCAAGCG CCAGGCTTCG ACGATTGTGC GCACCTATAC GGCATATGAG CCGCTCAGAA CGTTCACGTA CATTGCGCTT CCTTTCCTGA TCACAGGCGT CATTTTTTTA GGGCGCGCGC TGTACGTGTA TGTCATGCGC CAGCTCGTCA ACAACTTCCC GCAGGATAAT GCCCAATCGC TGGCGGTTGG CAGCACGGCG CTCATTCTTG GCTTCATCAT TTTCCTGATC GGACTGCTGG CAGACCGGAT CGGCGGGACG CGGCGGCTCA TCGAGGAGGT GCTCTACCGT GTTCGCTCCC AGGAAATCGA TGACCTGGCG TGGCGGCGTG AGGTGCGGAC GCGTCTCGAT CAGATCGAGC AAAAACTGGC GGAGGAGCGA GAACTGAGAA CTGAGAACTG A
|
Protein sequence | MKLIVQIPVL NEAESIARVL ADIPRDIPGV DSVEVLIIDD GCTDDTIAVA LAHGADHVVR HTSRKGLATA YQTGIDTALR LGADIIVNTD GDNQYPGYEI PRLVAPILAG QADMVIGDRQ IENNAHFPPL KKTLQVIGSG VVRWASGTNV PDTVSGFRAL SREAALRTFV TSDFSYTVEN LIQAGKRRLT IQTVPIATNP VRRASRLHTG NWNFIKRQAS TIVRTYTAYE PLRTFTYIAL PFLITGVIFL GRALYVYVMR QLVNNFPQDN AQSLAVGSTA LILGFIIFLI GLLADRIGGT RRLIEEVLYR VRSQEIDDLA WRREVRTRLD QIEQKLAEER ELRTEN
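
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a rough sketch of how that could be done, together with a GC fraction calculation and a basic check for a complete coding sequence. All function names are hypothetical, and the start codon check accepts only ATG, which is a simplification: translation table 11 also permits alternative start codons such as GTG and TTG.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: length divisible by 3, ATG start, standard stop codon."""
    stop_codons = ("TAA", "TAG", "TGA")
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in stop_codons


# First three blocks of the gene sequence from the record, used as a short demo.
fragment = clean_sequence("ATGAAACTGA TTGTCCAGAT CCCCGTGCTT")
print(len(fragment))          # 30
print(gc_fraction(fragment))  # GC fraction of this fragment only
```

Running `gc_fraction` on the full cleaned gene sequence gives a value that can be compared with the GC content listed in the record.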
|
| |