Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2957 |
Symbol | |
ID | 5540448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 3836383 |
End bp | 3837570 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640895077 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001433035 |
Protein GI | 156742906 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000652955 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCACCTCG CCATCAATGG CATGTTCTGG TCACAACCGA CAGTCGGCAG CGGGCAGTAC CTGCGCATGC TGGTACACGC CCTGCCTGCC GTTGCGCGGA ATATTCGGCT GACGCTGCTG CTGCCCGCGT ATCGCGCGGT CACTGAACCA CTCCCGCCGG ACATTCAAGC AGTGCATGTG CAGACACCAT TCGATGGGCG CAGCGAAAAC CTGGCGAAGG TCTGGTTCGA GCAGGTCGCC GTCCCGCTCG CGGCGGCGCG CCTGCACGCC GATCTGCTCC ACGTTCCCTA TTTTGCGCCG CCGCTCGTTG CGCCGCTGCC AGCGGTCGTC ACCATTCTCG ACATCATCCC GCTCATTCTG CCGGAGTATC GCGGCAGTAC GGCGGTTCGC CTCTATGTGC GCCTGGTGGC GCGCGCCGCG CGGCAGACAA CGCAGATTAT CGCTATTTCG CAGCATAGCG CCGGGGACAT CATTCACCAC CTGGGCTGCT GCGCGGCGCG CGTGACGGTG ACGCCGCTCG CGGCTGGCGC ACAGTTCCAC CCCCGCGACC GCGCGTGCGC CGAACGAGAA GTTGCAGCGC GCTATGGCGT AACGCCACCG TTCGTCTACT ATGTCGGTGG GCTGGACGCG CGCAAGAACC TGGCAACCCT CGTACACGCA TTTGCACGTA TGCGCTACGC CGGAGGACCA CCCGCCACAC TGGTGATTGC CGGACGCGCG CCTGGCAGCG ATCCGCGCAT GTTCCCCGAC CTGGATGCAA TGATTGCATC CGCTGGCGCA GACTCGTTCG TGAGACGCAT CGACGTGCCT TACGAAGACG CGCCACTGCT CTATTCCGCC GCCACAGTGT TTGCCTTTCC GTCGCGGTAC GAAGGGTTTG GACTGCCGCC GCTCGAAGCC ATGGCGTGCG GTGCGCCGGT GATCGTCGCC GACGCAACCA GCCTTCCCGA GGTCGTTGGT GAGGCGGCGC TGCGGGTCCC GCCCGACGAT ACACCCGGCT GGATCACGGC GCTCTGGCGC GTGCTGGCGG ACGACACGCT GCGCGCCGAT CTCTCCCGAC GCGGGCTGGA GCGCGCGACC TGTTTTCGAC CCGAACGCCT TGCGCGCGAA ACGCTGGCGG TGTATGAACG CGCCCTGAAC TCAGGCGGAT GTCGTATTGC TCACAATGAC AGAACGACAT CGGTGTGA
|
Protein sequence | MHLAINGMFW SQPTVGSGQY LRMLVHALPA VARNIRLTLL LPAYRAVTEP LPPDIQAVHV QTPFDGRSEN LAKVWFEQVA VPLAAARLHA DLLHVPYFAP PLVAPLPAVV TILDIIPLIL PEYRGSTAVR LYVRLVARAA RQTTQIIAIS QHSAGDIIHH LGCCAARVTV TPLAAGAQFH PRDRACAERE VAARYGVTPP FVYYVGGLDA RKNLATLVHA FARMRYAGGP PATLVIAGRA PGSDPRMFPD LDAMIASAGA DSFVRRIDVP YEDAPLLYSA ATVFAFPSRY EGFGLPPLEA MACGAPVIVA DATSLPEVVG EAALRVPPDD TPGWITALWR VLADDTLRAD LSRRGLERAT CFRPERLARE TLAVYERALN SGGCRIAHND RTTSV
|
| |