Gene Information |
Locus tag | Rcas_1700 |
Symbol | |
ID | 5539178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 2194244 |
End bp | 2195182 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640893839 |
Product | glycosyl transferase family protein |
Protein accession | YP_001431810 |
Protein GI | 156741681 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
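
The length fields above are consistent with the coordinates, assuming Start bp and End bp are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. A minimal sketch of that arithmetic (the function names are illustrative, not part of the database):

```python
# Coordinates copied from the record above.
# Assumption: both positions are 1-based and inclusive.
START_BP = 2194244
END_BP = 2195182


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end positions."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```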
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0275555 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGATCGACA TCATCATTCC AAACTACAAT GGCGCCTCTC TTCTGCCGAC ATGCCTCGAC TCGCTGCGGG CGCAGACACG GCGCGATTTC TGCGTGGTTG TCGTGGATGA CGGTAGCGCC GATGATTCGG TCGCTCTGGT GCAGCGGCGT TATCCTGAAG TGCAGGTCAT CGCACTGCCG CGCAATCGCG GGCTGGCGGC GGCGGTCAAT ACCGGCATCG AAGCGACCGG TGGGGAGTAT GTTGTCCTGC TCAACAACGA CACCGAGGCG CATCCGCGTT GGCTGGAACA TCTGATCGGC GCATTGGATC GCTATCCGGC ATATGCGTTT GCCGCCAGCA AAATGATGTT GTTCGACCGG CGCGACCACA TTCATTCAGC GGGCGATTAC TATCGCTTCG ATGGCGTTCC CGGCAGTCGC GGCGTCTGGC AACGCGATGT GGGGCAGTTC GATGTGATGG AAGAAGTGTT TGGTCCCTGC GCCGGCGCTG CGGCATATCG GCGAGCGGCG CTGGAGGAAC TGGCGGAGGA CGGCAGGGTC TTCGATGAGG ACTTGGTGAT GTATTGTGAA GATGTCGATC TGAACGTGCG CGCCAGATTG CGCGGTATGC GCACCGTATA CGTGCCGCGT GCCGTGGTAT ATCATCGATT GAGCGCAACG GGGGGCGGCG CGCTGGCGAG TTACTACTGT GGGCGCAACT TTATGCTTGT GTGGACAAAG AATATGCCGG CGCCCCAGGT GCGACGCTAC CTGCCCCTGC TGATCTGGTC GCAGATCCGG TTTGTCGTGC ATTCACTCTG GCATATCCGC GAACCGGCAG CGCGCGCCCG TTTGCGTGGT CAGTTTGATG GGTTGCGTAC ACTGCCTCGA TTTGTGCGCA AGCGGCGCCA CACGACAGGC AGGGATGCCG GTCGGCTTGT TGTTGAAACA CGCTATTGA
Protein sequence | MIDIIIPNYN GASLLPTCLD SLRAQTRRDF CVVVVDDGSA DDSVALVQRR YPEVQVIALP RNRGLAAAVN TGIEATGGEY VVLLNNDTEA HPRWLEHLIG ALDRYPAYAF AASKMMLFDR RDHIHSAGDY YRFDGVPGSR GVWQRDVGQF DVMEEVFGPC AGAAAYRRAA LEELAEDGRV FDEDLVMYCE DVDLNVRARL RGMRTVYVPR AVVYHRLSAT GGGALASYYC GRNFMLVWTK NMPAPQVRRY LPLLIWSQIR FVVHSLWHIR EPAARARLRG QFDGLRTLPR FVRKRRHTTG RDAGRLVVET RY
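
The protein sequence can be checked against the gene sequence by translating it with translation table 11, as listed in the record. The sketch below is one way to do this in plain Python. It assumes the sequences are pasted in with their 10-character spacing, and it relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, keeping '*' for stops."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three 10-base groups and the last nine bases of the gene sequence.
    print(translate("ATGATCGACA TCATCATTCC AAACTACAAT"))
    print(translate("CGCTATTGA"))
```

Translating the full gene sequence and dropping the trailing `*` should give a string that can be compared with `clean()` applied to the protein sequence; `gc_fraction` on the full gene sequence can be compared with the GC content field in the same way.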