Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_0757 |
Symbol | |
ID | 5538223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 991074 |
End bp | 992150 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640892913 |
Product | glycosyl transferase family protein |
Protein accession | YP_001430896 |
Protein GI | 156740767 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.11098 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAACGT TTGATCCTGG CGTCACCATT GTGTTGATTA CCTACAATTC CGCTGCATAC GTTCGGGAAT GTTTACAATC GGTTCGTTGT GCAGCGCCTG AAGTGCATGT CCTGATTGTG GATAATGCTT CGACGGACAA CACGGTTGAG ATTGTGCGGC GCGATTTTCC TGAGTGTACG TTGGTGCCGC TTCCCCAGAA TATTGGACAC AGCGCCGCGT GCAATCTAGC GCTCCAGCGT GCCAAAACCG CCTGGGTACT GTTTCTTGAT CACGACACGA CGACACCTGT CGGCTGGCTT GAACCACTCC TCGCCATCGC AGCAGCCACG TGGCCCGACG TCGGAATGGT TGGCTCGCGC GCCGTACTCG TCGAACAAGG GCGCATTCAT CACGATGGCG GATATGCCCA TTATGTCGGT CATATGACCT TGCGAAACGG CTTTGCGCCT CTGGCAGATG TTGCCTGTGA CACAACACCG GTTGAAGTAG GAGCGCAAGC AAGTACGTCG CTTCTGGTAC ATCGAGAACG CGCACTTGCG GTAGGGGGAT TCGATCCGCG TTTTTTTATC TATCTGAATG ATTTCGATCT TTCACTCCGC ATGCGATTAC GCGGTTGGCG ATGCTACGTT GCGCCAGAGT CGGTCGTCTA CCACCGTCAG GGCAACCCAG AGACCAGTTG GCGCGGCAGC GGCGACTATC CAGAACGACG CGCGTATCTG ATCTATCGCA ATCGCTGGAT GCTGATCGCC AGAATGTACG CACTCCGAAC GCTCATGGTA TGCCTGCCAG CGCTTGTCGT CTATGAAATG GCGCTGGCGG CGATAGCGCT GCGCAAGGGA TGGCGGCGCG CATATTGGCG CGCTTTTCGT GATGTTGTGC GCCTCTGGCC CGCGTTGCGC TTCCACCGCA GCCGCATCCA GTCTTCACGC AGGATTTCCG ACCGTGAATT GCTTTCGGCT TACGGCTTTT CGTATGTTCC GGGCTTGCTC CGACATCCAG CTGAGCAAGC GATACAGCAG GTGTTCGAGC GAGCATTCGC TGCATACTGG CGCCTCGTCC AACCGCTTTT GGAGTGA
|
Protein sequence | METFDPGVTI VLITYNSAAY VRECLQSVRC AAPEVHVLIV DNASTDNTVE IVRRDFPECT LVPLPQNIGH SAACNLALQR AKTAWVLFLD HDTTTPVGWL EPLLAIAAAT WPDVGMVGSR AVLVEQGRIH HDGGYAHYVG HMTLRNGFAP LADVACDTTP VEVGAQASTS LLVHRERALA VGGFDPRFFI YLNDFDLSLR MRLRGWRCYV APESVVYHRQ GNPETSWRGS GDYPERRAYL IYRNRWMLIA RMYALRTLMV CLPALVVYEM ALAAIALRKG WRRAYWRAFR DVVRLWPALR FHRSRIQSSR RISDRELLSA YGFSYVPGLL RHPAEQAIQQ VFERAFAAYW RLVQPLLE
|
| |