Gene Information |
Locus tag | Rcas_3647 |
Symbol | |
ID | 5541149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 4773406 |
End bp | 4774560 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640895767 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001433714 |
Protein GI | 156743585 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.79128 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000752114 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | GTGAAAAACC TGGTACGCCG CGCGCTCAGG CGCTTTGGAT CATATTCCGA AGGATACGGC GCAGGCTGGT CATTGGGCGG ACAACCGCGG ATTCTCTTCG TCAGCGGCAT GGACGGCGCG CCGTTGCGGT ATCGTGTGTT GCACCAGGCA GAGCAGATCG CCCTCGCCGG CGGATCATGG ATGCTTGTGC GCGACACGGA GAGCCGACTG AGAGAGTGCG TGCAGCAATG CGACATTCTG TATCTGTACA AAGCGGGGAC CACGTTGCAG GCGTGTGAGG CGGTGCAGAC AGCGCGCAGG AATGCGCTCC CGGTTGTGTA CGACACCGAT GACCTGAACT GGGATGAGCG GTTGGTCGAA TACTGCGATC TTGAACGGTA CTATTCGCCG CCAGACGTTG TGCGGTTTCG GCGGATCTTT CGTGAAGCTG AACAATTGAT GCAGTCGGTG GATTGTTTCA TAACGTCAAC CGACTATCTC GCTGCCGCGC TTACTGCTCA TTTCGGCATT CCGGCGTATG TCAATGCCAA TGCGCTGTCG CAGCAGGCGA TCATGCGCGC CGAGCCGTTT TATCGGCGGC GCGCGGCGGC GCCGCCGCGC GCTCCTGTGA CGCTGGGGTA CTTCAGCGGC TGGCCCAAAG CGCATGAATC GGACCTGGCG GTTGCGCTTC CGGCGGTGCG TCGGGCGCTT GATGCACTTC CGGGTGCGCG ATTGCGGATT GTTGGGCACT TTGAACGCAG CGCCCTGCCG GTCGATCTGC GCGAATGGGT TGAGATCGCG CCGTTCGTTC CGTATGAACG GCTCTTCGCG GAGATTGCGC GCGTGGATAT TAATCTCGCG CCGCTGGTCG ATAATCCGCA TCGTCGCGCA AAGAGCGCCG TAAAGTTCCT CGAAGCGGCG CTGGTCGGCG TGCCGACGGT CGCCAGCAAT CTGGAACCCT ACCGTCTGAT CGATCATGGG CGCACCGGCA TGCTGGCGGC GAACGAGGAA GAGTGGTATG CCGCCATTAT GGCGCTGGCG ACCGATCCGC TGCGCCGTCG CGCAATCGGA GATGCGGCGC GCAGGTATGT TCTCGAACAC GAAACGACAT CTGTGCGGGC GCCTGGATTT GCGAACCTGC TGCGTCATCT CATCGATACA CTTCCACTGA GATAA
Protein sequence | MKNLVRRALR RFGSYSEGYG AGWSLGGQPR ILFVSGMDGA PLRYRVLHQA EQIALAGGSW MLVRDTESRL RECVQQCDIL YLYKAGTTLQ ACEAVQTARR NALPVVYDTD DLNWDERLVE YCDLERYYSP PDVVRFRRIF REAEQLMQSV DCFITSTDYL AAALTAHFGI PAYVNANALS QQAIMRAEPF YRRRAAAPPR APVTLGYFSG WPKAHESDLA VALPAVRRAL DALPGARLRI VGHFERSALP VDLREWVEIA PFVPYERLFA EIARVDINLA PLVDNPHRRA KSAVKFLEAA LVGVPTVASN LEPYRLIDHG RTGMLAANEE EWYAAIMALA TDPLRRRAIG DAARRYVLEH ETTSVRAPGF ANLLRHLIDT LPLR
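The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes a single terminal stop codon. The function names `span_length` and `coding_codons` are hypothetical and used only for illustration.

```python
# Values copied from the record above.
START_BP = 4773406
END_BP = 4774560
GENE_LENGTH_BP = 1155
PROTEIN_LENGTH_AA = 384


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid-coding codons.

    Assumes the gene length covers whole codons and includes
    exactly one stop codon at the end.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the computed values with the lengths stated in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons agree with the Gene Length and Protein Length fields listed above.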